GT, a leading multi-physics CAE simulation software provider, is looking for a DevOps and GenAI Backend Developer to design, implement, and maintain the backbone for their Generative AI services, ensuring reliability and scalability, and to contribute DevOps expertise to other areas like their modeling environment and cloud computing services.
Requirements
- Strong hands-on experience in managing and deploying web applications and cloud architecture on AWS. Ability to create Cloud Architecture for a given web-based product from scratch with focus on auto-scaling, load balancing, data migration and cost optimization.
- Proven track record of deploying and maintaining Machine Learning (ML) / Large Language Model (LLM) applications in production environment with active users.
- Minimum of 3 years’ experience and proficiency in Python for automating pipelines, integrating AI workflows, and optimizing deployments.
- Comfortable operating within environments that enforce strict access controls and compliance-driven workflows. Contribute to ensure systems remain resilient and protected.
- Basic understanding of Generative AI concepts (RAG, agents, data processing, prompt engineering); ability to work with engineers on prompt and pipeline improvements. Knowledge of GPU infrastructure management, token usage optimization, and scaling strategies for API-based LLMs
- Proficiency with tools dedicated to automated deployment like CloudFormation Templates / Terraform / Kubernetes yaml files / Ansible. (Preferred tool: CloudFormation Templates, Ansible)
- Can manage structured/unstructured data on platforms such as DynamoDB, PostgreSQL, or similar
Responsibilities
- Perform DevOps activities including cloud architecture definition, deployments, CI/CD pipelines enhancement, maintenance and system monitoring for Generative AI applications, and other cloud-based software.
- Create, maintain, and update architecture as code and pipeline scripts used to deploy products and their cloud architecture.
- Operate a wide range of cloud services deployed on Amazon Web Services (i.e. Amazon Bedrock, Amazon Cognito, EC2, ECS, SES, ELB)
- Work on hardening security of cloud-based environment (architecture and software) to match compliance with strong information security standards (like TISAX or ISO 27001)
- Troubleshoot incidents and coordinate with developers of products involved to provide support and fixes.
Other
- BS/MS in Computer Science, Engineering, AI, or related field / equivalent professional experience.
- Experience working closely with developers to manage staging environments, troubleshoot and debug with them to understand the inner workings of various products.
- This role will require you to be in the office 60% of the time.
- U.S. citizenship is required for this position, as the selected candidate will need to obtain and maintain a U.S. Department of Defense security clearance.