We are seeking a Sr. Software Engineer in Columbia, MD to join a full-stack LLM integration and delivery team. You will architect and develop the systems that power cutting-edge LLM applications, ensuring they perform reliably at enterprise scale while enabling rapid iteration and deployment.
Requirements
- 12+ years of experience in software engineering with a focus on scalable systems.
- Strong full-stack development experience with user-facing applications.
- Strong programming skills in languages such as Python, Go, or Java.
- Extensive experience with cloud platforms (e.g., AWS, GCP, Azure) and their services.
- Proficiency in containerization technologies (Docker, Kubernetes).
- Experience with infrastructure-as-code tools (e.g., Terraform, Ansible, Puppet).
- Expertise in monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack).
Responsibilities
- Lead the design and development of scalable LLM-powered applications and services.
- Architect infrastructure solutions that support rapid iteration and deployment of AI features.
- Build and maintain the platforms that enable your team to ship AI features quickly and reliably.
- Develop and manage automation tools to improve system reliability and development efficiency.
- Implement and maintain monitoring, alerting, and logging systems.
- Conduct capacity planning and performance tuning for AI workloads.
- Lead and participate in incident response and post-mortem analyses.
Other
- Collaborate directly with product teams to translate user needs into technical solutions.
- Mentor junior team members and contribute to the overall growth of the engineering team.
- Continuously identify and implement improvements to our systems and development processes.
- Experience building products that prioritize user experience and product-market fit.
- Active TS/SCI clearance with a polygraph.