Leidos is seeking a High Compute Engineer to lead the design, optimization, and integration of GPU-centric high-performance compute environments to support advanced compute initiatives where performance, stability, and future-readiness are critical.
Requirements
- 5+ years experience supporting GPU compute environments in mission-critical or enterprise settings.
- Proficiency with NVIDIA technologies: A100, DGX-1, CUDA, cuDNN, NCCL.
- Strong background in Linux (RHEL/CentOS/Ubuntu), kernel tuning, and HPC stack deployment.
- Experience with containerized GPU workloads using Docker, Kubernetes, and NVIDIA GPU Operator.
- Familiarity with distributed compute frameworks (e.g., SLURM, Kubernetes, Ray).
- Strong scripting skills: Bash, Python, or similar.
- Candidate must, at a minimum, meet DoD 8570.11- IAT Level II certification requirements (currently Security+ CE, CCNA-Security, GICSP, GSEC, or SSCP along with an appropriate computing environment (CE) certification). An IAT Level III certification would also be acceptable (CASP+, CCNP Security, CISA, CISSP, GCED, GCIH, CCSP).
Responsibilities
- Manage, optimize, and monitor existing high-performance GPU systems including NVIDIA A100s and DGX-1 platforms.
- Architect integration plans for scaling GPU compute infrastructure, including newer platforms (e.g., H100, Grace Hopper, AMD Instinct).
- Collaborate with data science teams to fine-tune GPU workloads for AI/ML pipelines.
- Design and implement high-speed networking (InfiniBand/RDMA) and storage solutions optimized for GPU data flow.
- Develop automation workflows using infrastructure-as-code (IaC) tools (e.g., Ansible, Terraform, SaltStack).
- Ensure system security, compliance, and patch management in alignment with NIST, RMF, or agency-specific controls.
- Analyze compute performance metrics and provide strategic recommendations for system enhancements.
Other
- 100% on-site position. All work must be performed at the customer site in Bethesda at the Intelligence Community Campus.
- Active TS/SCI clearance with Polygraph required OR active TS/SCI and willingness to obtain and maintain a Poly.
- US Citizenship is required due to the nature of the government contracts we support.
- Bachelor's or higher degree in Computer Engineering, Computer Science, or a related field with at least 12 years of related technical experience. Additional years of experience may be considered in lieu of a degree.