Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Basis Research Institute Logo

Software Engineer, Infrastructure

Basis Research Institute

Salary not specified
Nov 23, 2025
New York, NY, US
Apply Now

Basis is looking to build the infrastructure that accelerates research and enables commercial deployment of Basis innovations. This includes creating reliable training and evaluation infrastructure, managing compute resources, developing SaaS platform offerings, and building the technical foundation for internal research and external customers.

Requirements

  • Building ML training or inference infrastructure for distributed systems
  • Developing cloud platforms or services used by multiple teams or customers
  • Creating developer tools, CI/CD systems, or deployment automation at scale
  • Contributing to infrastructure open-source projects or technical systems with high reliability requirements
  • Possess deep understanding of distributed systems principles including consistency, availability, fault tolerance, scalability patterns, and performance optimization for high-throughput, low-latency workloads.
  • Have hands-on experience with cloud platforms (AWS, GCP, Azure) including compute orchestration, storage systems, networking, and cost optimization strategies.
  • Be proficient in infrastructure technologies including Kubernetes, Docker, infrastructure as code (Terraform), CI/CD pipelines, monitoring and observability (Prometheus, Grafana), and modern DevOps practices.

Responsibilities

  • Design and build ML training infrastructure supporting medium-scale models with distributed training across GPU clusters, experiment tracking, checkpoint management, and reproducible pipelines.
  • Develop SaaS platform and API offerings that package Basis research innovations into commercial products, including backend services, API design, authentication, rate limiting, and customer-facing features.
  • Manage compute infrastructure as it scales, including capacity planning, resource allocation, cost optimization, cloud and on-premise orchestration, and efficiency monitoring.
  • Build developer tools and workflows that accelerate research velocity including CI/CD pipelines, testing frameworks, deployment automation, and development environment management.
  • Implement monitoring and observability providing comprehensive visibility into system health, performance, costs, and research progress through metrics, logging, alerting, and dashboards.
  • Ensure system reliability and scalability by designing fault-tolerant architectures, implementing graceful degradation, conducting load testing, and establishing SLAs appropriate for research and production workloads.
  • Collaborate with research teams to understand infrastructure needs, translate experimental techniques into scalable systems, and provide technical consultation on architecture and performance.

Other

  • We are in the office four days a week. Be prepared to attend multi-day Basis-wide in-person events.
  • Location: New York City.
  • Basis is a collaborative effort, both internally and with our external partners; we are looking for people who enjoy building infrastructure for problems larger than ones they can tackle alone.
  • Contribute to the culture and direction of Basis by modeling technical excellence, operational discipline, and focus on enabling high-impact research and commercial applications.
  • Exceptional candidates who may not meet all of the following criteria are still encouraged to apply.