Apollo is seeking a Staff Software Engineer to enhance its MCP Server and architect the MCP Gateway. The goal is an enterprise-grade service that exposes diverse tools and systems to AI agents, manages request routing and policy enforcement, and provides a runtime foundation for scalable multi-agent systems.
Requirements
- Expertise in agent-to-tool orchestration, routing, and coordination in scalable, fault-tolerant systems.
- Strong background in distributed systems, server architecture, and high-performance backend development.
- Proven experience with protocol design, message routing, and building server-side orchestration frameworks that enable scalable, reliable multi-tool agent workflows.
- Experience building and maintaining robust runtime infrastructure that supports AI-driven workflows and enables reliable agent-to-tool interactions.
- Hands-on experience with observability, monitoring, and debugging frameworks for complex systems.
- Experience in strategic system design, making architectural trade-offs, and planning for long-term scalability and maintainability.
Responsibilities
- Architect and scale an enterprise AI/MCP Server and Gateway that powers multi-agent workflows across Apollo, including routing, orchestration, and integration boundaries.
- Design and implement robust server infrastructure to ensure reliability, performance, and security at scale.
- Build and maintain tools for agent discovery, communication, and coordination.
- Define deployment strategies and runtime optimizations to maximize efficiency and minimize operational overhead.
- Develop frameworks and patterns that enable seamless multi-agent collaboration and AI-driven orchestration.
- Integrate observability, logging, and monitoring for full visibility into server and agent behavior.
- Explore and implement AI-enhanced developer workflows to optimize orchestration and agent interactions.
Other
- Strong technical leadership and mentorship, including guiding junior engineers and driving engineering best practices across teams.
- Ability to influence cross-team architecture decisions and align engineering efforts with product and business objectives.
- Production ownership experience: leading incident response, debugging, and performance optimization in high-impact backend systems.
- This is a remote position open to candidates based anywhere in the United States or Canada.
- Apollo is proud to be an equal opportunity workplace dedicated to pursuing and hiring a talented and diverse workforce.