Crusoe is looking to accelerate the abundance of energy and intelligence by crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.
Requirements
- Advanced degree in Computer Science, Engineering, or a related field.
- Demonstrable experience in distributed systems design and implementation.
- Expertise in using cloud-based services, such as, elastic compute, object storage, virtual private networks, managed database, etc.
- Experience in Generative AI (Large Language Models, Multimodal).
- Familiarity with AI infrastructure, including training, inference, and ETL pipelines.
- Experience with container runtimes (e.g., Kubernetes) and microservices architectures.
- Experience using REST APIs and common communication protocols, such as gRPC.
Responsibilities
- Lead the design and implementation of core AI services, including resilient fault-tolerant queues for efficient task distribution.
- Build and scale infrastructure to handle millions of API requests per second.
- Optimize AI inference performance on GPU-based systems.
- Implement robust monitoring and alerting to ensure system health and availability.
- Collaborate closely with product management, business strategy, and other engineering teams.
- Influence the long-term vision and architectural decisions of the AI platform.
- Contribute to open-source AI frameworks and participate in the AI community.
Other
- Proactive and collaborative approach with the ability to work autonomously.
- Strong communication and interpersonal skills.
- Passion for building cutting-edge AI products and solving challenging technical problems.
- Advanced degree in Computer Science, Engineering, or a related field.
- Paid Parental Leave