Rilla builds the world’s leading conversation intelligence software for offline commerce. By recording, analyzing, and indexing conversations, our platform supplies unparalleled alpha to the critical, physical businesses that constitute the backbone of our economy. Our purpose is to bring the technology of the 21st century’s most innovative companies to the rest of the economy.
Requirements
- Experience building and deploying ML systems in production
- Familiarity with owning the model lifecycle from inception to production.
- We work primarily in Python and TypeScript, with FastAPI for API development
- AI/ML technologies including PyTorch, OpenAI APIs, Baseten, and LiteLLM
- Cloud infrastructure with AWS and LiveKit for real-time communications
- PostgreSQL, Redis, and S3 for data storage
Responsibilities
- architect and ship AI-powered systems that make it possible for people to talk to Rilla like they would a human
- build agents that operate natively on real-world audio
- extract insights from conversations no one else can even access
- push the boundaries of what’s possible in AI
- shape the foundations of our AI stack
- drive the invention of new techniques, tooling, and models
- work across the full AI lifecycle—from data acquisition to real-time inference and user-facing chat interfaces
Other
- Comfort working directly with customers to understand their needs and solve real-world problems
- Building an AI product for an industry untouched by modern software
- Constantly talking to and visiting customers in the field
- Working ~70 hrs/week in person with some of the most ambitious people in NYC
- Attempting to build a generational company, and the intensity required to do so