Letta is building the AI Operating System to turn stateless models into perpetual and self-improving intelligence, empowering developers to build state-of-the-art LLM agents.
Responsibilities
- R&D of the core agent framework, including context management techniques used to prompt the LLMs within the agent framework
- Prototyping new context management and memory systems (e.g. better state→prompt compilers, more complex memory systems, multi-threading of multiple LLM processes, improved planning for multi-step reasoning, etc.)
- Development of the agent interaction loop, encompassing tool execution, parsing, and more
- R&D of LLM serving methods to improve serving agent workloads (e.g. constrained decoding and prefix caching)
- Model evaluation and finetuning, such as finetuning state-of-the-art open-weights models on agent data traces (which we plan to release publicly as free models on HuggingFace)
- Writing and publishing technical blog posts and whitepapers
Other
- This role is in-person (no hybrid), 5 days a week in downtown San Francisco.
- You want to be an integral part of turning a tiny startup into a trillion-dollar company.
- You oppose closed frontier AI controlled by a few private tech companies.
- At Letta, everyone on the team engages directly with our customers and works across the stack.
- Paid in-person work trial (2 days onsite in SF)