Cloudglue is building foundational infrastructure that enables AI to understand videos for the first time. Our APIs let developers add video search, multi-video chat, and structured data extraction from video content, reliably and at scale, in just a few lines of code.
Requirements
- Strong CS fundamentals (algorithms, data structures).
- Database proficiency (SQL, query optimization).
- Full-stack web experience (TypeScript/React, Next.js, Supabase, Vercel).
- Python backend experience and familiarity with AI orchestration frameworks (LangGraph, LangChain, Temporal, etc.).
- Experience with vector databases (Pinecone, Weaviate, Milvus, pgvector).
- Cloud deployment knowledge (AWS/GCP, Docker/Kubernetes).
Responsibilities
- Build and ship features end-to-end across our stack (React/TypeScript frontend, Node/Python backend).
- Integrate, deploy, and optimize frontier multimodal AI models for video/audio understanding in production APIs.
- Create intuitive developer tools and UIs that bring video/audio insights to life.
- Contribute ideas, propose new features, own projects, and work closely with the founders in a fast-paced startup environment.
Other
- Excellent communication and collaborative mindset.
- UI/UX instincts for building developer-facing tools.
- US citizens or visa holders only.
- Driven, deeply curious student.