Prophecy is the world’s most advanced data integration platform, designed to make complex data work simple and powerful. Built natively for modern cloud data platforms, Prophecy serves the needs of business analysts while ensuring enterprise-grade governance and compliance, giving organizations the confidence to innovate. Our AI-driven platform simplifies and automates data preparation, accelerating AI and analytics across Fortune 500 companies in banking, insurance, healthcare, life sciences, and technology. By enabling secure, self-service data transformation within a robust governance framework, Prophecy empowers every data user to develop, deploy, and monitor cloud-native data pipelines with ease.
Requirements
- Advanced ML/LLM knowledge: hands-on fine-tuning (SFT, DPO, RLHF), distillation, quantization/pruning for model optimization.
- Full ML lifecycle: from data acquisition to training to productization with PyTorch and open-source transformers (e.g., Hugging Face).
- Fluency in Python for dataset/model processing.
- Experience developing backend pipelines and microservices in the public cloud: AWS, Kubernetes, Java, Scala.
- Builder mentality: experience taking ideas to production.
- ML/LLM work in code generation (e.g., Codex, text-to-SQL), semantic extraction, or knowledge graphs (e.g., Neo4j, Neptune).
- Optimization of ML models for low-latency, high-throughput production use.
Other
- 1+ years in industry
- Opportunity to leave your mark on an innovative platform
- End-to-end ownership of your projects