Centria's AI Operations Engineer will be the technical expert responsible for the day-to-day health and long-term stability of our AI agents. They will play a key role in keeping our AI solutions available, secure, and performing well.
Requirements
- Familiarity with machine learning frameworks
- Strong understanding of fundamental AI/ML concepts, including model training, evaluation, and deployment
- Experience with cloud platforms such as AWS, Google Cloud, or Azure
- Knowledge of databases (SQL, NoSQL) and data processing tools
- Familiarity with large language models (LLMs) and voice AI
- Demonstrated ability to integrate, maintain, and troubleshoot a variety of third-party systems and APIs
- Hands-on experience with Google Cloud or Azure
Responsibilities
- Manage and update integrations with critical third-party services, including VAPI, Twilio, OpenAI, Azure Kubernetes Service (AKS), Gemini, and Google Workspace, to ensure continuous functionality and performance.
- Act as a first responder for our AI solutions: triage, diagnose, and resolve errors and incidents, often under tight deadlines.
- Proactively identify system weaknesses and develop fault-tolerant, self-healing systems to minimize downtime and manual intervention.
- Collaborate with development teams to revise and optimize generative prompts, improving the quality and relevance of our AI agents' outputs.
- Enforce compliance with data security policies, including strict adherence to HIPAA regulations, across all AI systems and data flows.
- Actively monitor the health and performance of AI systems, models, and infrastructure. Proactively identify potential issues and implement preventative measures to ensure system reliability and uptime.
- Provide on-call support during the AI agents' operational windows, which may extend beyond typical business hours, to ensure uninterrupted service for our clients.
Other
- Bachelor's degree in Information Technology, Computer Science, or a related discipline
- Compliance with Centria's Code of Conduct, policies and procedures, and federal and state laws
- Responsibility to report violations of Company policies or the Code of Conduct
- Ability to produce high-quality, comprehensive, and user-friendly technical documentation
- Ability to follow written instructions