Transforming the insurance industry by building innovative software and services, and creating a place where amazing career moments are made possible at Applied
Requirements
- Practical experience supporting AI/ML and LLM workflows by building data infrastructure, managing unstructured data, and orchestrating secure, cloud-based data pipelines using CI/CD and workflow automation practices
- 4+ years of cloud-based experience focused on data modeling, building, and maintaining data solutions
- Experience with Python to build high-performance, maintainable, and scalable codebases, leveraging advanced language features, libraries, and best practices for data engineering applications
- Experience working with Infrastructure-as-Code (IaC) and cloud native data solutions, such as BigQuery, Spark, Pub/Sub, and Object Storage
- Proficiency in SQL and developing high-level code with languages such as Python or Scala to manipulate, store, manage, or retrieve data assets
- Knowledge of data modeling, data warehousing concepts, and data architecture
- Advanced understanding and practice of supporting AI/ML and LLM workflows by building data infrastructure, managing unstructured data, and orchestrating secure, cloud-based data pipelines using CI/CD and workflow automation practices
Responsibilities
- Implement and maintain scalable data pipelines to support downstream AI/ML/LLM workflows including but not limited to data labeling, classification, and document parsing
- Work with Data Scientists and Data Labeling teams to ensure reliable access to structured and unstructured data sources used in AI/ML and LLM workflows
- Manage and optimize data storage, partitioning, and clustering strategies to ensure high performance and reliability of our data infrastructure
- Develop and implement features and enhancements leveraging your SQL and ETL expertise, and cloud-based data warehousing technologies
- Collaborate with cross-functional teams to understand AI data requirements and deliver solutions aligned with business objectives, security requirements, and guidelines for data governance
- Develop documentation for the team to support design discussions
- Ensure data integrity and quality by implementing robust data validation and error-handling mechanisms to prevent data corruption
Other
- Bachelors or Masters degree in Computer Science, MIS, or CIS, or equivalent experience
- Communication experience with global team members to confirm requirements, priorities, and plans for delivery within committed timelines
- Ability to work remotely or from an Applied office
- Medical, Dental, and Vision Coverage
- Holiday and Vacation Time
- Health & Wellness Days