Frontera Health is revolutionizing pediatric healthcare by developing a cutting-edge, tech-enabled platform that delivers essential therapies to rural families. Our platform leverages AI/ML to create a robust video-based data model for early intervention and developmental disorders. By collaborating closely with parents, caregivers, and clinical partners, we’re bridging the gap in access to care, improving health equity, and providing personalized treatment plans.
Requirements
- 7+ years of experience as a Data Engineer or Backend Engineer, with strong contributions to pipeline design, data modeling, and backend infrastructure.
- Proven success in leading greenfield data initiatives and building from scratch in fast-paced, startup environments.
- Expertise in ingesting and transforming unstructured data for use in ML models or analytics workflows.
- Strong programming skills in Python, SQL, and experience with frameworks like Airflow, Spark, Kafka, or cloud-native ETL tools.
- Deep understanding of storage solutions across structured, semi-structured, and unstructured data types (e.g., PostgreSQL, NoSQL, S3, object stores, search indices).
- Familiarity with API development and experience designing secure, performant data access layers.
- Practical knowledge of HIPAA compliance, secure data handling, RBAC, encryption, and audit requirements.
Responsibilities
- Design and evolve the foundational data model, ensuring consistency and alignment across ML, engineering, analytics, and product teams.
- Lead greenfield data initiatives from architecture through implementation, shaping the future of our data stack and practices.
- Ingest and process diverse, unstructured data (e.g., audio, video, clinical notes, PDFs), surfacing clinically meaningful information for downstream ML and analytics workflows.
- Build and maintain data ingestion pipelines connecting third-party tools, internal systems, cameras, microphones, and file-based sources.
- Identify and implement scalable data storage solutions optimized for various use cases—text, media, structured data, logs, etc.
- Build and support partner-facing APIs that expose data securely, ensuring alignment with product use cases and regulatory standards.
- Collaborate deeply with ML, engineering, product, analytics, and clinical teams to scale complex pipelines, support model training & evaluation, and enable data-driven product development.
Other
- Impactful Mission: Work on challenging and meaningful projects that leverage cutting-edge technologies (AI/ML) to improve pediatric healthcare in underserved communities.
- Growth & Innovation: Be at the forefront of innovation, collaborating with a talented and passionate team in a fast-paced, dynamic environment.
- Professional Development: Join a culture that values mentorship, learning, and continuous improvement.
- Global Collaboration: Engage with team members around the world, broadening your perspective and fostering diverse ideas.
- Make a Difference: Help shape the future of behavioral healthcare and positively influence the lives of children and families in rural communities.