phData is looking for a Machine Learning Solutions Architect to help global enterprises overcome their toughest data challenges by designing and implementing data solutions, providing thought leadership on technologies, and ensuring the performance, security, and scalability of machine learning models in production environments.
Requirements
- At least 6 years experience as a Machine Learning Engineer, Software Engineer, or Data Engineer
- Experience deploying machine learning models in a production setting
- Expertise in Python, Scala, Java, or another modern programming language
- The ability to build and operate robust data pipelines using a variety of data sources, programming languages, and toolsets
- Strong working knowledge of SQL and the ability to write, debug, and optimize distributed SQL queries
- Hands-on experience in one or more big data ecosystem products/languages such as Spark, Snowflake, Databricks, etc.
- Production experience in core data technologies (e.g. Spark, HDFS, Snowflake, Databricks, Redshift, & Amazon EMR)
Responsibilities
- Designing and implementing data solutions best suited to deliver on our customer needs — from model inference, retraining, monitoring, and beyond — across an evolving technical stack.
- Providing thought leadership by recommending the technologies and solution design for a given use case, from the application layer to infrastructure; and they have the team leadership and coding skills (e.g. Python, Java, and Scala) to build and operate in production; and to help ensure performance, security, scalability, and robust data integration.
- Design and create environments for data scientists to build models and manipulate data
- Work within customer systems to extract data and place it within an analytical environment
- Define the deployment approach and infrastructure for models and be responsible for ensuring that businesses can use the models we develop
- Reveal the true value of data by working with data scientists to manipulate and transform data into appropriate formats in order to deploy actionable machine learning models
- Partner with data scientists to ensure solution deployability—at scale, in harmony with existing business systems and pipelines, and such that the solution can be maintained throughout its life cycle
Other
- 4-year Bachelor's degree in Computer Science or a related field
- Excellent communication and presentation skills; previous experience working with internal or external customers
- A Master’s or other advanced degree in data science or a related field
- Relevant side projects (e.g. contributions to an open source technology stack)
- Remote-First Work Environment