Roche's Research and Early Development organisations at Genentech (gRED) and Pharma (pRED) need to leverage advances in AI, data, and computational sciences to accelerate drug discovery and development. Seamless data sharing and access to models across gRED and pRED are essential to maximizing these opportunities. The new Computational Sciences Center of Excellence (CoE) aims to harness the transformative power of data and Artificial Intelligence (AI) to assist scientists in delivering more innovative and transformative medicines.
Requirements
- Proven experience in designing, deploying, and managing infrastructure on Amazon Web Services (AWS), including services such as EC2, S3, RDS, EKS, SageMaker, etc.
- Strong proficiency with Git and Git repository management.
- Hands-on experience with Terraform for infrastructure provisioning and management.
- Experience with Helm for deploying and managing applications on Kubernetes.
- Proficiency in scripting languages (e.g., Python, Bash) for automation.
- Excellent problem-solving skills and a strong ability to debug complex issues.
- Experience applying software engineering best practices (automated linting, unit testing, documentation, CI/CD).
Responsibilities
- Design, implement, and maintain scalable and reliable ML infrastructure on AWS.
- Automate deployment, monitoring, alerting, and operational tasks using tools like Terraform and Helm.
- Manage and optimize CI/CD pipelines and Git repositories for ML projects, ensuring efficient version control to support collaboration and deployment.
- Collaborate closely with ML engineers and data scientists to understand their infrastructure needs and provide solutions.
- Troubleshoot and resolve infrastructure-related issues in a timely manner.
- Implement and enforce security best practices for ML infrastructure.
- Document infrastructure designs, processes, and operational procedures.
Other
- Contribute to initiatives both independently and as part of a team, delivering assigned outputs.
- Proactively identify issues and gaps, proposing ideas and suggestions for improvements.
- Strong communication and interpersonal skills for collaborating effectively with cross-functional teams and handling user-facing interactions.
- Demonstrated ability to take initiative, anticipate needs, and drive projects to completion.
- Ability to thrive in a fast-paced environment and adapt to evolving requirements while adhering to corporate guidelines.