Benchling is looking to unlock the power of biotechnology by bringing modern software to modern science, and the Data Platform engineer will play a key role in this mission by building the next generation of Data Platform services.
Requirements
- Experience with data processing technologies Kafka, Kinesis, Spark, Flink, or other open-source or commercial software
- Experience designing, building, and maintaining scalable, distributed systems
- Strong experience with scripting language (such as Python)
- Experience with deployment and configuration management frameworks such as Terraform, Ansible, or Chef and container management systems such as Kubernetes or Amazon ECS.
Responsibilities
- Own projects end-to-end, from initial design, to prototype, to large-scale rollout.
- Build & operate high throughput distributed messaging platform like Kafka/kinesis to enable data change capture and data integration across Benchling.
- Define and design data transformations and pipelines for cross-functional datasets, while ensuring that data integrity and data privacy are first-class concerns regarded proactively, instead of reactively.
- Define the right Service Level Objectives for the batch & streaming pipelines, and optimize their performance.
- Designing and creating CI/CD pipelines for platform provisioning, full lifecycle management. Building the platform control panel to operate the fleet of systems efficiently.
- Work closely with the team across Application and Platform to establish best practices around usage of our data platform.
Other
- Have 6+ years of experience or a proven track record in software engineering
- Driven by creating positive impact for our customers and Benchling's business, and ultimately accelerating the pace of research in the Life Sciences
- Comfortable with complexity in the short term but can build towards simplicity in the long term
- Strong communicator with both words and data - you understand what it takes to go from raw data to something a human understands
- Employees are expected to be on-site 3 days per week (Monday, Tuesday, and Thursday)