Cerebras Systems is looking to solve the problem of delivering industry-leading training and inference speeds for machine learning applications by building the world's largest AI chip, 56 times larger than GPUs, and empowering machine learning users to effortlessly run large-scale ML applications
Requirements
- Hands-on experience using lab equipment (oscilloscopes, logic analyzers, etc.)
- Experience with reading schematics, board layouts, and debugging electrical/mechanical issues
- Strong scripting skills (Python preferred) and experience working in Linux environments
- Familiarity with telemetry systems, sensor data, and reliability metrics
- Exposure to manufacturing test environments and data analysis workflows
- Experience with embedded systems with an emphasis on debugging complex issues
- Use scripting tools (e.g., Python) to automate tests and data collection workflows
Responsibilities
- Rapidly ramp up on electrical, thermal, and mechanical aspects of our wafer-scale systems
- Own and execute system bring-up, hardware integration, and validation test plans
- Isolate and debug complex failures across HW/SW boundaries; drive root cause analysis and corrective action
- Collaborate cross-functionally with silicon, hardware, software, validation, and manufacturing teams to resolve issues
- Analyze test results and provide actionable recommendations to improve design and processes
- Participate in design reviews and contribute design-for-test ideas for future products
- Interface with vendors, suppliers, and internal engineering stakeholders to ensure system-level performance and reliability
Other
- Bachelor’s degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field
- 4–7 years of industry experience in hardware validation, system bring-up, or integration roles
- Excellent analytical, troubleshooting, and communication skills
- Ability to work in a fast-paced, collaborative environment
- Commitment to creating an equal and diverse environment