Meta is seeking to improve the reliability of its data center operations by identifying and managing asset reliability risks that could adversely affect data center operations.
Requirements
- 10+ years of experience in reliability engineering (related to electrical or mechanical cooling equipment)
- Experienced in Reliability Centered Maintenance (RCM)and Failure Maintenance Effect Analysis activities for maintenance /process/equipment design optimization to meet reliability requirements
- Proficient in usage of EAM solutions to extract data and develop meaningful insights
- Certifications in Maintenance & Reliability such as CMRP, CRL, CRE
- Knowledgeable of relevant ISO standards (ISO 14224, ISO 17359, ISO 55000)
- Experience with data center equipment such as critical cooling systems, generators, main switchboards, network gear
- Proficient in data analysis techniques that can include Process Control, Reliability modeling and prediction, Fault Tree Analysis, Weibull Tree Analysis, Six Sigma (6σ) Methodology
Responsibilities
- Support the asset care and maintenance strategies for critical assets based on Meta Processes
- Support the development of standards, guidelines and processes to execute reliability program function
- Lead and facilitate asset criticality assessments, RCM studies, PM Optimization and other reliability studies
- Perform reliability analytics include Weibull distribution, Monte Carlo simulation and other reliability analysis
- Act as liaison between Reliability and other partner teams (AM, Quality, SSU, Retrofits)
- Support the development of standardized PM template to facilitate trending
- Works with appropriate technical teams to evaluate reliability and maintainability of data center equipment to significantly influence reliability and maintainability improvements
Other
- Bachelor’s degree in Mechanical, Electrical Reliability Engineering or similar technical discipline
- Ability to manage stakeholders spread across time zones
- Ability to work with Maintenance to analyze asset characteristics, including: asset availability, overall equipment effectiveness, remaining useful life
- Ability to provide technical support to Operations, Maintenance management, and technical personnel
- Individual compensation is determined by skills, qualifications, experience, and location