Mastercard's BizOps team is seeking a Director, Site Reliability Engineering to lead their DevOps transformation, focusing on CI/CD pipeline development, automation, and best practices to improve the reliability and efficiency of their Real-Time Payments products.
Requirements
- Experience with algorithms, data structures, scripting, pipeline management, and software design.
- Ability to help debug and optimize code and automate routine tasks.
- Experience in one or more of the following is preferred: C, C++, Java, Python, Go, Perl or Ruby.
- For work on our dev ops team, engineer with experience in industry standard CI/CD tools like Git/BitBucket, Jenkins, Maven, Artifactory, and Chef.
- Experience designing and implementing an effective and efficient CI/CD flow that gets code from dev to prod with high quality and minimal manual effort is required.
- Deep knowledge of architecture techniques and tools.
- Basic to Intermediate understanding about Cloud – AWS, Azure, Or vmWare PCF.
Responsibilities
- Help us solve problems, build our CI/CD pipeline and lead Mastercard in DevOps automation and best practices.
- Contributes significantly to the engineering strategy for all platforms across multiple application suite and to the production support response strategy by identifying and developing platform improvement and process improvement opportunities.
- Troubleshoots applications and implements fixes to decrease time to resolution and minimize dependency on advanced vendor support.
- Mentors staff and provides assistance to team members as needed.
- Maintains skills consistent with the technology roadmap and implements tasks leveraging new technologies as needed.
- Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation and refinement.
- Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns
Other
- This is a people manager position, which will oversee the Alerting & Monitoring, Capacity Management, CI-CD, Agile, Production Support using SRE principles, ITIL practices like Incident Management, Change Management, Problem Management, and Eliminating toil through Automation best practices to deliver on a great customer experience & delight.
- We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.
- We need team members with an appetite for change and pushing the boundaries of what can be done with automation.
- Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
- Work with a global team spread across tech hubs in multiple geographies and time zones