Morgan Stanley is looking to solve the problem of ensuring the operational reliability of deployed software and implementing strategies to optimize performance and minimize downtime in their production environment.
Requirements
- Proficiency with Linux
- Understanding of agile methodologies (Scrum, Kanban)Use of agile facilitating toolsets (Rally, JIRA)
- Thorough understanding of SRE concepts and principles
- Familiarity with SDLC processes and management tools (Jira/GIT/Stashblue)
- Network diagnostic skills and experience with networks and realtime messaging technologies (multicast, TCP/IP, UDP, SNMP)
- Strong scripting skills, like, Python, Jscript or UNIX shell
- Understanding of electronic and/or algorithmic trading systems
Responsibilities
- Monitor and respond to user-reported issues as well as infrastructure alerts promptly and professionally; ensure issues are tracked through to resolution.
- Ensure efficient incident management, ensuring accurate communication to impacted groups and timely resolution.
- Facilitate root cause investigations and manage the implementation of corrective and preventative measures.
- Manage coverage during Asian and European market hours, including weekend pre-open ready-for-business checks.
- Proactively identify and respond promptly to failures.
- Partner with development teams to drive stability, operational excellence, and a culture of efficiency.
- Review, execute, and verify production changes in strict accordance with procedures defined in change documents.
Other
- Bachelor's degree in Computer Science or related field from an accredited college or university
- Excellent spoken and written English communication skills.
- Strategic mindset with specific focus on tooling, automation, and efficiency
- Able to troubleshoot, problem solver, analytical
- Expected base pay rates for the role will be between $90,000 and $135,000 per year at the commencement of employment.