Morgan Money platform handles trading for money market funds and requires management of active client and operational queries, ensuring seamless support for the application. The team needs to build, enhance, and deliver secure, stable, and scalable technology solutions, drive system robustness and efficiency, maintain high service standards, and advance the platform's capabilities.
Requirements
- Hands on experience in troubleshooting on prod issues with SRE/DevOps roles, with a strong background in cloud infrastructure, automation, and CI/CD.
- Hands-on practical experience delivering system design, application development, testing, and operational stability
- Excellent scripting and automation skills (e.g., Python) and observability tools .
- Proficiency in automation and continuous delivery methods
- Proficient in all aspects of the Software Development Life Cycle
- Advanced understanding of agile methodologies such as CI/CD, Application Resiliency, and Security
- Demonstrated proficiency in software applications and technical processes within a technical discipline (e.g., cloud, artificial intelligence, machine learning, mobile, etc.)
Responsibilities
- Lead and mentor a team for troubleshooting on issues with Devops/SRE mindset , fostering a culture of collaboration, continuous improvement, and innovation.
- Design, implement, and manage scalable, secure, and highly available cloud infrastructure on GAP/GKP/AWS.
- Utilize Infrastructure as Code (IaC) tools such as Terraform to automate the provisioning and management of infrastructure resources.
- Implement and maintain observability solutions to monitor system performance, detect anomalies, and ensure uptime.
- Utilize tools such as Open Telemetry, Prometheus, Grafana, Observability stack, or similar to provide actionable insights and proactive issue resolution.
- Develop and maintain CI/CD pipelines to automate the build, test, and deployment processes.
- Implement automation scripts and tools to streamline operations and reduce manual intervention.
Other
- Formal training or certification on software engineering concepts and 5+ years applied experience
- In-depth knowledge of the financial services industry and their IT systems
- Practical cloud native experience
- Communicate effectively with stakeholders to provide updates on system status, incidents, and improvements.
- Lead incident response efforts, perform root cause analysis, and implement corrective actions to prevent recurrence.