IBM is seeking to solve some of the world's most challenging problems in IT systems management and reliability engineering, specifically in maintaining and supporting enterprise IT systems, including servers, virtual machines, and cloud environments.
Requirements
- Basic understanding of operating systems (Linux, Unix, Windows, z/OS, zVM) and networking fundamentals.
- Familiarity with scripting languages (e.g., Python, Bash).
- Exposure to cloud platforms (IBM Cloud, AWS, Azure).
- Interest in SRE principles such as automation, observability, and fault tolerance.
- Familiarity with monitoring tools (e.g., Grafana, Prometheus).
- Experience with version control systems (e.g., Git) and CI/CD concepts.
Responsibilities
- Assist in maintaining and supporting enterprise IT systems including servers, virtual machines, and cloud environments.
- Monitor system performance and help identify reliability issues.
- Support automation efforts for routine administrative tasks using scripting tools.
- Participate in incident response and post-incident analysis.
- Help document system configurations, operational procedures, and reliability metrics.
- Collaborate with cross-functional teams on infrastructure and service delivery
Other
- High School Diploma/GED
- Currently pursuing a degree in Computer Science, Information Technology, or a related field.
- Bachelor's Degree (Preferred)
- Effective communication and collaboration abilities.
- Strong analytical and problem-solving skills.