Maintain complex capacity utilization standards across Truist's infrastructure.
Requirements
- Ability to operate, evaluate, manage large-scale, distributed systems while understanding interdependencies of competing constraints between various technologies.
- Strong experience with virtualization including hyperconverged technologies, VMWare, Nutanix with emphasis on performance utilization.
- Experience with tools such as Kubernetes, Docker, OpenShift.
- In-depth knowledge of major cloud providers and service offering from AWS, Azure, and Google Cloud.
- Proficiency in performance and utilization aspects of network operations including routing, switching, load balancing, firewalls and edge networks.
- Solid experience with large-scale database systems including MSSQL, Oracle / Exadata, Postgres, Mongo.
- Ability to design and implement scalable capacity models and forecasting tools for compute, storage, and network infrastructure.
Responsibilities
- Continuously monitor network, server, and storage utilization to identify potential bottlenecks, performance issues, and resource shortages.
- Forecast future resource needs based on business goals and growth, ensuring the infrastructure can handle increasing demands.
- Tune servers, networks, and applications to maximize efficiency and ensure optimal resource allocation.
- Design and implement scalable IT solutions, including public cloud infrastructure, to accommodate future growth.
- Diagnose and resolve complex infrastructure issues, provide technical expertise, and serve as an escalation point for capacity problems.
- Manage and configure physical and virtual servers, data storage, and network devices to meet capacity standards.
- Design, implement, and manage for capacity & utilization cloud-based infrastructure solutions and services.
Other
- This job is on-site 4 days per/week
- English (Required)
- Excellent written and verbal communication skills for planning and troubleshooting with diverse teams and stakeholders.
- Experience working effectively with engineering, operations, business teams, and vendors.
- Strong analytical and problem-solving skills to make data-driven decisions under uncertainty.