Developing tools for continuous deployment of a cloud-based networking service platform and operating/monitoring the service for a large scale environment
Requirements
- Strong grasp and practical experience with automation and virtualization of at least one cloud platform (e.g. VMWare, OpenStack, Cloudstack)
- Strong grasp and practical experience with at least one automation framework (e.g. Puppet, Chef, CFEngine)
- Expertise in building/maintaining/sustaining monitoring systems like: nagios, zabbix, icinga, xymon, opennms, whatsupgold, etc.
- Strong with at least one scripting and/or high level language: Python (preferred), Java, Shell, Perl, Golang
- Production experience working with load balancers and web stack based application environments
- Solid grasp of operating and managing NoSQL DB such as Cassandra, Hadoop
- Familiarity with messaging systems like ActiveMQ, RabbitMQ, and Kafka
Responsibilities
- Development and design of systems, with a focus on automation and maintainability at a large scale
- Software and hardware testing and deployment to staging and production environments
- Platform integration: Programmatic infrastructure management
- Develop software infrastructure and build tools useful in development lifecycle
- Comfortable with collaboration, open communication and reaching across dev, qa, monitoring and other cross functional teams
- Systems and application monitoring and performance tuning
- Production experience working with load balancers and web stack based application environments
Other
- Bachelor’s degree in computer science or related field
- Master’s preferred
- Self-motivated, with excellent oral and written communication skills
- 4+ years as Systems Administrator/Engineer, DevOps Engineer or platform developer in a mid to large Linux environment
- Collaboration, open communication and reaching across dev, qa, monitoring and other cross functional teams