Develop enterprise-grade data platforms, services, and pipelines for Steampunk's clients
Requirements
- Python
- Big data tools: Hadoop, Spark, Kafka, etc.
- Relational (SQL) and NoSQL databases, including Postgres and Cassandra
- Data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc. (a minimal Airflow sketch follows this list)
- AWS cloud services: EC2, EMR, RDS, Redshift (or Azure equivalents)
- Data streaming systems: Storm, Spark Streaming, etc.
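By way of illustration, here is a minimal sketch of the workflow orchestration the pipeline-tooling requirement refers to, assuming Airflow 2.x; the DAG id, schedule, and task callables are placeholders, not details from the posting.

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def extract():
    # Placeholder: pull raw data from a source system (API, database, S3, ...).
    pass


def load():
    # Placeholder: write transformed data to the target store (Redshift, S3, ...).
    pass


# A daily two-step pipeline: extract, then load.
with DAG(
    dag_id="daily_ingest",            # hypothetical DAG name
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)

    extract_task >> load_task         # run extract before load
```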
Responsibilities
- Lead and architect the migration of data environments, with an emphasis on performance and reliability
- Assess and understand the ETL jobs, workflows, BI tools, and reports
- Address technical inquiries concerning customization, integration, enterprise architecture, and the general features and functionality of data products
- Support an Agile software development lifecycle
- Contribute to the growth of our Data Exploitation Practice
- Craft database and data warehouse solutions in the cloud (preferably AWS; alternatively Azure or GCP), as in the sketch after this list
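As a hedged illustration of the cloud data warehousing responsibility above, the following PySpark sketch reads raw CSV landed in S3 and writes partitioned Parquet for downstream warehouse loads; the bucket names, paths, and column names are hypothetical and not taken from the posting.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

# Assumes a Spark runtime (e.g., EMR) with S3 access already configured.
spark = SparkSession.builder.appName("orders-daily-load").getOrCreate()

# Hypothetical raw landing zone: daily CSV drops from an upstream system.
raw = (spark.read
       .option("header", "true")
       .csv("s3a://example-raw-bucket/orders/"))

# Light cleanup plus a partition column for efficient downstream queries.
curated = (raw
           .withColumn("order_ts", F.to_timestamp("order_ts"))
           .withColumn("order_date", F.to_date("order_ts"))
           .dropDuplicates(["order_id"]))

# Write columnar, partitioned output that Redshift Spectrum, Athena, or a
# COPY-based warehouse load can consume.
(curated.write
 .mode("overwrite")
 .partitionBy("order_date")
 .parquet("s3a://example-curated-bucket/orders/"))

spark.stop()
```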
Other
- Ability to hold a position of public trust with the US government
- 5-7 years of industry experience coding commercial software and a passion for solving complex problems
- 5-7 years of direct experience in Data Engineering
- Experience working in an Agile environment
- Excellent communication and customer service skills