Fullthrottle.ai is seeking a Data Engineer to manage ETL processes, translate business requirements into data solutions, and contribute to AI data preparation and strategy.
Requirements
- Python: Strong proficiency is required. Familiarity with libraries like Pandas, NumPy, Scikit-learn, TensorFlow, Keras, PyTorch, and potentially others depending on the domain (e.g., OpenCV for image processing, NLTK/SpaCy for NLP).
- SQL: Excellent skills in writing and optimizing complex SQL queries for data extraction, manipulation, and analysis from relational databases.
- Hands on experience of the various AWS Database & ETL services, particularly: S3, Python code, Lambda, Cloud Formation and other AWS serverless resources.
- Solid understanding of various ML algorithms (supervised, unsupervised, reinforcement learning).
- Familiarity with ML frameworks and libraries mentioned above.
- 3+ years min experience in a similar ETL role is essential.
- Exposure to AI ML models and AI data solutions delivery a major plus.
Responsibilities
- Manage ETL process, translate complex business requirements into scalable and efficient data solutions.
- Investigate and troubleshoot data and user-related system errors to ensure data integrity.
- Implement ETL processes to extract, transform, and load data from multiple sources into a data warehouse.
- Cleaning, transform, and prepare large and complex datasets.
- Handling missing values, outliers, inconsistencies, and data quality issues.
- Data scaling, normalization, encoding, and feature engineering techniques.
- Applying ML models to our prepared Data to gain insight
Other
- Minimum of 3 years of ETL development experience
- Strong communication skills
- Solution-oriented mindset
- Self-directed and self-motivated with a proven ability to deliver results.
- Fluent written/spoken English