FocusKPI is looking for an LLM or GenAI Application Engineer to join one of their clients, a high-tech SaaS company, to contribute to LLM-based application development, evaluation, and testing of new features, as well as core technology, such as an agent framework, utilizing the latest technology stack, LLM technology.
Requirements
- Expertise with LLM and GenAI application development.
- Experience with deep learning frameworks such as TensorFlow, PyTorch, or JAX.
- Hands-on experience with transformer-based models (e.g., GPT, BERT, RoBERTa, LLaMA).
- Expertise in natural language processing (NLP) and sequence-to-sequence models.
- Familiarity with Hugging Face libraries and OpenAI APIs.
- Experience with MLOps tools like Docker, Kubernetes, and CI/CD pipelines.
- Strong understanding of distributed computing and GPU acceleration using CUDA.
Responsibilities
- Design, train, and fine-tune large language models (e.g., GPT, LLaMA, PaLM) for various applications.
- Research cutting-edge techniques in natural language processing (NLP) and machine learning to improve model performance.
- Explore advancements in transformer architectures, multi-modal models, and emergent AI behaviors.
- Collect, clean, and preprocess large-scale text datasets from diverse sources.
- Develop and implement data augmentation techniques to improve training data quality.
- Optimize model architecture to improve accuracy, efficiency, and scalability.
- Implement techniques to reduce latency, memory footprint, and inference time for real-time applications.
Other
- Required to have 5-7 years of industrial work experience along with research/academic experience.
- Advanced degree in Computer Science, Artificial Intelligence, Data Science, or a related field.
- The candidate must have a real (actual) product experience, application development, and GenAI-based application shipping.
- Actual or Industrial LLM or GenAI application experience of at least 2-3 years
- Hybrid role (4 days per week onsite)