Enhancing in-house Large Language Models (LLMs) and seamlessly integrating them into Rakuten's services to impact millions of users worldwide.
Requirements
- Deep understanding of the latest advancements in deep learning and generative AI, including LLMs, RAG, and AI agents and related areas. Familiarity with current research trends and open challenges in the field.
- Ability to translate complex business challenges into well-defined machine learning tasks, including the creation of novel datasets and evaluation metrics beyond existing benchmarks.
- Strong experimental design skills, with the ability to develop rigorous and objective evaluation methodologies to demonstrate the effectiveness of proposed solutions. Experience in analyzing experimental results and drawing meaningful conclusions.
- Expert-level coding skills in Python and PyTorch, with a strong understanding of software engineering principles. Ability to not only fine-tune existing models but also identify and implement improvements to model architectures and training frameworks. Experience in integrating cutting-edge research findings into in-house machine learning libraries.
- Extensive experience in developing and deploying large-scale machine learning models, including distributed training using frameworks such as FSDP and DeepSpeed.
- Ability to actively participate in project discussions, provide constructive feedback and technical guidance to junior members.
- Experience in developing software to process a large-scale dataset a distributed computing framework (e.g., Hadoop)
Responsibilities
- Drive and execute cutting-edge research to advance the state-of-the-art in generative AI and large language models, contributing to areas including multimodal modeling, domain adaptation, and high-performance small language models.
- Own and pursue an independent research agenda, identifying impactful research problems, designing innovative solutions, and carrying out long-running projects.
- Design and conduct experiments, including experimental design, code development, evaluations, and results analysis, ensuring research outcomes are reusable and impactful.
- Work collaboratively with globally distributed people in a range of roles, including researchers, engineers, product managers, designers, and other key product stakeholders to accomplish complex tasks that deliver value to our business.
- Stay up to date on research and product advances in AI to teach colleagues without research backgrounds state-of-the-art techniques in deep learning and generative AI.
- Mentor junior researchers and promote a healthy, cross-functional team environment.
- Contribute to publications, open-source projects, and knowledge-sharing efforts, enhancing the visibility and reach of research findings.
Other
- Master’s in computer science, Machine Learning or related field with 5+ years of relevant industry experience.
- Preferred PhD in Computer Science, Machine Learning or related field
- Proven experience in a research-focused role within industry, academia, or government institutions.
- Demonstrated ability to publish research findings in top-tier peer-reviewed conferences and journals.*
- Being reviewers for the top-tier conferences
- Proven leadership experience in managing and mentoring a team of 3+ researchers and engineers in research and development projects.
- Excellent communication and interpersonal skills, with the ability to effectively convey complex technical ideas to both technical and non-technical audiences.