PayPal seeks to advance its AI capabilities by developing cutting-edge large language models and foundation model architectures that solve complex business challenges in the global payments ecosystem.
Requirements
- Experience with ML frameworks like TensorFlow, PyTorch, or scikit-learn
- Familiarity with cloud platforms (AWS, Azure, GCP) and tools for data processing and model deployment
- Several years of experience in designing, implementing, and deploying machine learning models
- 1-3+ years of hands-on experience training and deploying large-scale language models (7B+ parameters)
- Deep expertise in transformer architectures, attention mechanisms, and modern training techniques
- Experience with distributed training frameworks (PyTorch, JAX, DeepSpeed, etc.)
- Strong background in NLP, deep learning, and statistical machine learning
Responsibilities
- Research and develop large-scale foundation models, including continuous pre-training, supervised fine-tuning, and alignment techniques
- Design novel architectures and training methodologies for domain-specific language models in financial services
- Build scalable ML pipelines for foundation model training, evaluation, and deployment at enterprise scale
- Conduct rigorous experimentation and benchmarking to ensure model quality, safety, and performance
- Deploy foundation models into production environments to drive business insights and enhance customer experiences
- Collaborate with cross-functional teams to identify high-impact use cases and translate research into practical solutions
- Stay current with the latest developments in LLM and LLM-agent research, and contribute to the broader AI/ML community through publications and open-source work
Other
- 3+ years of relevant experience and a Bachelor’s degree, or an equivalent combination of education and experience
- PhD in Computer Science, Machine Learning, AI, or a related field, with a focus on large language models and LLM agents
- Proven track record of research publications in top-tier venues (NeurIPS, ICML, ACL, etc.)
- Mentor junior researchers and contribute to technical strategy for foundation model initiatives
- Ability to work in a hybrid work model with 3 days in the office and 2 days remote