NVIDIA is seeking to accelerate Apache Spark on GPUs to handle large data processing needs and reduce cloud computing costs and latency
Requirements
- Outstanding technical skills in designing and implementing high-quality distributed systems
- Excellent programming skills in C++, Java, and/or Scala
- OS kernel dev experience is a strong plus
- Experience with open source libraries like RAPIDS
- Experience with CUDA/C++
- Experience with distributed systems
- Experience with Apache Spark
Responsibilities
- Develop CUDA/C++ libraries to accelerate DataFrames and I/O operations on common file formats such as Parquet, ORC and JSON
- Collaborate with distributed systems teams to craft solutions to distributed processing problems challenges at large scale
- Work with open source communities to enhance libraries like RAPIDS, CCCL and UCX through technical discussion and code contributions
- Provide recommendations and feedback to teams regarding decisions surrounding topics such as infrastructure, continuous integration and testing strategy
- Build, test and optimize CUDA/C++ libraries across different platforms
Other
- BS, MS, or PhD in Computer Science, Computer Engineering, or closely related field (or equivalent experience)
- 12+ years of work experience in software development
- Ability to work with teams across organizational boundaries and geographies
- Highly motivated with strong interpersonal skills
- Ability to work in a diverse work environment