Scribd aims to power metadata extraction, enrichment, and content understanding across all its brands, processing millions of documents and billions of images to deliver high-quality metadata for content discovery and trust. This role integrates machine learning models and LLM-based services into production pipelines, delivering high-performance solutions to generative AI and metadata enrichment problems at global scale.
Requirements
- 4+ years of professional software engineering experience
- Proficiency in Python, Scala, Ruby, or similar languages
- Experience designing and building distributed systems at scale
- Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda
- Experience with infrastructure-as-code tools such as Terraform
- Experience working with a public cloud provider (AWS, Azure, or Google Cloud)
- Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads
- Proven ability to test, profile, and optimize systems for performance, scalability, and reliability
- Bonus: Experience working with LLMs or integrating ML models into production systems
Responsibilities
- Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio files.
- Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines.
- Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.
- Optimize and refactor existing systems for performance, scalability, and reliability.
- Ensure data accuracy, integrity, and quality through automated validation and monitoring.
- Participate in code reviews to uphold best practices and maintain high quality standards in the codebase.
- Manage and maintain data pipelines, security, and infrastructure.
Other
- Employees must have their primary residence in or near one of the specified cities in the US, Canada, or Mexico.
- Occasional in-person attendance is required for all Scribd employees, regardless of their location.
- Demonstrate the intersection of passion and perseverance towards long-term goals (GRIT: Goals, Results, Innovative, Team).
- Set and achieve Goals.
- Achieve Results within their job responsibilities.