NewsBreak is seeking a Backend Engineer to develop and maintain content system solutions that collect and process data from various sources.
Requirements
- Strong proficiency in programming languages such as Python, Java, or Golang.
- Knowledge of web scraping libraries and frameworks (e.g., Scrapy, Beautiful Soup).
- Familiarity with handling dynamic content, CAPTCHAs, and IP rotation.
- Understanding of HTML, XPath, CSS, and JavaScript for parsing web content.
- Experience with database systems for storing and managing collected data.
- Knowledge of data collection ethics and compliance with website terms of service.
- Experience with web scraping and data extraction.
Responsibilities
- Design, develop, and maintain content integration solutions to collect data from various sources.
- Collaborate with data scientists and analysts to define data extraction requirements.
- Monitor and troubleshoot content acquisition to ensure data accuracy and reliability.
- Implement data validation and quality checks to ensure the integrity of collected data.
- Stay up-to-date with industry trends and best practices in web scraping and data extraction.
- Work closely with cross-functional teams to integrate collected data into our products or services.
- Identify and address potential legal and ethical considerations related to data collection.
Other
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills.
- Health, dental, and vision care for you and your family
- Top-tier 401(K) plan with company matching
- Paid time off and paid holidays
- Paid parental leave
- FSA and commuter benefits programs
- Team activity budget