CyberCube is looking to solve challenging problems and implement scalable systems for large-scale internet data collection to construct high-quality datasets that drive market-differentiating features within their product offerings.
Requirements
- 5+ years of experience in large-scale data collection / software engineering
- Deep understanding of the internet infrastructure and technologies such as DNS, CDN, ASN, ISP, and CSP
- Highly proficient in SQL and Python, strong fundamentals in data structures, algorithms, and base Python paradigms, including object-oriented programming as well as functional programming concepts; Familiarity with modules commonly used for data collection, analysis, and machine learning such as Pandas, NumPy, Scikit-learn etc.
- Experience with AWS technologies for computation, orchestration, and storage
- Experience with designing, deploying, and maintaining microservice applications
- Ability to think creatively and implement a plan to address bugs and urgent product issues promptly
- Aptitude for time-sensitive analysis and troubleshooting tasks
Responsibilities
- Own the strategy for the Data Collection function and develop and test ideas for new collection initiatives
- Manage the entire third-party data lifecycle, including sourcing, contracting, negotiation, and post-contract account management.
- Engage in cross-functional collaboration with stakeholders from engineering, product management, cyber risk modeling, actuarial science, economics, and client success to deliver on projects and product goals
- Deliver on key objectives and quarterly development goals across 3+ products
- Maintain, monitor, and apply design updates to existing data collection systems
- Develop and continually update technical documentation
- Perform code reviews and suggest opportunities for improvements
Other
- This role is onsite in San Francisco
- Excellent oral and written communication skills
- Experience working in and leading project teams
- Ability to work in an agile, fast-paced startup environment
- Ability to maintain relationships, collaborate effectively, and meet deadlines while working remotely