Senior Software Engineer - AI for Security - Model Evaluation

ByteDance

Salary not specified

Sep 4, 2025

San Jose, CA, USA

The Security Engineering team at ByteDance is looking to construct, implement, and sustain secure infrastructure, platforms, and technologies to serve and safeguard ByteDance products and infrastructure on a global scale.

Requirements

Strong coding and algorithm foundation: Excellent programming skills, strong knowledge of data structures and algorithms, proficiency in at least one mainstream programming language (e.g., Python, Java, C++).
Familiarity with AI-related tech stack: Solid understanding of NLP, CV, ML technologies, with in-depth knowledge of LLM-related stacks (e.g., Reward Model, GRPO/PPO/DPO, SFT/RFT, CT, PE).
Published research papers in top conferences/journals in the CV, NLP, or security domains.
Experience with security-related models (e.g., vulnerability detection models, malicious code analysis models) is a plus.
Leading impactful projects or publishing significant papers in the LLM or AI security domain is preferred.

Responsibilities

Build and refine AI security evaluation datasets: Design and develop comprehensive, in-depth, and challenging evaluation datasets and benchmarks for AI-for-Security across different security scenarios.
Explore model consistency and performance prediction in security contexts: Conduct deep research on LLM performance during training on security tasks and assess the performance limits of models in security applications.
Develop security evaluation standards from an interpretability perspective: Propose interpretability-based evaluation standards grounded in model mechanisms to assess transparency and reliability of LLMs in security decision-making and remediation.
Red Teaming and model optimization: Perform Red Teaming from an evaluation perspective to systematically identify weaknesses of LLMs in security contexts and propose targeted optimization strategies.
Build RAG evaluation systems: Design end-to-end evaluation metrics and benchmarks for security-specific RAG systems, create automated evaluation workflows, and develop interpretability and traceability tools for RAG systems.

Other

Strong communication and collaboration skills: Ability to work closely with team members, explore new technologies collaboratively, and drive technological advancements.
Excellent problem-solving skills: Strong analytical and problem-solving capabilities with the ability to independently explore innovative solutions.