AI Content Red Team Analyst - Trust and Safety

Location:

Singapore

Team:

Global Operations

Employment Type:

Regular

Job Code:

A222833A

Share this listing:

Responsibilities

About the team: The Trust & Safety (T&S) GenAI & Emerging Product team's mission is to empower the development of GenAI models and applications. We do this by building a world-class safety, testing, and risk management system that ensures GenAI innovations are launched responsibly. The AI Content Red Team sits within the T&S GenAI and Emerging Products pillar. The team is responsible for conducting unstructured adversarial testing of ByteDance's generative AI products and models to uncover emerging risks, alongside our structured evaluations. This team combines attacker-minded testing, risk discovery, and clear operational feedback loops to inform product decisions, policy development, mitigations, and longer-term evaluation strategy. We probe models and product experiences across modalities, use cases, and abuse patterns to identify failure modes, stress-test safeguards, and help teams improve safety before and after launch. We work closely with Trust & Safety teams (policy, product, engineering, data science, operations), and business teams across global markets. Success in this team requires strong judgment, creativity, analytical rigor, and the ability to translate ambiguous findings into actionable recommendations. Key Responsibilities - Conduct structured adversarial testing on AI models, features, and policies to identify vulnerabilities and emerging risks. - Explore product behavior across contexts and user journeys, to identify model failure modes that may not be captured in standard evaluations. - Investigate jailbreaks, evasions, prompt-based attacks, and other adversarial techniques relevant to content safety. - Document findings clearly and consistently, including risk descriptions, reproduction steps, severity assessments, and mitigation recommendations. - Partner with cross functional stakeholders (policy, product, business teams) to ensure mitigation validation and root cause closure - Support development of testing playbooks, taxonomies, and internal knowledge bases - Stay updated on emerging adversarial trends (e.g., deepfakes, multimodal manipulation, coordinated abuse), and shifts in the external risk landscape

Qualifications

Minimum Qualification(s) - 3+ years in Trust & Safety, cybersecurity, risk/adversarial testing, or related fields - Experience with prompt testing, jailbreak analysis, LLM evaluation, or adversarial QA - Familiarity with AI safety risks (jailbreaks, hallucinations, bias, misuse patterns) - Strong interest in GenAI safety, and the ways AI systems can be compromised under adversarial conditions. - Demonstrated ability to independently investigate ambiguous problems, identify non-obvious failure modes and abuse patterns, and produce clear, evidence-based conclusions - Strong communication skills - Ability to manage multiple priorities, and collaborate effectively with cross-functional teams Preferred Qualification(s) - Experience working with agentic AI tools to scale your impact, including building/operating AI tools to make processes efficient and effective.

Job Information

About Us

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.​

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.​
As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.​
Diversity & Inclusion​
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.​

Trust & Safety at ByteDance

Content that this role interacts with includes images, video, and text related to every-day life, but it can also include (but is not limited to) bullying; hate speech; child safety; depictions of harm to self and others, and harm to animals. Hence, it is possible that this role will be exposed to harmful content on a daily basis.​
​
ByteDance recognises that keeping our platform safe for the ByteDance communities is no ordinary job which can be both rewarding and psychologically demanding and emotionally taxing for some. This is why we are sharing the potential hazards, risks and implications in this unique line of work from the start, so our candidates are well informed before joining.​
​
We are committed to the wellbeing of all our employees and promise to provide comprehensive and evidence-based programs, to promote and support physical and mental wellbeing throughout each employee's journey with us. We believe that wellbeing is a relationship and that everyone has a part to play, so we work in collaboration and consultation with our employees and across our functions in order to ensure a truly person-centred, innovative and integrated approach.​