Red Team Engineer (Safeguards)

San Francisco 17 hours agoFull-time External
Negotiable
Requirements Demonstrated experience in penetration testing, red teaming, or application security,Strong technical skills in web application security, including hands-on expertise with security testing tools (Burp Suite, Metasploit, custom scripting frameworks, etc.),A track record of discovering novel attack vectors and chaining vulnerabilities in creative ways,A public body of work such as CVEs, blog posts, or disclosed bug bounty reports,Experience with security testing tools and the ability to build custom automation,Adaptability to understand and build engagements around emerging threats outside of your direct area of expertise,Strong written and verbal communication skills, with the ability to explain technical concepts to varied audiences,Proven ability to think like an attacker,(Desirable) Experience with AI/ML security or adversarial machine learning,(Desirable) Experience testing API security and rate limiting systems,(Desirable) Background in testing business logic vulnerabilities and authorization bypass techniques,(Desirable) Background in anti-fraud, trust & safety, or abuse prevention systems,(Desirable) Familiarity with distributed systems and infrastructure security,(Desirable) Understanding of AI safety considerations beyond traditional security,(Desirable) Familiarity with abuse detection mechanisms and the ability to engineer novel bypasses,Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience,We encourage you to apply even if you do not believe you meet every single qualification What the job involves Anthropic's Safeguards team is seeking a Red Team Engineer to help ensure the safety of our deployed AI systems and products,In this role, you'll take an adversarial approach to uncover vulnerabilities across our product ecosystem before they can be exploited by malicious actors,Your work will span from technical infrastructure vulnerabilities on our products to emergent risks from advanced AI capabilities,While you'll take best practices from traditional security approaches, the focus is on broader safety implications and novel abuse unique to advanced AI systems and associated products,You'll investigate the full spectrum of potential abuse: from coordinated account manipulation and payment fraud to novel exploitation of product features,You'll simulate sophisticated threat actors who chain multiple attack vectors to achieve their objectives,Conduct comprehensive adversarial testing across Anthropic’s product surfaces, developing creative attack scenarios that combine multiple exploitation techniques,Research and implement novel testing approaches for emerging capabilities, including agent systems, tool use, and new interaction paradigms,Design and execute 'full kill chain' attacks that emulate real-world threat actors attempting to achieve specific malicious objectives,Build and maintain systematic testing methodologies that evaluate every aspect of our systems,Develop automated testing frameworks to enable continuous assessment at scale,Collaborate with Product, Engineering, and Policy teams to translate findings into concrete improvements,Help establish metrics for measuring detection effectiveness of novel abuse