ML/Research Engineer, Safeguards
Anthropic
External ApplicationSan Francisco, CAHybridFull Time
$350,000 - $500,000 / year
Posted 3 weeks ago18 views
About the Role
Anthropic is looking for ML Engineers and Research Engineers to help detect and mitigate misuse of AI systems. As a member of the Safeguards ML team, you will build systems that identify harmful use — from individual policy violations to sophisticated, coordinated attacks — and develop defenses that keep products safe as capabilities advance. Salary: $350,000–$500,000 USD.
Requirements
You May Be a Good Fit If You
• Have 4+ years of experience in ML engineering, research engineering, or applied research
• Have proficiency in Python and experience building ML systems
• Are comfortable working across the research-to-deployment pipeline
• Have strong communication skills to explain complex technical concepts to non-technical stakeholders
Strong Candidates May Also Have
• Language modeling and transformers experience
• Building classifiers, anomaly detection systems, or behavioral ML
• Adversarial machine learning or red-teaming
• Interpretability or probes
• Reinforcement learning