ML/Research Engineer, Safeguards
Anthropic
External Application | San Francisco, CA | Hybrid | Full Time
$350,000 - $500,000 / year
Posted 3 hours ago
About the Role
Anthropic is looking for ML Engineers and Research Engineers to help detect and mitigate misuse of AI systems. As a member of the Safeguards ML team, you will build systems that identify harmful use — from individual policy violations to sophisticated, coordinated attacks — and develop defenses that keep products safe as capabilities advance. Salary: $350,000–$500,000 USD.
Requirements
You May Be a Good Fit If You
• Have 4+ years of experience in ML engineering, research engineering, or applied research
• Have proficiency in Python and experience building ML systems
• Are comfortable working across the research-to-deployment pipeline
• Have strong communication skills to explain complex technical concepts to non-technical stakeholders
Strong Candidates May Also Have Experience With
• Language modeling and transformers
• Building classifiers, anomaly detection systems, or behavioral ML
• Adversarial machine learning or red-teaming
• Interpretability or probes
• Reinforcement learning