ML/Research Engineer, Safeguards

Anthropic

San Francisco, CA | Hybrid | Full Time

$350,000 - $500,000 / year

About the Role

Anthropic is looking for ML Engineers and Research Engineers to help detect and mitigate misuse of AI systems. As a member of the Safeguards ML team, you will build systems that identify harmful use — from individual policy violations to sophisticated, coordinated attacks — and develop defenses that keep products safe as capabilities advance. Salary: $350,000–$500,000 USD.

Requirements

You May Be a Good Fit If You

• Have 4+ years of experience in ML engineering, research engineering, or applied research
• Have proficiency in Python and experience building ML systems
• Are comfortable working across the research-to-deployment pipeline
• Have strong communication skills to explain complex technical concepts to non-technical stakeholders

Strong Candidates May Also Have

• Language modeling and transformers experience
• Building classifiers, anomaly detection systems, or behavioral ML
• Adversarial machine learning or red-teaming
• Interpretability or probes
• Reinforcement learning
