Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Technical AI Policy Researcher, Model Behaviour - Trust and Safety

5 days ago San Francisco, CA

Responsibilities

The Trust & Safety (T&S) Responsible AI Policy team's mission is to ensure the development of GenAI models and applications are safe, fair and trustworthy. We do this by defining, measuring and mitigating safety and fairness AI model risks through policy frameworks, model risk assessments, and upstream policy solutions.

The T&S Responsible AI Policy team sits within the T&S GenAI and Emerging Products pillar. We work closely with Trust & Safety teams (product policy, product, engineering, data science, operations, red teaming), business and model teams, and cross-functional stakeholders (comms, legal, public policy) across global markets. Success in this team requires strong policy acumen, judgment, creativity, analytical rigour, and the ability to translate Generative AI risk to different stakeholders effectively.

As an AI Policy Researcher on the T&S Responsible AI Policy team, you will champion the responsible development and deployment of our frontier AI models across multiple businesses with a specialty on model bias, political risk, and model behaviour. You will accelerate technical policy research, incubate new research efforts, and drive end-to-end policy to evaluate workflows for your domain areas.

Responsibilities:
- Design and maintain multimodal GenAI policies across safety-relevant domains, including political and ideological bias, deceptive misuse, manipulation and persuasion, and fairness.
- Translate risk and harm models into clear behavioral specifications, evaluation criteria, grading guidance, and system-level safeguards.
- Define practical boundaries between beneficial uses of AI and assistance that could materially enable harm, exploitation, misuse, or unsafe outcomes.
- Build policy artifacts that support model training, evaluation, and deployment. Partner with safety researchers, engineers, product teams, and other stakeholders to operationalize policy into scalable model behavior and measurable safeguards.
- Design end-to-end policy development to pre-launch evaluation to post-launch monitoring workflows across safety-relevant domains, including golden set construction, labeling guidance, calibration, adjudication, and eval coverage analysis, to ensure policies can be reliably measured and improved.
- Use red-teaming results, deployment data, model failures, over-refusals, under-refusals, and ambiguous edge cases to improve policy and evaluation quality over time.
- Identify emerging capability areas where frontier AI systems could create new safety, fairness or bias challenges or lower barriers to harm.
- Monitor post-launch model activity to identify gaps in our policy framework to capture unsafe model behaviour.
- Champion research to strengthen the defensibility and operability of policy positions, including working with Outreach and Partnerships to incorporate external expert input into relevant policy positions.
- Combine longer-horizon safety research with hands-on launch and deployment work.
- Contribute to safety reports, policy documentation, launch reviews, and AI governance reviews on the company's approach to building AI responsibly.
- Support regulatory teams as a subject matter expert on AI compliance related initiatives.

Qualifications

Minimum Qualifications:
- 5 years in Trust & Safety, AI Safety Research, AI Ethics, technical AI Governance, or equivalent experience.
- Degree in Computer Science, Human-Computer Interaction, Engineering, Data Science or quantitative Social Sciences.
- Direct experience in policy development, AI evaluations, red-teaming, or AI governance work.
- Strong technical understanding of LLM, multimodel, or generative media model behavior, model failure modes, and safety risks.
- Demonstrated experience working with external experts and stakeholders, including civil society, government, and academia.
- Demonstrated success working in a fast-paced technology company or research organization conducting AI impact, risk assessments or algorithmic audits, and/or data science or product development related experience.
- Ability to advocate for safety amongst a wide variety of business stakeholders including Product Policy, Engineering, Public Policy, Legal, Communications, and Data Science.

Preferred Qualifications:
- Ability to explain complex technical concepts to non-technical stakeholders.
- Experience working with governments, frontier AI companies, or AI Safety organizations.
- Familiarity in Python and experience building ML systems
- Are comfortable working across the research-to-deployment pipeline, from exploratory experiments to production systems.

Job Information

[For Pay Transparency] Compensation Description (annually)

The base salary range for this position in the selected city is $93600 - $220400 annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.

Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

Want more jobs like this?

Get Data and Analytics jobs in San Francisco, CA delivered to your inbox every week.

Job alert subscription


The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates:

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:

1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;

2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and

3. Exercising sound judgment.

Client-provided location(s): San Francisco, CA
Job ID: TikTok-7642509022655301941
Employment Type: OTHER
Posted: 2026-05-25T19:57:44

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Dental Insurance
    • Vision Insurance
    • HSA
    • Life Insurance
    • Fitness Subsidies
    • Short-Term Disability
    • Long-Term Disability
    • On-Site Gym
    • Mental Health Benefits
    • Virtual Fitness Classes
  • Parental Benefits

    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
  • Work Flexibility

    • Flexible Work Hours
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Casual Dress
    • Snacks
    • Pet-friendly Office
    • Happy Hours
    • Some Meals Provided
    • Company Outings
    • On-Site Cafeteria
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Personal/Sick Days
    • Leave of Absence
  • Financial and Retirement

    • 401(K) With Company Matching
    • Performance Bonus
    • Company Equity
  • Professional Development

    • Promote From Within
    • Access to Online Courses
    • Leadership Training Program
    • Associate or Rotational Training Program
    • Mentor Program
  • Diversity and Inclusion

    • Diversity, Equity, and Inclusion Program
    • Employee Resource Groups (ERG)

Company Videos

Hear directly from employees about what it is like to work at TikTok.