Senior Backend Software Engineer, Model Training - Trust and Safety - Singapore
Responsibilities
The Trust and Safety(TnS) engineering team is responsible for protecting our users from harmful content and abusive behaviors. With the continuous efforts of our trust and safety engineering team, TikTok can provide the best user experience and bring joy to everyone in the world. Our team is responsible for achieving goals by building content moderation process systems, rule engine, strategy systems, feature engine, human moderation platforms, risk insight systems and all kinds of supportive platforms across TnS organization.
Responsibilities - What You'II Do
1. Work closely with business teams to optimize the integration plan for algorithm applications, improve efficiency in evaluating and using algorithm applications across various business scenarios, and reduce the cost of managing and optimizing algorithm applications in different business scenarios.
2. Be responsible for the architectural design, development, and performance tuning of algorithm applications, solving technical challenges such as high concurrency, high reliability, and high scalability. Work includes multiple sub-areas: ML model training and evaluation, model optimization, model inference, model management, dataset management, workflow orchestration, etc.
Want more jobs like this?
Get jobs in Singapore delivered to your inbox every week.

3. Responsible for the design and development of Machine Learning infrastructure for LLM/AIGC, etc
4. Build up a super large machine learning system integrating GPUs, RDMA networking, and high-performance storage
5. Be responsible for researching and implementing cutting-edge engineering technologies related to LLM, NLP, CV.
Qualifications
Minimum Qualification
- At least 5 years of industrial experience;
- Hands-on experience in one or more of the following areas: Machine Learning, Deep Learning, Recommender Systems, Natural Language Processing, or Computer Vision
- Be proficient in 1 to 2 programming languages such as C++/Go/Python/Shell in Linux environment.
Understand the principles of distributed systems and have experience in design, development and maintenance of large-scale machine learning systems
- Be familiar with Kubernetes architecture, and have rich experience in system-level development and tuning.
Familiar with the ML Infrastructure of Large Model training and inference
- Strong understanding and engineering experience of cutting-edge LLM research and engineering (e.g., long context, multi modality, alignment research, agent ecosystem, etc.) and possess practical expertise in effectively implementing these advanced systems.
- Proficiency in programming languages such as Python, CUDA or C++ and a track record of working with deep learning frameworks (e.g., pytorch, deepspeed, megatron, vllm, etc.).
- Have experience with large scale data processing and parallel computing
Preferred Qualifications
- Excellent programming skills, data structure and algorithm skills, proficient in C/C++ or Python programming language, candidates with awards in ACM/ICPC, NOI/IOI, Top Coder, Kaggle and other competitions are preferred.
- Research or industry experience in the field of machine learning, especially in large language models (LLMs) and generative artificial intelligence.
- Distributed training framework optimizations such as DeepSpeed, FSDP, Megatron, GSPMD
- Experiences in in-depth CUDA programming and performance tuning (cutlass, triton)
- Experience with evaluation of ML models, LLM application & agent development is desirable.
- PhD/Master's degree required, with top artificial intelligence conference papers (NeurIPS, ICML, ICLR, CVPR, ACL, EMNLP, etc.) in machine learning (ML), computer vision (CV), natural language processing (NLP) and other fields.
Perks and Benefits
Health and Wellness
- Health Insurance
- Dental Insurance
- Vision Insurance
- HSA
- Life Insurance
- Fitness Subsidies
- Short-Term Disability
- Long-Term Disability
- On-Site Gym
- Mental Health Benefits
- Virtual Fitness Classes
Parental Benefits
- Fertility Benefits
- Adoption Assistance Program
- Family Support Resources
Work Flexibility
- Flexible Work Hours
- Hybrid Work Opportunities
Office Life and Perks
- Casual Dress
- Snacks
- Pet-friendly Office
- Happy Hours
- Some Meals Provided
- Company Outings
- On-Site Cafeteria
- Holiday Events
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Personal/Sick Days
- Leave of Absence
Financial and Retirement
- 401(K) With Company Matching
- Performance Bonus
- Company Equity
Professional Development
- Promote From Within
- Access to Online Courses
- Leadership Training Program
- Associate or Rotational Training Program
- Mentor Program
Diversity and Inclusion
- Diversity, Equity, and Inclusion Program
- Employee Resource Groups (ERG)
Company Videos
Hear directly from employees about what it is like to work at TikTok.