Backend Software Engineer (Architect) - Reliability - Singapore
Responsibilities
Team introduction:
Build Reliability at Global Scale
Every time a short video is posted or viewed on TikTok, our team is working behind the scenes to make sure it happens instantly and reliably. The Short Video Reliability team blends deep systems expertise with large-scale architecture design to keep TikTok running smoothly for billions of users.
We design for the unexpected. Whether it's a viral trend flooding the platform, a major global event, a cross-region migration, or disaster recovery, our systems are built to adapt and thrive.
We're now looking for experienced engineers and architects to join our Singapore team. In this role, you'll design, build, and scale the core reliability infrastructure that underpins TikTok's short video ecosystem. Your work will directly shape the performance, resilience, and evolution of one of the most-used platforms in the world.
Responsibilities:
- Architect and build self-healing systems that adapt to infrastructure changes, migrations, and global-scale challenges
- Design smart traffic and load management to keep performance steady during viral spikes, large events, and global campaigns
- Develop monitoring, alerting, and automation that spots and fixes issues before they affect users
- Lead the creation of reliability frameworks for topology mapping, capacity planning, automated recovery, and disaster readiness
- Continuously refine system architecture for better performance, fault tolerance, and maintainability
- Apply chaos engineering, fault injection, and failure simulations to stress-test our systems
- Use A/B testing to measure the real-world impact of your improvements
- Mentor engineers and help set the team's technical direction
Qualifications
Minimum Qualifications:
- 5+ years in backend, infrastructure, or reliability engineering
- Strong coding skills in Python, Go, Java, C++, or similar
- Solid grasp of distributed systems, networking, and fault-tolerant design
- Experience with Linux/Unix and large-scale infrastructure (cloud or on-prem)
- Proven track record delivering high-availability systems in production
- Strong debugging, analysis, and problem-solving skills
- Strong communication and writing skills
Preferred Qualifications:
- Experience with video platforms, streaming, or CDN optimization
- Background in highly reliable production systems
- Knowledge of service mesh, edge routing, or traffic shaping at scale
- Hands-on experience with chaos engineering and incident response
- Strong system design and technical leadership skills
- Excellent communication and ability to work across global teams
Perks and Benefits
Health and Wellness
- Health Insurance
- Dental Insurance
- Vision Insurance
- HSA
- Life Insurance
- Fitness Subsidies
- Short-Term Disability
- Long-Term Disability
- On-Site Gym
- Mental Health Benefits
- Virtual Fitness Classes
Parental Benefits
- Fertility Benefits
- Adoption Assistance Program
- Family Support Resources
Work Flexibility
- Flexible Work Hours
- Hybrid Work Opportunities
Office Life and Perks
- Casual Dress
- Snacks
- Pet-friendly Office
- Happy Hours
- Some Meals Provided
- Company Outings
- On-Site Cafeteria
- Holiday Events
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Personal/Sick Days
- Leave of Absence
Financial and Retirement
- 401(K) With Company Matching
- Performance Bonus
- Company Equity
Professional Development
- Promote From Within
- Access to Online Courses
- Leadership Training Program
- Associate or Rotational Training Program
- Mentor Program
Diversity and Inclusion
- Diversity, Equity, and Inclusion Program
- Employee Resource Groups (ERG)