Site Reliability Engineer - Analytical Database
Responsibilities
About the team
TikTok and its affiliates are developing a next-generation high-performance analytical database, with a mission to enable efficient, real-time, data-driven decision-making on petabyte-scale data sets. The initial product was forked from ClickHouse and has since undergone a major re-architecture. The product now not only improves on the efficiency of ClickHouse but also fits into elastic cloud-native infrastructure, with better scalability and resource utilization. After years of refinement in internal exabyte-scale scenarios, we are now ready to serve our business partners via various cloud vendors.
What you will be doing:
- Build tools, automation, and monitoring for distributed data platforms running as SaaS and on-premises deployments.
- Collaborate with engineering, infrastructure, security, and product teams to implement DevOps solutions that ensure the scalability and reliability of the system.
- Contribute to the architecture, design, and improvement of our DevOps processes.
- Research, design, and develop computer and network software or specialised utility programs.
- Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.
- Update software, enhance existing software capabilities, and develop and direct software testing and validation procedures.
- Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.
Qualifications
Minimum Qualifications:
- Bachelor's degree in Computer Science or a related technical background involving software/system engineering, or equivalent working experience.
- Expertise in DevOps technologies like Ansible, Terraform, Salt, Bash Scripting, etc.
- Expertise in containerization technologies including Docker and Kubernetes.
- Expertise in automating, analyzing, and troubleshooting large-scale distributed systems.
- Experience in building solutions with AWS, Google Cloud, Azure, AliCloud, or other cloud services.
- Familiarity with Unix/Linux operating systems.
Preferred Qualifications:
- 3 years of production-level experience in Python, Java, or Go.
- 3 years of experience in DevOps technologies like Ansible, Terraform, Salt, Bash scripting, etc.
- 3 years of experience in containerization technologies including Docker and Kubernetes.
- 3 years of experience in automating, analyzing, and troubleshooting large-scale distributed systems.
Perks and Benefits
Health and Wellness
- Health Insurance
- Dental Insurance
- Vision Insurance
- HSA
- Life Insurance
- Fitness Subsidies
- Short-Term Disability
- Long-Term Disability
- On-Site Gym
- Mental Health Benefits
- Virtual Fitness Classes
Parental Benefits
- Fertility Benefits
- Adoption Assistance Program
- Family Support Resources
Work Flexibility
- Flexible Work Hours
- Hybrid Work Opportunities
Office Life and Perks
- Casual Dress
- Snacks
- Pet-friendly Office
- Happy Hours
- Some Meals Provided
- Company Outings
- On-Site Cafeteria
- Holiday Events
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Personal/Sick Days
- Leave of Absence
Financial and Retirement
- 401(K) With Company Matching
- Performance Bonus
- Company Equity
Professional Development
- Promote From Within
- Access to Online Courses
- Leadership Training Program
- Associate or Rotational Training Program
- Mentor Program
Diversity and Inclusion
- Diversity, Equity, and Inclusion Program
- Employee Resource Groups (ERG)