Responsibilities
The Infra SRE-Infrastructure-Assurance team extends TikTok infrastructure's operability, observability, visibility, and automation. We aim to provide holistic insights and solutions to TikTok infrastructure with minimal manual interventions. We're young and fast-growing. Our team values transparency, collaboration, hard-work and innovations. We believe in planning and long-term strategies rather than short-term gains. Join us in solving large-scale complex issues in a hyper-growth team; Embracing challenges with a fearless curious mind.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Want more jobs like this?
Get Software Engineering jobs in Mountain View, CA delivered to your inbox every week.
Responsibilities:
- Perform SRE duties and operations on supported services in production, including but not limited to: on-call rotations, maintenance, change management, monitoring, incident response, capacity planning, disaster recovery.
- Maximize system uptime, availability and stability, to ensure functional and performance SLAs.
- Contribute to existing documentations and build effective documentations such as operational runbooks, SOPs, SLA/SLO.
- Initiate and lead scripting/tooling/automation to streamline processes and minimize human resource.
- Work cross functionally and regionally with SRE/Dev/QA/PM teams to handle incidents and improve processes.
- Manage and prioritize tasks/projects for high productivity and precise deliveries.
Qualifications
Minimum Qualifications:
-Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
-demonstrated experience in software development with one or more programming languages.
-experience in Linux Operating Systems, Networking, Database concepts, Monitoring, Shell scripting.
-Superb analytical ability, problem solving and critical thinking skills.
-Excellent communicator, team-player, self-starter and fast learner.
Preferred Qualifications:
-Master's degree in Computer Science, Engineering or a related field.
-Proficient in any of the following languages: Python, GoLang, C++.
-Expertise in any of the following: SRE philosophy, AIOPS, APM, Disaster Recovery.
-Expertise in any of these tech stacks: Kubernetes, ElasticSearch, ClickHouse, Message Queue, OpenTSDB, Service Mesh.
Candidates for this position must be legally authorized to work in the United States. This position is not eligible for visa sponsorship or support.
Job Information
[For Pay Transparency] Compensation Description (annually)
The base salary range for this position in the selected city is $116000 - $250000 annually.
Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) Candidates:
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:
1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and
3. Exercising sound judgment.