Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
NTT DATA Services

Site Reliability Engineer - Onsite

San Leandro, CA

Company Overview:
Req ID: 278807
NTT DATA Services strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.
We are currently seeking a Site Reliability Engineer to join our team in San Leandro, California (US-CA), United States (US).

Job Description:

  • 5 + years of experience in Production support/SRE teams with continued focus on improving Platform health
  • Experience working in Micro service architecture.
  • Hands-on Java coding exp and able to analyze and trouble shoot production issues by reading stack trace and exceptions.
  • Familiar with Agile or other rapid application development practices
  • Hands-on expertise in building monitoring dashboards and setting up alerts using Splunk.
  • Hands-on experience in writing Oracle SQL queries and MongoDB queries.
  • Experience with distributed (multi-tiered) systems, algorithms, and relational databases.
  • Must have working knowledge of APM tools such as splunk, ELK, Grafana, Prometheus etc
  • Knowledge & Exposure caching tools (Redis, memcache) or messaging tools such as MQ, Kafka is a plus
  • Working knowledge of CICD is a plus - Source control like Git/Bitbucket , Continuous Integration - Jenkins / UCD Release etc .
  • Ability to work with Engineering teams across the ecosystem such as Security , Networking & Infrastructure challenges which can impact platform health & resiliency.
  • Shell Scripting / DevOps tools like Ansible with good knowledge of yaml file to write playbooks .
  • Experience with distributed storage technologies like NFS as well as dynamic resource management frameworks PCF, Kubernetes / OpenShift.
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.

Want more jobs like this?

Get Software Engineering jobs in San Leandro, CA delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

Expectations:
  • You will be a core member of a SRE support team, will be utilizing the latest technology tools to write code, test cases, working with API specs and automate to maintain the resiliency, performance and availability of Digital Sales & Marketing platforms.
  • Strong & relevant experience in supporting Web/API platforms built using Java/java script Stack (Spring/Spring boot, Javascript -Angular/react)
  • Proficiency in dealing with Legacy infrastructure along with cloud infrastructure (on prem & 3rd party) such as PCF or Azure.
  • Identifying opportunities to adopt to new technologies while improving the efficiency by removing toil and continues to drive efficiency & optimization.
  • Proactive monitoring of app performance through splunk, App dashboards, App dynamics & Dynatrace etc.
  • Represent Platform engineering teams during production outages and collaborate with engineering teams to resolve production outages. Collaborate with stake holders across engineering function to own/derive RCA & work towards permanent resolution.
  • Plan, support, execute and comply with governance programs/processes in support of a strong control environment in your functional area. Leverage process documentation to improve operational controls and identify and remediate process deficiencies.
  • Proactively identify, communicate, mitigate and escalate risk originating from non-compliance of processes, operational errors, and data integrity issues in all applicable processes.
  • Ability to influence SRE practices with in and outside teams to enable a strong DevOps culture with in the organization
  • Responsible for working with Engineering teams to maintain the SLAs & SLOs. Constantly looking out for opportunities to improve platform metrics & communicate the same to stakeholders.
  • Tech Stack : Java/J2EE ( Spring, spring boot, python, shell scripting).
  • Exposure and proficiency in different API styles such as SOAP, REST, Micro services etc.

#indist
#li-ist

About NTT DATA Services:

NTT DATA Services is a recognized leader in IT and business services, including cloud, data and applications, headquartered in Texas. As part of NTT DATA, a $30 billion trusted global innovator with a combined global reach of over 80 countries, we help clients transform through business and technology consulting, industry and digital solutions, applications development and management, managed edge-to-cloud infrastructure services, BPO, systems integration and global data centers. We are committed to our clients' long-term success. Visit nttdata.com or LinkedIn to learn more.

NTT DATA Services is an equal opportunity employer and considers all applicants without regarding to race, color, religion, citizenship, national origin, ancestry, age, sex, sexual orientation, gender identity, genetic information, physical or mental disability, veteran or marital status, or any other characteristic protected by law. We are committed to creating a diverse and inclusive environment for all employees. If you need assistance or an accommodation due to a disability, please inform your recruiter so that we may connect you with the appropriate team.

Where required by law, NTT DATA provides a reasonable range of compensation for specific roles. The starting pay range for this role is $71/hour. Actual compensation will depend on several factors, including the candidate's relevant experience, technical skills, and other qualifications. This position may also be eligible for incentive compensation based on individual and/or company performance

Client-provided location(s): San Leandro, CA, USA
Job ID: NTT_DATA-24-00900
Employment Type: Other