Systems Development Engineer
- Pune, India
DESCRIPTION
Systems Development Engineer
AWS TechOps team is Amazon's central defense against large-scale incidents and drives operational excellence across all of Amazon businesses. Our key offering to Amazon is best-in-class Incident Management. Our engineers are front-and-center in driving down event duration through experience in operational excellence, current best practices and incident management tools. We're looking for engineers who have owned or participated in operations and incident management for at least one large-scale enterprise. You should have a passion for working with new technologies and are not afraid to exercise your creativity in pushing the boundaries of existing technologies. Running incident management for AWS is unique in that AWS supports businesses around the globe, and our ability to identify and mitigate issues impacting customers is of upmost importance. Because of our unique role, you will have limitless exposure to all things Amazon. TechOps engineers are encouraged to build solutions to problems while sharing the benefit of those solutions with other AWS service teams. This is an excellent opportunity to join one of Amazon's world-class engineering teams, and work with some of the best and brightest while also developing your skills and career within one of the most dynamic, innovative and progressive technology companies anywhere. In addition to a stimulating and fun working environment, Amazon offers mentoring programs with experienced engineers, regular tech talks with technology Principals, and well-defined career paths for motivated engineers who want to contribute to our culture of operational excellence and customer-focused technical innovation.
Responsibilities
• Provide critical support, incident response, and management to internal customers across all of Amazon including management of communications and coordination of service owners via conference calls
• Design, build, and enhance incident detection and management tools
• Be a technology evangelist and use your deep knowledge to solve business problems
• Reduce mean time to resolution for all incident types
• Participate in Agile sprints to evolve business processes and technologies
• Get there first; be the first to detect and diagnose high-severity service-impacting events
• Identify and troubleshoot recurring platform issues and engage service owners to assist with resolution
• Automate tasks through creation and maintenance of scripts and tools
• Respond to and complete customer requests within SLA via a trouble ticketing system
• Take part in a "follow the sun" rotation split between Seattle, Dublin and Sydney sites, including weekends and holidays
• Create and review documentation, design new standard operating procedures
• Mentor peers in your areas of technical and operational strength
• Participate in the interviewing process
BASIC QUALIFICATIONS
• 3+ years relevant work experience
• Experience with server hardware management across multiple vendors
• Experience in automation via shell scripting ,Perl, Python, Ruby or other DevOps oriented language
• Knowledge of standard internet protocols (Ethernet, ARP, IP, ICMP, UDP, TCP, SSL, DNS, HTTP, etc.)
• Excellent English language written and verbal communication skills to facilitate efficient and effective interaction with peers and customers
PREFERRED QUALIFICATIONS
• 5+ years relevant experience in a large-scale online technical operations environment
• Third level qualification in Computer Science or other technical degree or a related field
• Experience with Linux, network troubleshooting and administration
• Experience with systems management and monitoring software (home-grown or commercially available)
• Previous experience with network automation (e.g. automated provisioning and remote configuration of switches and routers; flow-based analysis and predictive modeling of traffic in dynamic routing environments.)
• Experience with performance testing and tuning
• Computer Science fundamentals in object-oriented design, data structures and algorithm design.
• Proven ability to work effectively in a cross-functional team
• Experience with scalable distributed systems and service oriented architecture
Back to top