Software Development Engineer

1 month agoNew York, NY


Let's start with you, not us. As a Software Development Engineer on the Consumer Reliability Engineering (CRE) Red Team, you will build software to automate chaos experiments at all levels of granularity. You will build software that produces Prime Day/Cyber Monday scale traffic simulations for the global Amazon websites ahead of large scale events. If you've ever wanted to shake the tree and see what falls out of Amazon-scale software systems, this is your opportunity to do so. The learnings from your software and chaos experiments will fundamentally change how Amazon measures the resilience posture of its retail websites, and will ultimately drive the improvement of the software owned and operated by literally thousands of developers. When you find something groundbreaking, it will even inform our AWS partners.

You will be responsible for delivering projects end-to-end and jumping into problem spaces where the right answer is rarely clear from the start. You will personally deliver results, while also coaching your peers to design and deliver against time sensitive targets. You will be comfortable developing systems that push the limits of AWS, serverless concurrency, and control planes responsible for hundreds of thousands of compute resources. At the end of the day, the systems you create will provide the closest empirical approximation of how resilient Amazon's complex service graph is. This informs critical business decisions year round. You'll be directly helping build resilience against anomalous conditions created by software and process failures, unexpected customer behavior, and even attackers.

We're particularly excited about candidates with a proven track record of shipping software solutions that solve complex problems. The simpler your software, the better. We want you to solve complex problems, preferably with simple and eloquent solutions. You will have the ability to dive deep into a wide variety of problems and technologies to guide the right technical decisions for the products and the businesses you will support. You will bring an optimistic attitude towards owning and operating solutions of scale. You will be a strong communicator and appreciate a well written and concise document.

You've read this far we'd love to tell you more about us. The Consumer Reliability Engineering (CRE) Red Team drives programs and software initiatives to safely probe the known and unknown failure modes of Amazon; and we do it in production. We're a chaos engineering team at heart (we're not an InfoSec Red Team), and we're fundamentally reshaping how Amazon maintains an always-ready resilience posture. Our software explores and measures the resilience of software applications, and localizes externally facing problems before customers do it for us.

We love fast-paced, startup-like environments where being first to market is the name of the game. We don't know exactly what we'll find when we run these chaos experiments, so we place a heavy emphasis on rapid prototypes and agility. We deal with high levels of ambiguity and convert head tilting ideas into streamlined workflows that impact hundreds of teams. We're building multiple systems from the ground up that focus on chaos engineering at scale (think 800k+ vCPUs) and safety mechanisms that help us prevent disasters in production environments. Our work is intentionally provocative and centered around challenging the status quo inside of Amazon. We find this work exciting, rewarding, and plentiful, and we hope you will too. Our leaders care personally and we're excited to invest in you and your passions as an individual and as a software engineer.

We have a formal mentor search application that lets you find a mentor that works best for you based on location, job family, job level etc. Your manager can also help you find a mentor or two, because two is better than one. In addition to formal mentors, we work and train together so that we are always learning from one another, and we celebrate and support the career progression of our team members.

Here at Amazon, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and we host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon's culture of inclusion is reinforced within our 14 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.

Work-Life Balance
Our team also puts a high value on work-life balance. Striking a healthy balance between your personal and professional life is crucial to your happiness and success here, which is why we aren't focused on how many hours you spend at work or online. Instead, we're happy to offer a flexible schedule so you can have a more productive and well-balanced lifeboth in and outside of work.


  • Programming experience with at least one modern language such as Java, C++, or C# including object-oriented design
  • 1+ years of experience contributing to the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems.
  • 2+ years of non-internship professional software development experience


  • Experience building systems that provide solutions to complicated, multifaceted business problems
  • Experience working with AWS offerings. We use a lot of the following AWS services and we hope you find using or learning about them exciting: ECS, EC2, Lambda, Fargate, IAM, S3, DynamoDB, X-Ray, Step Functions, Amplify, Kinesis, CloudFront, CloudWatch, API Gateway, SQS/SNS, CloudFormation and Secrets Manager.
  • While not required, we love to see familiarity with any of the following: chaos engineering, headless browsers, fault injection, distributed tracing, incident response, incident management, distributed systems, or multi-variant monitoring systems
  • Experience deflecting endless design document revisions and encouraging iterative development instead. We thrive on the opinion that working software solving customer problems is the only measurement of progress.
  • Experience influencing software engineers and operators on best practices (full software development life cycle, including coding standards, code reviews, build processes, testing, and operations)
  • Excellent writing skills. This is deeply engrained in Amazon culture and we leverage this to our advantage.
  • We use a lot of Java, React + TypeScript/Javascript, and Node.js, in that order. While not required out of the gates, you'll likely be more comfortable from the start with experience using some combination of these.
By submitting your application here, you can apply once to be considered for multiple Software Engineer openings across various Amazon teams. If you are successful in passing through the initial application review and assessment, you will be asked to submit your career and personal preferences so that our dedicated recruiters can match you to the right role based on these preferences.

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, visit

Client-provided location(s): New York, NY, USA
Job ID: Amazon-1551769

Company Videos

Hear directly from employees about what it's like to work at Amazon.