Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Sr. Software Engineer, Product SRE

1 week ago Flexible / Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed - we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We're also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We're always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About the Role:

As a Senior Engineer (Typically equivalent to Staff or Sr Staff titles in other companies) in our Embedded Reliability team, you'll work directly within CrowdStrike product groups alongside product engineers and their leadership. You'll partner with engineering leaders to shape reliability roadmaps while doing hands-on work solving complex distributed systems problems at scale. This is hands-on systems engineering work focused on writing code, building foundational infrastructure, and solving complex problems rather than day-to-day operations or ticket management. While we embrace the SRE moniker, you'll find that it means something much more service-oriented at Crowdstrike, and affords you no shortage of Golang development initiatives, as well as the freedom to move up/down and laterally across the stack as & when needed. It is far and away our most self-driven & autonomous backend development role.

CrowdStrike Falcon processes trillions of events per day. You'll work on the critical production systems that power this platform by improving, rearchitecting, and scaling them to meet growing demands. You'll write production code, debug complex distributed systems issues, and tackle problems spanning scale and resiliency, performance engineering, foundational observability and instrumentation, cost optimization, and failure modeling.

Product engineers and engineering leaders will come to you for guidance on architectural decisions because you've earned credibility through hands-on work and delivering results. You'll ensure follow through on incident retrospectives with concrete improvements that eliminate entire classes of failures. You'll identify opportunities to extract common patterns into shared libraries or tools, or partner with platform teams on improvements that benefit multiple product groups. Recent examples from the team include resolving critical issues in leader election libraries and building infrastructure-as-code tools that eliminate manual deployment processes.

Why This Role Matters: CrowdStrike Falcon is the industry standard in cloud-native cybersecurity and threat hunting. Our customers depend on us to protect their businesses from sophisticated threats, and reliability isn't optional - it's fundamental to our mission. As an Embedded SRE, your work directly impacts whether organizations around the world can defend themselves against cyberattacks. You'll be working on problems that matter, at a scale that few companies can match, with the autonomy to make real architectural decisions.

What You'll Do:

  • Partner with engineering leadership to define and drive reliability roadmaps
  • Design and implement architectural improvements to services, libraries, and platforms that impact teams across CrowdStrike
  • Establish foundational observability practices: ensure teams instrument services properly, react to signals effectively, and leverage observability to drive automation like continuous delivery
  • Lead performance and cost optimization: profiling, bottleneck analysis, capacity planning, and efficiency improvements across cloud infrastructure
  • Define and implement service-level objectives that drive decision-making and prioritization
  • Conduct resilience engineering: chaos experiments, failure injection, and designing for graceful degradation
  • Provide technical leadership during complex incidents and drive systemic improvements
  • Mentor and coach engineers, building a culture of excellence and driving architectural standards across the organization
What You'll Need:
  • 7-10+ years building and operating distributed systems at scale
  • Expert-level proficiency in at least one programming language; willingness to become proficient in Go
  • Deep understanding of distributed systems: e.g. consensus algorithms, replication, consistency, failure modes, scalability patterns
  • Proven experience scaling backend systems: e.g sharding, partitioning, horizontal scaling, capacity planning, performance optimization
  • Track record of making impactful architectural decisions and seeing them through to production
  • Strong systems thinking and ability to influence without direct authority across organizational boundaries
  • Degree in Computer Science or equivalent experience in data structures/algorithms/distributed systems
Bonus Points:

Want more jobs like this?

Get jobs in Flexible / Remote delivered to your inbox every week.

Job alert subscription
  • Experience driving reliability improvements in organizations with hundreds or thousands of microservices
  • Deep knowledge of Kubernetes, cloud platforms, or other large-scale orchestration systems
  • Experience with AWS, Cassandra, Kafka, OpenSearch, or similar large-scale distributed systems
  • Track record of building internal platforms or tools that other engineers use
  • Experience in infrastructure cost optimization at scale
  • Background in performance engineering: profiling, optimization, understanding system bottlenecks
  • Experience with chaos engineering or resilience testing practices
  • History of establishing SLO/SLI frameworks and error budgets in production environments
  • Background in cybersecurity or intelligence fields
  • Experience building developer platforms or improving developer experience
#LI-MP2

#LI-DG1

#LI-HTF

#HTF
This role will require the candidate to periodically undergo and pass additional background and fingerprint check(s) consistent with government customer requirements.

Benefits of Working at CrowdStrike:
  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays for recharge
  • Paid parental and adoption leaves
  • Professional development opportunities for all employees regardless of level or role
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
  • Vibrant office culture with world class amenities
  • Great Place to Work Certified™ across the globe
CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.

CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.

If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Find out more about your rights as an applicant.

Client-provided location(s): Flexible / Remote, Austin, TX, Sunnyvale, CA, Redmond, WA
Job ID: CrowdStrike-R26949
Employment Type: FULL_TIME
Posted: 2026-02-12T00:01:52

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • FSA With Employer Contribution
    • HSA
    • HSA With Employer Contribution
    • FSA
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
  • Work Flexibility

    • Office Life and Perks

      • Vacation and Time Off

        • Paid Vacation
        • Paid Holidays
        • Personal/Sick Days
      • Financial and Retirement

        • 401(K)
        • Company Equity
        • Stock Purchase Program
        • Performance Bonus
      • Professional Development

        • Promote From Within
        • Mentor Program
        • Shadowing Opportunities
        • Access to Online Courses
        • Lunch and Learns
      • Diversity and Inclusion