Principal Application Support Engineer (SRE Lead)
Job Description
Are you ready to make an impact at DTCC?
Do you want to work on innovative projects, collaborate with a dynamic and supportive team, and receive investment in your professional development? At DTCC, we are at the forefront of innovation in the financial markets. We are committed to helping our employees grow and succeed. We believe that you have the skills and drive to make a real impact. We foster a thriving internal community and are committed to creating a workplace that looks like the world that we serve.
The Information Technology group delivers secure, reliable technology solutions that enable DTCC to be the trusted infrastructure of the global capital markets. The team delivers high-quality information through activities that include development of essential, building infrastructure capabilities to meet client needs and implementing data standards and governance.
Pay and Benefits:
- Competitive compensation, including base pay and annual incentive
- Comprehensive health and life insurance and well-being benefits, based on location
- Pension / Retirement benefits
- Paid Time Off and Personal/Family Care, and other leaves of absence when needed to support your physical, financial, and emotional well-being.
- DTCC offers a flexible/hybrid model of 3 days onsite and 2 days remote (onsite Tuesdays, Wednesdays and a third day unique to each team or employee).
Want more jobs like this?
Get jobs in Coppell, TX delivered to your inbox every week.

The Impact You Will Have in This Role
The Enterprise Application Support (EAS) organization provides critical application support across the ITP and ECS lines of business, ensuring enterprise platforms are reliable, scalable, and operationally resilient.
As a Principal Application Support Engineer (SRE Lead), you will play a pivotal role in safeguarding the stability and readiness of mission - critical systems. This is a hands - on, senior individual contributor role with deep ownership of production reliability and operational risk.
You will act as a reliability authority across the delivery lifecycle-driving the adoption of Site Reliability Engineering (SRE) best practices, influencing system design through non - functional requirements (NFRs), strengthening observability and resiliency, and leading response during critical production events. Through close partnership with application delivery, infrastructure, security, and risk teams, you will help reduce incidents, improve mean time to restore service (MTTR), and enable confident production releases.
Your Primary Responsibilities:
- Agile & Delivery Engagement
- Participate in planning, design, and sprint zero activities to ensure reliability, observability, resiliency, and operational readiness are embedded early in the SDLC
- Partner with delivery teams to champion non - functional requirements (NFRs) from design through production
- System Reliability & Architecture
- Drive the design and evolution of reliable, resilient, and scalable system architectures
- Influence redundancy, fault tolerance, and disaster recovery strategies
- Provide design recommendations that enable automated recovery and minimize manual intervention
- Develop and maintain application recovery runbooks to improve recovery consistency and reduce downtime
- Monitoring, Alerting & Observability
- Design and implement comprehensive monitoring and observability solutions.
- Define actionable alerts and establish Service Level Indicators (SLIs) and Service Level Objectives (SLOs).
- Proactively identify and mitigate potential issues before they impact users.
- Incident Management & Root Cause Analysis
- Serve as incident commander during critical system outages, coordinating cross - functional response and driving timely resolution.
- Lead post - incident reviews and root cause analyses, ensuring corrective actions prevent recurrence.
- Continuously improve incident response processes and MTTR.
- Automation & Tooling
- Develop and maintain automation to streamline operational tasks, including:
- Self - healing mechanisms
- Application deployments
- Scaling strategies
- Infrastructure and operational workflows
- Development & Cross - Functional Collaboration
- Work closely with development teams to integrate SRE practices into the SDLC.
- Promote reliability, observability, and operational excellence from design through production.
- Collaborate with infrastructure, network, security, Scrum Masters, and internal/external stakeholders.
- Security & Risk Integration
- Partner with security teams to ensure systems are resilient against cyber threats.
- Incorporate security best practices into operational and reliability designs.
- Collaborate with IT Embedded Risk Managers to identify and remediate operational and reliability risks.
- Operational Readiness
- Lead operational readiness reviews with EAS L2 support teams at key project milestones.
- Identify operational risks and gaps; validate NFRs in UAT environments to ensure production readiness.
- Capacity & Performance Management
- Proactively assess capacity needs and plan for future growth.
- Implement scaling strategies to support high - load and peak usage scenarios.
- Analyze performance metrics, identify bottlenecks, and drive performance optimization initiatives.
- Metrics & Continuous Improvement
- Define and track KPIs to demonstrate operational improvements and system reliability.
- Drive continuous improvement through data - driven insights and engineering best practices.
Qualifications
- Minimum of 8-10 years of relevant experience
- Bachelor's degree preferred or equivalent professional experience
Talent Needed for Success
- Proficiency in one or more languages such as Python, Java, Go, or similar, for automation and tooling.
- Strong experience with Linux/Unix systems, networking concepts, and cloud platforms (AWS, Azure, GCP). Mainframe experience is a plus.
- Hands - on experience designing monitoring and alerting solutions using tools such as Splunk, Dynatrace, ITSI, or similar platforms.
- Proven experience leading or participating in high - severity incident response under pressure.
- Strong focus on reliability, resiliency, performance, and operational excellence.
- Ability to influence and partner across engineering, infrastructure, security, and risk teams.
- Supports an environment where individuals are respected and valued for their contributions.
The salary range is indicative for roles at the same level within DTCC across all US locations. Actual salary is determined based on the role, location, individual experience, skills, and other considerations. We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
About Us
With over 50 years of experience, DTCC is the premier post-trade market infrastructure for the global financial services industry. From 20 locations around the world, DTCC, through its subsidiaries, automates, centralizes, and standardizes the processing of financial transactions, mitigating risk, increasing transparency, enhancing performance and driving efficiency for thousands of broker/dealers, custodian banks and asset managers. Industry owned and governed, the firm innovates purposefully, simplifying the complexities of clearing, settlement, asset servicing, transaction processing, trade reporting and data services across asset classes, bringing enhanced resilience and soundness to existing financial markets while advancing the digital asset ecosystem. In 2024, DTCC's subsidiaries processed securities transactions valued at U.S. $3.7 quadrillion and its depository subsidiary provided custody and asset servicing for securities issues from over 150 countries and territories valued at U.S. $99 trillion. DTCC's Global Trade Repository service, through locally registered, licensed, or approved trade repositories, processes more than 25 billion messages annually. To learn more, please visit us at www.dtcc.com or connect with us on LinkedIn , X , YouTube , Facebook and Instagram .
DTCC proudly supports Flexible Work Arrangements favoring openness and gives people freedom to do their jobs well, by encouraging diverse opinions and emphasizing teamwork. When you join our team, you'll have an opportunity to make meaningful contributions at a company that is recognized as a thought leader in both the financial services and technology industries. A DTCC career is more than a good way to earn a living. It's the chance to make a difference at a company that's truly one of a kind.
Learn more about Clearance and Settlement by clicking here .
About the Team
Serves as a dedicated technology resource for advancing DTCC's business opportunities and providing industry thought leadership for leveraging new technology. The goal of this new department is to partner internally with IT, our business and regulatory divisions and externally with clients, regulators, and fintech vendors, to help build new platforms and business models to advance DTCC's mission to support the financial markets.
Perks and Benefits
Health and Wellness
- Health Insurance
- Dental Insurance
- Vision Insurance
- Life Insurance
- Short-Term Disability
- FSA
- HSA With Employer Contribution
- Long-Term Disability
- HSA
- Pet Insurance
- Mental Health Benefits
Parental Benefits
- On-site/Nearby Childcare
- Adoption Assistance Program
- Family Support Resources
- Birth Parent or Maternity Leave
- Non-Birth Parent or Paternity Leave
- Return-to-Work Program
Work Flexibility
- Hybrid Work Opportunities
- Work-From-Home Stipend
Office Life and Perks
- Casual Dress
- Snacks
- On-Site Cafeteria
- Commuter Benefits Program
- Company Outings
- Holiday Events
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Personal/Sick Days
- Leave of Absence
- Volunteer Time Off
Financial and Retirement
- 401(K) With Company Matching
- Performance Bonus
- Financial Counseling
- Pension
Professional Development
- Work Visa Sponsorship
- Leadership Training Program
- Associate or Rotational Training Program
- Tuition Reimbursement
- Learning and Development Stipend
- Promote From Within
- Mentor Program
- Shadowing Opportunities
- Access to Online Courses
- Lunch and Learns
- Internship Program
- Professional Coaching
Diversity and Inclusion
- Diversity, Equity, and Inclusion Program
- Employee Resource Groups (ERG)
- Unconscious Bias Training