Every time you pay for an Uber ride or book on Airbnb, you’re using our product. Braintree lets you move money from one place to another safely and securely. It sounds complex (and it is), but we make it so simple, you wouldn't know we were there.
At Braintree, opportunities to shine happen daily. We value what makes you different, and encourage you to act on your ideas -- no matter how pie-in-the-sky. You bring skills and a customer-first mentality, and we'll bring the tools and environment you need to do the best work of your life.
About the Role:
Site Reliability Engineering keeps our lights on, keeps our technology humming and is essential to our ongoing evolution and growth. These engineers use their knowledge of technology and operational best practices to deliver an experience that drives available, scalable and reliable customer experiences.
These folks help create, develop and manage the deployment architecture for the application, develop the monitoring architecture and implement monitoring agents, dashboards, escalations and alerts. These technical investigators create playbooks and stakeholder communication mechanisms, oversee change management and configuration management to bring improvements and efficiencies across Braintree.
As a member of the team, you’ll face interesting and challenging problems as we take our software to the next level of scale and sustainability. You’ll be the tactical “feet on the streets” as we watch and monitor what’s happening in the business, day-to-day.
-Evaluate and track application performance, metrics, and availability
-Monitor data center hardware, networking, and software platforms
-Build and maintain alerting tools and metrics
-React to system inefficiencies and resolve issues quickly to ensure system availability and performance
-Coordinate engineering, customer support, and external communications
-BS in Computer Science or related field
-Knowledge of Linux internals and command-line tools
-Familiarity with basic networking concepts - TCP, UDP, ping, traceroute, and the Linux network stack
-Troubleshooting experience tracking down performance, load, networking, I/O utilization, and memory problems
-Experience with monitoring and metrics tools - Nagios, Graphite, New Relic, etc.
Open Dev Day — two days a month our engineers work on projects that interest them to improve their craft
Product conferences — attend two conferences … all on the Braintree dime
Daily catered lunches — salad bar and entree buffet. Yum!
Tuition reimbursement — we take education and skills development seriously
Tracking-Free Vacations — employees self track their vacation to ensure you get the time off you need
Back to top