Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
Thought Machine

Site Reliability Engineer

Singapore

Thought Machine's mission is bold - to properly and permanently rid the world's banks of legacy technology. To achieve this, we have developed the foundations of modern banking through core and payments technology which run natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.

We have grown rapidly in the past few years - growing our team to more than 550 individuals across offices in London, New York, Singapore and Sydney. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Temasek, Standard Chartered Ventures, Molten Ventures, Eurazeo, Intesa Sanpaolo, Nyca Partners, JPMorgan Chase Strategic Investments, and more.

Want more jobs like this?

Get Software Engineering jobs in Singapore delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

We have created a culture enabling our team to produce the best work in the industry, ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the most generous employee share package in the industry. We've been named in the IDC list of top 100 fintechs, and the Singapore HR Awards awarded us Gold and Silver for our workplace culture and employee experience.

We are spinning up a new regional SaaS platform team responsible for providing a world-class SaaS offering, by continuously improving and maintaining our SaaS platform. The team will be geographically distributed across our two main hubs: UK, SG.

Joining this team is an excellent opportunity to get exposure to how mission-critical systems are run in production. You will be part of a team that owns the system end-to-end and have a deeper understanding of exactly how our clients use the system (for example by extracting usage analytics).

The team will own the platform end-to-end, making use of existing infrastructure, improving core Terraform modules, as well as developing operators, tooling and additional infrastructure where appropriate. They will also be responsible for L2 support (for client-initiated support requests) and L1 (for alerting-based incidents). Support will be provided during working hours, with a follow-the-sun model and handovers happening between the 3 regions.

Definition and development of the SaaS roadmap is another critical responsibility of this team. Alongside the Product Management function, they will define technical requirements, features and implement them with the goal of offering an excellent SaaS experience to our clients.

Duties

  • Provision SaaS environments as new clients are onboarded.
  • Be part of the on-call rota (during business hours), responsible for resolving alerts generated by proactive monitoring and working closely with CANs to provide L2 support for client-initiated support requests.
  • Define and implement the feature roadmap to improve the SaaS platform, for example by implementing self-service functionality, exposing metrics to clients, improving automation and self-healing properties of the system.
  • Improving the scalability, security and performance of the SaaS platform, by implementing automated compliance and controls, testing different Kafka and DB setups (e.g. Aurora vs RDS) and running load tests at every level of the stack.
  • Implementing and regularly testing DR strategies to ensure the highest level of resilience and fault tolerance of the platform.

Requirements

Essential

  • Strong background in Linux/Unix administration, e.g. Ubuntu, Debian
  • A strong background in at least one of Go, Python or Java
  • A strong background in one of the following: database administration, Kafka, observability tools (such as Prometheus or Zipkin) or infrastructure automation.
  • Experience with AWS or GCP is essential
  • Experience or knowledge of container orchestration tools, e.g. Kubernetes

Desirable

  • Experience in supporting production systems
  • Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible

Benefits

  • Highly competitive salary
  • Bonus incentive
  • Healthcare
  • 25 days holiday and public holidays
  • Competitive maternity and paternity leave
  • $1,500 SGD per year flexible spend benefit
  • All the latest tech you need
  • A talented and experienced team as your colleagues
  • An environment where we encourage learning and progress

Thought Machine is committed to making a measurable positive impact on people's everyday lives. We are an equal-opportunity employer and value diversity at our company. We actively hire candidates who demonstrate technical excellence in their field and welcome people of all ages and backgrounds, providing everyone with equal access to professional development. You are encouraged to apply even if your experience doesn't accurately match the job description.

Client-provided location(s): Singapore
Job ID: Thought_Machine-277
Employment Type: Other

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
  • Work Flexibility

    • Flexible Work Hours
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Casual Dress
    • Snacks
    • Some Meals Provided
  • Vacation and Time Off

    • Personal/Sick Days
    • Paid Vacation
    • Paid Holidays
  • Financial and Retirement

    • Company Equity
    • 401(K) With Company Matching
  • Professional Development

    • Access to Online Courses
    • Lunch and Learns
  • Diversity and Inclusion

    • Diversity, Equity, and Inclusion Program