Senior Site Reliability Engineer

SoundHound is creating and productizing transformative technologies that improve life. Join us.

After amassing a global user community well over 300 million strong, a broad portfolio of core technologies and award-winning products ... we're just getting started.

At SoundHound we value creativity, innovation, hard work, open communication and fast iteration, which allow us to act on valuable feedback from employees and users alike. SoundHound's culture is one of the impromptu coffee breaks, less-impromptu fitness sessions, group lunches, and weekly happy hours.

We offer a competitive salary, SoundHound stock options, unique camaraderie, catered lunches, and the opportunity to call a company home that's simultaneously changing the way we discover music AND interact with machines. Yep, it's that cool. 

Responsibilities:

  • SRE with a focus on system architecture, production automation, software engineering, monitoring systems and operations

  • Experience with multi-cloud PaaS platforms such as GCP, AWS, Azure

  • Automate and Design a framework for deployment, customization, upgrades, and monitoring through CI/CD tools (Ansible, Terraform, Consul, Vault, Jenkins, Kubernetes, Docker, etc)

  • Ensure application and infrastructure security are in compliance with ISO 27K / SOC

  • Develop, maintain and enforce SLI/SLO/SLA to ensure service and infrastructure availability

  • Experience with monitoring and alerting tools such as Prometheus, Grafana, Sensu, Kafka, ELK stack, and PagerDuty

  • Collaborate with engineering, quality engineering and product management to architect and build systems / features that are highly available, reliable and secure

Minimum qualifications:

  • BS or MS degree in Computer Science, Information Systems, or equivalent experience

  • 5+ years supporting public and private cloud services in a high-volume customer-facing environment

  • Skilled in public and private cloud security best practices. Penetration testing, vulnerability scanning, source code analysis, social engineering

  • Demonstrable knowledge of TCP/IP, Linux operating system internals, filesystems, disk/storage technologies

  • Strong network and security experience including L2/L3 protocols, firewalls, pen testing, network assessment, DHCP, DNS, IPV4 and IPV6, VPN, SSH protocols, SSL/TLS certificate management

  • Comfortable working with wide range of relevant server-side technologies such as Hadoop, Spark, MongoDB, Postgres, Mysql, Redis, ELK, Riak, etc

  • Ability to program in one or more languages: Python, Go, familiarity with the JVM as well as Bash scripting

  • Be able to participate in on-call rotation

Preferred qualifications:

  • Understanding of GCP tools and dev stack: App/Compute/Kubernetes Engine. Storage: Bigtable, Spanner. Networking: VPN, Load Balancer, Cloud DNS, Cloud NAT, Cloud router, Cloud Armor. StackDriver, Tools and Big Data stack

  • Knowledge of relational and non-relational databases such as MySQL, Postgres, Redis, Memcached, and MongoDB

  • Strong general technical, analytical, troubleshooting, automate routine tasks and problem-solving abilities

  • Ability to work independently with minimal supervision

  • Excellent verbal and written communication skills


Back to top