Software Engineer - Data Engineering

At SoundHound Inc., we believe every brand should have a voice. As the leading innovator of conversational technologies, we’re trusted by top brands around the globe. Houndify, our independent Voice AI platform, with 70,000+ users, allows brands to create custom voice assistants that deliver results with unprecedented speed and accuracy.

Our mission is to enable humans to interact with the things around them in the same way we interact with each other: by speaking naturally. We’re making that a reality through our SoundHound music discovery app and Hound voice assistant and through our strategic partnerships with brands like Mercedes-Benz, Hyundai, Deutsche Telekom, and Pandora. Today, our customized voice AI solutions allow people to talk to phones, cars, smart speakers, mobile apps, coffee machines, and every other part of the emerging ‘voice-first’ world.

Our diverse team of engineers, UX/UI designers, writers, data scientists, and linguists is passionate about creating a world with more conversations. With more than 14 years of expertise in voice technology, we have hundreds of millions of end users and a worldwide team in six countries building solutions for a voice-first world.

About the Role:

We view our data and our engineering team as two of our biggest assets. This role lives at the intersection of the two. We have a huge amount of data from hundreds of millions of users of:

  • SoundHound: music app featuring search, discovery, and play with LiveLyrics
  • Hound: newly released app featuring unprecedented speech recognition and natural language understanding
  • Houndify: platform enabling developers to add a voice-enabled conversational interface to anything

We aspire to leverage this data to make informed decisions that steer product development, marketing, and user engagement. We have only scratched the surface of the kind of advanced analytics and insight generation we'd like to do! This is an opportunity to work on interesting data engineering and data science problems, build large-scale distributed machine learning systems from the ground up, and use cutting-edge Big Data technologies like Spark, Kafka, HBase, and Hive.

Responsibilities:

  • Design and implement data pipelines empowering real-time insights
  • Leverage massive datasets for modeling, recommendations, and reporting solutions
  • Build scalable, user-facing systems powering ad targeting, push and/or in-app
  • Drive the framework for A/B tests, exposing the results through visualization tools such as Tableau

Requirements:

  • Strong coding skills, preferably in Java, Scala, or Python
  • Hands-on experience with large-scale Big Data environments (Spark, Kafka, Hive, Hadoop)
  • Ability to handle multiple competing priorities in a fast-paced environment
  • BS/MS in Computer Science or equivalent

Nice to Haves:

  • Familiarity with NoSQL stores including HBase/Cassandra, Redis, Riak, and/or Mongo
  • Familiarity with data modeling, machine learning, and frameworks such as Spark MLlib
  • Experience with analytical tools supporting data analysis and reporting (e.g., Tableau)
