Sr. Data Engineer - Data Platform

3+ months agoNew York, NY / Remote

Who we are:

Founded in 2012, Socure is the leader in high-assurance digital identity verification technology.  Named to Forbes’ 2019 AI 50 list as one of America’s most promising AI companies, and a recent winner of API World’s Best Data API, Socure’s technology applies artificial intelligence and machine learning techniques with trusted online/offline data intelligence from email, address, phone, IP, social media and the broader Internet to verify identities in real-time. Customers include three of the top five U.S. banks, seven of the top 10 U.S. card issuers, as well as the majority of leading digital banks, lenders and insurers across the U.S. We are funded by some of the world's best investors and entrepreneurs including Scale Venture Partners, Commerce Ventures, Work-Bench, Santander InnoVentures and Two Sigma Ventures.

Our trophy case includes numerous industry awards and accolades, including being named one of Forbes America’s Best Startup Employers 2021 as well as the Best New Technology Introduced over the Last 12 months – Data and Data Services at the 2020 American Financial Technology Awards (AFTAs), being ranked #70 on Deloitte’s Technology Fast 500™, getting listed as a Gartner Cool Vendor, and winning Finovate’s Award for Best Use of AI/ML, to name a few!

The only way we can further our mission of becoming the single, trusted source of identity verification and eliminating identity fraud is by building the best team on the planet. This is where you come in!

What the role is:

Socure is looking for a Senior Data Engineer to join our US engineering team and build our data platform and core data pipelines. In our mission to become the single, trusted source of identity verification and eliminate identity fraud from the internet, data is at the core of what we build. It’s how we innovate and how we offer the most accurate Identity Verification on the market. With the company growing very fast and our customer needs even faster, the only way for us to succeed in our mission is to significantly scale how we work with data. 

We are in the early days of designing a data platform to accelerate all our data operations and unlock the creation of our future products, and we’d love you to join us and lead the way!

What you'll do: 

  • Work in close collaboration with our Engineering, Data Science, Infrastructure and Product teams to design and deliver core parts of our data platform
  • Apply cutting-edge big data and cloud native technologies to build highly reliable and scalable data pipelines to deliver data insight with high quality and low latency
  • Own the end-to-end delivery of projects related to our data platform initiative, from conception and design to development and production monitoring
  • Enable our Data Scientists to perfect our products and expand our offering and offer easy and secure access to data for engineering teams to deliver faster
  • Democratize access to data and aim to automate operations of large amounts of sensitive data efficiently, securely and reliably

What you'll bring:

  • Comfortable working cross-functionally to ensure technical alignment
  • Ability to think at scale and design, develop and operate production data stores, pipelines and services that meet goals of low latency, high availability, resiliency, security and quality
  • Empathy for people and how they use your work
  • Experience in architecting and building data solutions in modern cloud environment (AWS, GCP, etc.)
  • Hands-on experience with tools such as Hadoop, Hive, Spark, Presto, Snowflake, or Airflow, etc.
  • Hands-on experience in designing data lake solutions with deep understanding of query/storage/cost optimization techniques 
  • Proficiency in one or more of the programming languages such as Java, Scala, or Python. Proficient in SQL and data modeling
  • 5+ years of practical experience in building high scale, production distributed systems or data applications

Nice to have:

  • Experience in data governance and compliance (GDPR, CCPA or PII)
  • Experience in NoSQL solutions such as Redis, Cassandra, DynamoDB or Elasticsearch
  • Experience in building Real-time pipelines, and familiarity with Kafka, Flink or Spark Streaming
  • Experience in the modern CI/CD process in delivering platform features
  • Prior experience in identity fraud detection or entity resolution

Perks & Benefits: 

  • Competitive base salary
  • Equity - every employee is a stakeholder in our upside
  • Medical, dental and vision benefits for employees and their dependents 
  • Parental leave and fertility support
  • Flexible PTO
  • 401K with company match
  • Stipend to supply your home office
  • Annual professional development stipend

A Message on COVID-19:

Socure's number one priority is to safeguard the health and well-being of our team members, our families and our communities. During this unprecedented time, we are closely monitoring COVID-19 developments and updating our response plan quarterly. We are regularly soliciting feedback from our employees to help inform our return-to-office strategy. For our team members who loved going into the office, we are looking forward to meeting once again! But until then, we are striving to ensure that Socureans have the resources and support they need to excel from home. This includes a work-from-home stipend so you can build your home office and fun, virtual events so you can continue to feel connected to your coworkers.

We are an equal opportunity employer and value diversity of all kinds at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Job ID: 4176414003