Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
State Street

Staff Data Engineer

Atlanta, GA

Who we are looking for

The State Street Security Architecture, Analytics & Fusion Engineering (SA2FE) team is looking for a Staff Data Engineer, Integrations Lead . The Fusion Analytics and Data Engineering team delivers models, insights, and tooling to help Cybersecurity teams make faster, more informed decisions as we work to secure State Street's digital footprint. As a Data/Analytics Engineer, you will develop the data flows, analytics pipelines, and production machine-learning systems -- in collaboration with data product managers, architects, engineers, and other team members -- to create analytics & ML-driven data products that support our mission to build predictive models and intelligent systems that help secure State Street's information and infrastructure. This is a unique greenfield project and we are looking for an experienced technical leader with broad experience in architecture, operations, and engineering to help design and deliver a model platform, and lead and mentor a growing team of cybersecurity data professionals.

Want more jobs like this?

Get Data and Analytics jobs delivered to your inbox every week.

Select a location
By signing up, you agree to our Terms of Service & Privacy Policy.

Due to the role requirements this job needs to be performed primarily in the office with some flex work opportunities available.

What you will be responsible for

As a Staff Cyber Data Engineer, Integrations Lead you will

  • Design and build global distributed petabyte scale data-mesh systems for high availability, high throughput, data consistency, security, and privacy, defining our next generation of security data analytics tooling.
  • Use your understanding of large scale data processing and analytics to wrangle our unique cybersecurity data and create analyses and tools that point to the most significant business, governance, and risk management impacts.
  • Design and build petabyte scale systems for high availability, high throughput, data consistency, security, and end user privacy, defining our next generation of data analytics tooling
  • Build data modeling and ELT workflows to produce Raw, Rationalized, co-Related, and Reporting data flows for graph, timeseries, structured, and semi-structured cybersecurity data
  • Work alongside the global cybersecurity architecture & engineering leadership to develop and deliver capabilities to support Cyber Data Science initiatives both internally and in partnership with detection and response teams, and governance and risk management teams across our CISO organization.
  • Mentor and train engineers and architects, and build out a Data Mesh, Lakehouse, Kappa Streaming and Data Security Architecture practice.

What we value

These skills will help you succeed in this role

  • 10+ years of experience with Python, Java, or similar languages, with cloud infrastructure (e.g. AWS, GCP, Azure), and deep experience working with big data processing infrastructures and ELT orchestration
  • Experience developing distributed batch and real-time feature stores, and developing coordinated batch, streaming and online model execution workflows, building and optimizing large scale data processing jobs in Spark, GraphX/GraphFrames, Spark Structured Streaming, as well as scaling graph and time-series native operations.
  • Experience with designing for data lineage, federation, governance, compliance, security, and privacy - hands on experience with commercial DataSecOps platforms like Immuta, Satori and/or experience building custom access control (RBAC/ABAC), data masking, tokenization, and FPE systems for cloud data lake environments. Experience with globally distributed federated data systems is highly desirable.
  • Experience with data quality monitoring and with building continuous data pipelines and implementing history and time-travel using modern data lake storage layers like Delta Lake, Iceberg, and LakeFS
  • Experience with MLOps and iterative cycles of end-to-end development, MRM coordination, deployment, and monitoring of production grade ML models in a regulated high-growth tech environment5+ years of experience with Python, Java, or similar languages, with cloud infrastructure (e.g. AWS, GCP, Azure), and deep experience working with big data processing infrastructures and ELT orchestration

Education & Preferred Qualifications

  • B.S., M.S., or PhD. in Computer Science or equivalent work experience
  • 8+ years of experience building large scale distributed systems and data analytics processes on cloud native, in-memory, and fit-for-purpose hybrid infrastructure. Experience with cybersecurity data and globally distributed log & event processing systems with data mesh and data federation as the architectural core is highly desirable.
  • Experience in big data technologies like Presto/Trino, Spark & Flink, Airflow & Prefect, RedPanda & Kafka, Iceberg & Delta Lake, Snowflake & Databricks, MemGraph & Neo4J as well as modern security tooling like Splunk, Panther, Datadog, Elastic, Arcsight etc.
  • Experience designing and building data warehouse, data lake or lake house using batch, streaming, lambda and data mesh solutions and with improving efficiency, scalability, and stability of system resources.
  • Experience working with data warehouses or Databases like Snowflake, Redshift, Postgres, Cassandra etc
  • Experience writing and optimizing complex SQL and ETL development and designing and building data warehouse, data lake or lake house solutions. Experience building Data APIs and integrations using tools like GraphQL, Apache Arrow, gRPC, ProtoBuf, designing large scale stream processing systems with Flink, Kafka, NiFI, and similar technologies.
  • Experience with distributed systems and distributed data storage and large-scale data warehousing solutions, like BigQuery, Athena, Snowflake, Redshift, Presto, etc.
  • Experience working with large datasets and best in class data processing technologies for both stream and batch processing, graph and time series data, notebooks and analytic visualization environments.
  • Strong communication and collaboration skills particularly across teams or with functions like data scientists or business analyst.

Are you the right candidate? Yes!

We truly believe in the power that comes from the diverse backgrounds and experiences our employees bring with them. Although each vacancy details what we are looking for, we don't necessarily need you to fulfil all of them when applying. If you like change and innovation, seek to see the bigger picture, make data driven decisions and are a good team player, you could be a great fit.

Why this role is important to us

Our technology function, Global Technology Services (GTS), is vital to State Street and is the key enabler for our business to deliver data and insights to our clients. We're driving the company's digital transformation and expanding business capabilities using industry best practices and advanced technologies such as cloud, artificial intelligence and robotics process automation.

We offer a collaborative environment where technology skills and innovation are valued in a global organization. We're looking for top technical talent to join our team and deliver creative technology solutions that help us become an end-to-end, next-generation financial services company.

Join us if you want to grow your technical skills, solve real problems and make your mark on our industry.

About State Street

What we do. State Street is one of the largest custodian banks, asset managers and asset intelligence companies in the world. From technology to product innovation, we're making our mark on the financial services industry. For more than two centuries, we've been helping our clients safeguard and steward the investments of millions of people. We provide investment servicing, data & analytics, investment research & trading and investment management to institutional clients.

Work, Live and Grow. We make all efforts to create a great work environment. Our benefits packages are competitive and comprehensive. Details vary by location, but you may expect generous medical care, insurance and savings plans, among other perks. You'll have access to flexible Work Programs to help you match your needs. And our wealth of development programs and educational support will help you reach your full potential.

Inclusion, Diversity and Social Responsibility. We truly believe our employees' diverse backgrounds, experiences and perspectives are a powerful contributor to creating an inclusive environment where everyone can thrive and reach their maximum potential while adding value to both our organization and our clients. We warmly welcome candidates of diverse origin, background, ability, age, sexual orientation, gender identity and personality. Another fundamental value at State Street is active engagement with our communities around the world, both as a partner and a leader. You will have tools to help balance your professional and personal life, paid volunteer days, matching gift programs and access to employee networks that help you stay connected to what matters to you.

State Street is an equal opportunity and affirmative action employer.

Salary Range:
$140,000 - $222,500 Annual

The range quoted above applies to the role in the primary location specified. If the candidate would ultimately work outside of the primary location above, the applicable range could differ.

Client-provided location(s): Atlanta, GA, USA; Boston, MA, USA; Austin, TX, USA; Princeton, NJ, USA; Jersey City, NJ, USA; Berwyn, PA, USA
Job ID: StateStreet-R-750189
Employment Type: Full Time