Watson Health - Data Engineer

Job Description
The IBM's Watson Health business unit is now looking for talented individuals ready to usher in the next era of healthcare. We live in a moment of remarkable change and opportunity. The convergence of data and technology is transforming healthcare and life sciences organizations in every way.

Position: Data Engineer
Location: Cambridge, MA (Boston) or Raleigh, NC

Job Description:
Are you passionate about DATA and want to work with Data Scientist to solve real-world problems? The AI Data Curation team, develops and operates a data platform to ensure the utmost quality of our datasets that are used for training the next generation of AI solutions to enhance clinical decision making. We're looking for a senior data engineer to help design, develop, and manage elastic and highly-available data platform for the large volume of medical datasets.

Essential Responsibilities:

  • Design and develop scalable data infrastructure for a large volume of medical data, structured and unstructured from various sources in batch mode and near real-time.
  • Develop data ingestion, integration, transformation pipelines and manage data warehouses.
  • Work with Data Scientist to develop tools/automation and build analytics capabilities to expedite the use of data for AI training purposes.
  • Develop tools to analyze large data sets and perform data verification, data integrity and quality check.
  • Responsibilities will vary from developing and maintaining new data sets, data warehouse development and generating reports on data usage, access controls and performing data integrity checks.
  • All team members are expected to contribute broadly, with an agile and growth mindset.

Required Professional and Technical Expertise:
  • Undergraduate degree in Computer Science or related field of study for software development.
  • 8+ years of experience in building data platform, data engineering, or software engineering.
  • Software engineer skills in one more language (Java, Python), data manipulation (SQL).
  • Experience in designing and building efficient and scalable solutions for big data.
  • 3+ experience working with agile methodologies and cloud technologies.
  • Experience working with medical data and knowledge of any one or more of the following: HL7, FHIR, DICOM, PACs, VNA, EMR, Epic, Cerner, data anonymization
  • Strong communication, negotiation and consensus building skills when dealing with stakeholders and team members.

Preferred Professional and Technical Expertise:
  • Have developed statistical models, machine learning algorithms


Required Technical and Professional Expertise

  • Undergraduate degree in Computer Science or related field of study for software development.
  • 8+ years of experience in building data platform, data engineering, or software engineering.
  • Software engineer skills in one more language (Java, Python), data manipulation (SQL).
  • Experience in designing and building efficient and scalable solutions for big data.
  • 3+ experience working with agile methodologies and cloud technologies.
  • Experience working with medical data and knowledge of any one or more of the following: HL7, FHIR, DICOM, PACs, VNA, EMR, Epic, Cerner, data anonymization
  • Strong communication, negotiation and consensus building skills when dealing with stakeholders and team members.


Preferred Tech and Prof Experience

  • Have developed statistical models, machine learning algorithms


EO Statement
IBM is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.


Back to top