Data Scientist - Research Informatics

Job Title: Data Scientist Research Informatics

Requisition #: 9555


TheData Scientistwill be inquisitive in his/her approach to figuring out the right biomedical data set that needs to be mined from multiple clinical and clinical research systems at City of Hope (COH). Resource should be comfortable in asking questions about the data source, figure out the business rules around how it was put together, and look forward to aid scientists in their biomedical data identification and retrieval needs:

  • Will use data mining, statistics, statistical modeling and machine learning to understand relationships and patterns in data. The insights gathered from this analysis are translated into actionable steps and best practices
  • Responsible for developing tools to enable visualization to translate analytics into information that can be used by clinical and operational leadership
  • Will take a data driven approach to decision making in projects.

The Data Scientist with Clinical Research Focus position requires exceptional communication skills, a collaborative approach and attitude of service with a sense of urgency. The candidate must be self-driven and eager to learn new methods of data munging, data visualization and data analysis.

Main Responsibilities Includes:

  • Integrate multiple systems and perform analytics on large data sets
  • Perform explanatory data analyses, generate and test working hypotheses, prepare and analyze historical data, identify patterns and correlations among various data points and discover new insights
  • Perform machine learning, natural language, and statistical analysis methods
  • Meet with researchers to explain data, data analysis, insights and relationships within data
  • Analyze and interpret research data, extract scientific relevance and formulate
  • Produce reports from research data to meet end-user needs
  • Collaborate and develop working relationships with clinical informatics, bioinformatics and other computational groups at City of Hope
  • Collaborate with other database and software developers.

Minimum Education and Experience:

  • Masters degree in statistics, mathematics, informatics, machine learning, computer science, epidemiology or a related field
  • 5-8 years of experience with data analytics
  • 3+ years of applicable experience developing and successfully implementing statistical models for data mining, predictive risk modeling, clustering, and classification
  • 2+ years of experience with decision trees
  • Experience in using analytics to generate and validate insights from large data sets
  • Experience with relational databases and SQL queries

Other Experience or Certification (Preferred):

  • Ph.D. degree preferred
  • Experience in a health care environment
  • Experience in health plans/payer environment
  • EHR experience
  • 3+ years of applicable experience developing and successfully implementing statistical models for data mining, predictive (risk) modeling, clustering, and classification
  • 2+ years of experience with decision trees

Skills and Abilities:

  • A doctorate degree in relevant discipline (e.g., medicine, basic sciences, clinical sciences, Health Sciences, applied statistics, data mining, machine learning, etc.)
  • Deep familiarity with clinical research (clinical trials), clinical or healthcare domain is required
  • Five years of experience in working with biomedical datasets in cancer research environment
  • A good understanding of statistical and predictive modeling concepts, machine-learning approaches, clustering and classification techniques and stratification analysis.
  • Proficiency in SQL, MySQL and/or PostgreSQL.
  • Expertise in one of the programing languages
  • Demonstrated understanding of machine learning and predictive analytics techniques
  • Ability to mine large datasets and analyze large, complex, multi-dimensional datasets
  • Must be knowledgeable in at least one statistical and analytic packages such as R, MATLAB, SAS, SPSS or Weka
  • Proficiency in at least one scripting language (e.g. Python, Perl)
  • Ability to exercise independence and use creative approaches to problem-solving.
  • Experience in data visualization of biomedical datasets.
  • Comfortable communicating complex technical subjects to non-technical audiences using practical examples, and prototypes


City of Hope, an innovative biomedical research, treatment and educational institution with over 4000 employees, is dedicated to the prevention and cure of cancer and other life-threatening diseases and guided by a compassionate, patient-centered philosophy.

Founded in 1913 and headquartered in Duarte, California, City of Hope is a remarkable non-profit institution, where compassion and advanced care go hand-in-hand with excellence in clinical and scientific research. City of Hope is a National Cancer Institute designated Comprehensive Cancer Center and a founding member of the National Comprehensive Cancer Network, an alliance of the nation's 20 leading cancer centers that develops and institutes standards of care for cancer treatment.


  • One of only 47comprehensive cancer centers, the highest designation bestowed by the National Cancer Institute
  • Ranked as one of America's Best Hospitals in cancer by US News & World Report
  • We value workplace diversity and are committed to the training and development of our employees
  • We offer a comprehensive benefit and total rewards package

City of Hope is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, sex, sexual orientation, gender identity, age, status as a protected veteran, or status as a qualified individual with disability. #LI-DA1 | *CB-DA

Meet Some of City of Hope's Employees

Libby F.

Sr Prospect Research Analyst

Working within the Foundation and Relations Department, Libby prepares and provides frontline fundraisers with prospective donor profiles—financing City of Hope’s outstanding patient care and clinical research.

Sharee D.

Organization Development Training & Data Analyst

Sharee assesses data and devises the most effective means of achieving City of Hope’s virtuous health care goals. She explores all options available to assure the very best possible outcomes for patients in need.

Back to top