Medicare Stars Data Intelligence Data Scientist
At CVS Health, we're building a world of health around every consumer and surrounding ourselves with dedicated colleagues who are passionate about transforming health care.As the nation's leading health solutions company, we reach millions of Americans through our local presence, digital channels and more than 300,000 purpose-driven colleagues - caring for people where, when and how they choose in a way that is uniquely more connected, more convenient and more compassionate. And we do it all with heart, each and every day.Position Summary
Aetna, a CVS Health company, is the nation's premier health innovation company helping people on their path to better health. We are building a new health care model that is easier to use, less expensive, and puts the consumer at the center of their care.
Aetna's Medicare Stars Team is growing and expanding! This is exciting opportunity to join a high performing, collaborative team as a Data Scientist. This individual will focus on leveraging the Stars Data Ecosystem for Business Analytics for improved understanding of business patterns, predict future trends, and implement effective solutions to various challenges. This position will manage and be responsible for the successful delivery of scalable analytics solutions on Snowflake that combine structured and unstructured data, enable agentic document analytics at scale, and expose insights via natural language queries powered by LLMs (including OpenAI). Own end-to-end implementation: ingestion, unified data modeling, vectorization/embeddings, retrieval pipelines, Snowflake compute (Snowpark), and integration with LLM-driven applications. This role will be utilizing their expertise in statistical analysis, machine learning, and data visualization to analyze complex data sets, then derive actionable insights that inform business strategies and decisions. By leveraging data, they help the Stars organization understand patterns, predict future trends, and implement effective solutions to various challenges.
Key Responsibilities
- Design and implement unified data pipelines in Snowflake that combine structured tables, semi-structured data (JSON/Parquet), and large collections of documents (PDF, DOCX, text).
- Build agentic document analytics workflows: large-scale document ingestion, text extraction, cleaning, chunking, embeddings, vector stores, and efficient retrieval for analytic queries.
- Implement Natural Language Query (NLQ) interfaces that translate user text prompts into analytic queries or retrieval flows and return explainable results.
- Integrate Snowflake with LLMs (OpenAI or equivalent) for tasks such as summarization, question-answering, classification, and code-generation-using External Functions, Snowpark, and/or secure API patterns.
- Create performant retrieval-augmented generation (RAG) architectures that leverage Snowflake-stored embeddings and external or internal vector indexes as appropriate.
- Author Snowpark/Python/SQL transformations, Streams & Tasks, and job orchestration to enable near-real-time and batch analytical workloads.
- Implement data modeling and governance patterns within Snowflake: schemas, role-based access control, masking, lineage, and metadata to support analytics and compliance.
- Partner with product, ML/AI, BI, and engineering teams to translate business requirements into robust production-ready solutions.
- Build monitoring, observability, and cost controls for compute, storage, and API usage related to document analytics and LLM integration.
- Produce technical documentation, runbooks, and clear explanations of model/LLM behavior and limitations to non-technical stakeholders.
Technical Skills
- Statistical Analysis: Proficiency in statistical methods and software, such as R, SAS, and Python, to analyze data.
- Data Visualization: Ability to create compelling visualizations using tools like Tableau, Power BI, and D3.js.
- Database Management: Knowledge of SQL and NoSQL databases for efficient data storage and retrieval.
- Programming: Strong programming skills, particularly in languages such as Python, Java, and C++.
- Design and implement unified data pipelines in Snowflake that combine structured tables, semi-structured data (JSON/Parquet), and large collections of documents (PDF, DOCX, text).
- Build agentic document analytics workflows: large-scale document ingestion, text extraction, cleaning, chunking, embeddings, vector stores, and efficient retrieval for analytic queries.
- Implement Natural Language Query (NLQ) interfaces that translate user text prompts into analytic queries or retrieval flows and return explainable results.
- Integrate Snowflake with LLMs (OpenAI or equivalent) for tasks such as summarization, question-answering, classification, and code-generation-using External Functions, Snowpark, and/or secure API patterns.
- Create performant retrieval-augmented generation (RAG) architectures that leverage Snowflake-stored embeddings and external or internal vector indexes as appropriate.
- Author Snowpark/Python/SQL transformations, Streams & Tasks, and job orchestration to enable near-real-time and batch analytical workloads.
- Implement data modeling and governance patterns within Snowflake: schemas, role-based access control, masking, lineage, and metadata to support analytics and compliance.
- Partner with product, ML/AI, BI, and engineering teams to translate business requirements into robust production-ready solutions.
- Build monitoring, observability, and cost controls for compute, storage, and API usage related to document analytics and LLM integration.
- Produce technical documentation, runbooks, and clear explanations of model/LLM behavior and limitations to non-technical stakeholders.
Want more jobs like this?
Get jobs in Johnstown, PA delivered to your inbox every week.

- Critical Thinking: Ability to think critically and approach problems from multiple perspectives.
- Quantitative Analysis: Strong quantitative skills to interpret and manipulate data effectively.
- Attention to Detail: Meticulous attention to detail to ensure accuracy in data analysis and model development.
- Communication: Excellent verbal and written communication skills to convey complex findings to non-technical stakeholders.
- Team Collaboration: Ability to work collaboratively with diverse teams to achieve common goals.
- Leadership: Capability to lead projects and mentor junior team members.
- Experience: Relevant experience in data analysis, machine learning, and business intelligence
- Certifications: Professional certifications in data science, machine learning, or business analytics can be beneficial.
- Education: A bachelor's degree in a related field such as Statistics, Mathematics, Computer Science, or Engineering. Advanced degrees (Master's or Ph.D.) in these fields are highly desirable.
40Time Type
Full timePay Range
The typical pay range for this role is:$64,890.00 - $222,480.00This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above.Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.Great benefits for great peopleWe take pride in our comprehensive and competitive mix of pay and benefits - investing in the physical, emotional and financial wellness of our colleagues and their families to help them be the healthiest they can be. In addition to our competitive wages, our great benefits include:
- Affordable medical plan options, a 401(k) plan (including matching company contributions), and an employee stock purchase plan.
- No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching.
- Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility.
Perks and Benefits
Health and Wellness
- Health Insurance
- Dental Insurance
- Vision Insurance
- Life Insurance
- HSA
- HSA With Employer Contribution
- Pet Insurance
- Mental Health Benefits
Parental Benefits
- Fertility Benefits
- Adoption Assistance Program
- Family Support Resources
Work Flexibility
- Flexible Work Hours
- Remote Work Opportunities
- Hybrid Work Opportunities
Office Life and Perks
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Personal/Sick Days
Financial and Retirement
- 401(K) With Company Matching
Professional Development
- Tuition Reimbursement
Diversity and Inclusion
- Employee Resource Groups (ERG)
- Diversity, Equity, and Inclusion Program