St. Mary's Good Samaritan Senior Data Scientist in St. Louis, Missouri


Role Purpose:

The Senior Data Scientist manages data throughout all stages of the data analysis lifecycle. This includes obtaining novel data from external and sometimes untraditional sources, managing and coordinating the collection and utilization of data across diverse data platforms, cleaning and transforming data of widely-varying formats, and providing meaningful insights and actionable analytics. The Senior Data Scientist must be highly skilled in working with both structured and unstructured data, transforming data to ensure consistency and accuracy across data systems, automating resource-demanding data processes, and creating visualizations and reports. The Senior Data Scientist also provides administrative support relative to these activities.


  • Obtaining Data- Utilizes SQL to mine appropriate data elements from the various clinical database systems. Interacts with web APIs to fetch data from externally published sources. Utilizes web scraping and mining tools to extract data from on-traditional Internet sources. Extracts data algorithmically from PDFs, formatted Excel workbooks, and other common document formats that are not traditionally machine-readable.

  • Cleaning, Transforming, and Transmitting Data- Parses, validates and scrubs data of widely-differing formats. Assures integrity of clinical data as it is transmitted from system to system and/or system to database, ensuring transactional accuracy, completeness and timeliness.

  • Analyzing Data- Integrates data from multiple sources and performs complex statistical data analysis in support of ad hoc and scheduled requests. Provides actionable insights.

  • Visualizing Data- Creates publication-quality graphical representations of data that are pleasing to the eye and easily interpreted.

  • Presenting Data to Drive Business Decisions- delivers data products in a timely manner using effective report and/or presentation formats.

  • Automation- Utilizes scripting languages to automate resource-consuming and repetitive tasks within the analysis workflow. Writes reusable code for speeding up the delivery of commonly-made requests related to fetching, summarizing, transforming and visualizing data.

  • Building Software Solutions- Utilizes object-oriented or functional programming languages to create dependable, easy-to-use tools to help team members work with data more efficiently and increase team productivity.


Minimum Requirements:

  • Bachelor’s Degree in computer science, engineering, statistics or math

  • A minimum of 5 years’ experience in database ETL (Extract, Transform, Load) operations, statistical computer programming, and functional/object orientated computer programming are required

  • Expertise with analytical programming languages, such as SPSS, Python, R and Matlab

  • Expertise with functional/object- oriented programming languages, such as Java, C, C++, Python, Ruby or similar

  • Expertise in retrieving data from web-based sources (JSON, REST APIs, web scraping, etc.)

Preferred Qualifications:

  • Advanced degree is preferred

*SSM Health - System Office – *

SSM Health is one of the largest Catholic health systems in the country and is dedicated to quality and compassionate care for anyone in need, regardless of ability to pay. Based in St. Louis, where its System Office is located, SSM Health operates 20 hospitals in Wisconsin, Illinois, Missouri and Oklahoma. We provide care in various settings: outpatient sites, physician offices, a pharmacy benefit company, an insurance plan, hospitals, nursing homes, home care, hospice, telehealth and a technology company. _Our Mission: Through our exceptional health care services, we reveal the healing presence of God. _

Organization: SSM Health - System Office

Primary Location: Missouri-St. Louis-SSM Health System Office - 10101 Woodfield

Work Locations: SSM Health System Office - 10101 Woodfield (0033) 10101 Woodfield Lane St. Louis, 63132

Job: Professional Services

Req ID: 18002809