Principal Data Scientist


Primary Responsibilities:

  • Self-motivated and demonstrates an innate ability to absorb new concepts, technical frameworks, programming paradigms while employing them creatively to solve complex analytical problems
  • Builds, Manages and Owns entire Analytics Pipelines that includes Data Sourcing, Engineering, Modeling and Interpretation – Proposes innovative ways to analyze and solve problems by using data mining approaches on available information
  • Collaborates with end-users to evaluate business goals, identify needs, and propose solutions using the portfolio of technical assets like SAS, Big Data / Hadoop, Spark, and Relational data stores (e.g. Teradata, Oracle, SQL Server)
  • Identifies sources of truth, data domains, and attributes to support current and new hypotheses across lines of businesses (Medicare, Medicaid, Commercial etc.) and functional domains (Clinical, Operational, Financial etc.)
  • Performs data profiling, cleansing, and enhancements, identifies usable features while ensuring consistent formatting with an eye towards repeatability as well as flexibility for future changes
  • Transforms datasets for analytical research versus traditional reporting and optimizes the performance of large datasets for analytical speed and cost-effective storage
  • Builds analytical models (supervised, unsupervised) that extract the necessary signal from the data which test a variety of hypotheses and draw meaningful interpretations against the business context
  • Able to articulate analytical findings and discoveries to senior leadership across multiple lines of business, while exposing assumptions and validation work in an easily understandable and consistent manner

Required Qualifications:

  • Bachelor’s degree or equivalent in Information Systems, Computer Science, related field or equivalent experience in Technology or Analytics
  • 8+ years of data analysis experience – ability to work with huge volumes of data, disparate relational databases and schemas and design applications that efficiently scale to process hundreds of millions of observations
  • 5+ years of experience as a programmer either in application languages like Java, C++, VB or Data Processing / Statistical Computing platforms like Spark, SAS, R, Python


  • Experience leveraging the Hadoop ecosystem (e.g. Hive, Pig, Spark) as a Data Lake for offloading traditional ETL workloads
  • Ability to leverage SAS / Hadoop to solve advanced analytics use-cases
  • Project management or Process Optimization experience
  • Experience in the PBM / healthcare services or adjacent space
  • Interpersonal and relationship skills necessary to work with an integrated team of customers, product managers, project managers, developers, analysts, and quality engineers
  • Excellent Communication (oral & written) skills along with relationship-building and customer facing skills are needed with an ability to facilitate meetings with Business and IT stakeholders
  • Strong presentation and communication skills with ability to develop high level executive as well deep-dive technical presentations

To apply for this job please visit