Join our Talent Network
Skip to main content

Data Scientist, AI/ML and RWD

This job posting is no longer active.

Location: , United States

Save Job Saved


Citeline is one of the world's leading providers of data and intelligence on clinical trials, drug treatments, medical devices and what's new in the regulatory and commercial landscape. Relying on us to deliver vital advantage when making critical R&D and commercial decisions, our customers come from over 3000 of the worlds leading pharmaceutical, contract research organizations (CROs), medical technology, biotechnology and healthcare service providers, including the top 10 global pharma and CROs.

From drug and device discovery and development to regulatory approval, and from product launch to lifecycle management, we provide the intelligence and insight to help our customers seize opportunities, mitigate risk and make business-critical decisions, faster. As the pharma and healthcare sector faces unparalleled upheaval, customers rely on our independent advice, enabling them to cut through the clutter and make sense of changing drug development, regulatory and competitive landscapes.

Now, Citeline is proud to be a part of Norstella, an organization that consists of market-leading pharmaceutical solutions providers united under one goal: to improve patient access to life-saving therapies. Within this organization, Citeline plays a key role in helping clients connect the dots from pipeline to patient.

Please note- all candidates must be authorized to work in the United States. We do not provide visa sponsorship or transfers. We are not currently accepting candidates who are on an OPT visa.

The Role:

  • As part of the AI & Life Sciences Solutions team within the RWD group, use in-house RWD assets to build rapid prototypes for machine learning (ML) models on real-world endpoints that are relevant for pharma and MedTech AI-SaaS solutions and decision support tools across the drug life cycle, including patient journey, line-of-therapy (LOT), time-on-treatment, drug adverse events, patient stratification, clinical trial designs, drug adherence, prescription patterns, market access etc.
  • Design, train, and evaluate survival, classification, and regression ML models using appropriate algorithms and frameworks.
  • Generate in-depth and higher-resolution actionable and explainable insights and recommendations from in-house RWD assets using AI/ML.
  • Build real-time dashboards and web apps to make ML predictive models and generate insights accessible across to internal stakeholders across the organization.
  • Augment/enrich in-house claims data assets through application of AI/ML.
  • Contribute to development of a centralized code libraries and knowledge base for rapid and streamlined generation of analysis-ready analytic data frames from RWD for the patient cohort of interest.
  • Act as a power user of in-house RWD data assets and provide thought leadership on innovative ML solutions on RWD.


  • An advanced degree (M.S., PhD) in a quantitative field such as physics, biophysics, statistics, biomedical sciences/engineering, data science, computer science, computational biology or similar fields.
  • 2+ years of hands-on experience in generating ML ready high-dimensional analytical files from longitudinal RWD sources for research cohort of interest applying complex study designs and I/E criteria and leveraging advanced data science and ML methods in modeling real-world patient outcomes across the drug life cycle.
  • 2+ years of hands-on experience in applying ML algorithms, both supervised (XGBoost, random forest, MLP etc.) and unsupervised (Agglomerative Clustering, K-means, DBSCAN etc.), in healthcare space.
  • Familiarity with explainable AI (SHAP).
  • Proficiency in data preprocessing, feature selection and engineering, and dimensionality reduction methods. Hands-on experience and solid understanding of survival analysis is required.
  • Strong proficiency in programming in Python and 2+ years of experience with data science tools and libraries and ML frameworks and (e.g., NumPy, SciPy, Pandas, scikit-learn).
  • Familiarity with data science and ML practices, e.g., version control systems, agile methodologies, and documentation.
  • Experience in working with AWS cloud environment and large databases (e.g., AWS redshift).
  • Eloquent with communicating complex insights and presenting concepts to diverse audiences.

Preferred Experience and Skills:

  • Hands-on experience in building deep learning (DL) models such as, Recurrent Neural Net, Transformer (BERT) on longitudinal healthcare data using DL libraries and frameworks (e.g., PyTorch or TensorFlow or Keras).
  • Proficiency in SQL. Experience with PySpark is a plus.
  • Fundamental understanding of methodologies to tackle data imbalance (predicting rare diagnoses or events) and data missingness.
  • Experience in managing ML lifecycle using open-source tools (e.g., MLflow).
  • Experience with developing web apps and real-time dashboards for data science and ML using open-source tools (e.g., Streamlit).
  • Solid understanding of omics data and hands-on experience with analyzing such datasets.
  • Conference abstracts and/or peer-reviewed publications on application of AI/ML in healthcare datasets.
  • 3 + years of experience with common pharma analytics use cases, including patient journey/LOT, time-on-treatment, drug adverse events, availability for clinical trials, market access etc.

The Guiding Principles for success at Norstella:

01: Bold, Passionate, Mission-First

We have a lofty mission to Smooth Access to Life Saving Therapies and we will get there by being bold and passionate about the mission and our clients. Our clients and the mission in what we are trying to accomplish must be in the forefront of our minds in everything we do.

02: Integrity, Truth, Reality

We make promises that we can keep, and goals that push us to new heights. Our integrity offers us the opportunity to learn and improve by being honest about what works and what doesnt. By being true to the data and producing realistic metrics, we are able to create plans and resources to achieve our goals.

03: Kindness, Empathy, Grace

We will empathize with everyone's situation, provide positive and constructive feedback with kindness, and accept opportunities for improvement with grace and gratitude. We use this principle across the organization to collaborate and build lines of open communication.

04: Resilience, Mettle, Perseverance

We will persevere even in difficult and challenging situations. Our ability to recover from missteps and failures in a positive way will help us to be successful in our mission.

05: Humility, Gratitude, Learning

We will be true learners by showing humility and gratitude in our work. We recognize that the smartest person in the room is the one who is always listening, learning, and willing to shift their thinking.


  • Medical and prescription drug benefits
  • Health savings accounts or flexible spending accounts
  • Dental plans and vision benefits
  • Basic life and AD&D Benefits
  • 401k retirement plan
  • Short- and Long-Term Disability
  • Maternity leave
  • Paid parental leave
  • Open Vacation Policy

The expected base salary for this position ranges from $120,000 to $160,000. It is not typical for offers to be made at or near the top of the range. Salary offers are based on a wide range of factors including relevant skills, training, experience, education, and, where applicable, licensure or certifications obtained. Market and organizational factors are also considered. In addition to base salary and a competitive benefits package, successful candidates are eligible to receive a discretionary bonus.

Norstella is an equal opportunities employer and does not discriminate on the grounds of gender, sexual orientation, marital or civil partner status, pregnancy or maternity, gender reassignment, race, color, nationality, ethnic or national origin, religion or belief, disability or age. Our ethos is to respect and value peoples differences, to help everyone achieve more at work as well as in their personal lives so that they feel proud of the part they play in our success. We believe that all decisions about people at work should be based on the individuals abilities, skills, performance and behavior and our business requirements. Norstella operates a zero-tolerance policy to any form of discrimination, abuse or harassment.

Sometimes the best opportunities are hidden by self-doubt. We disqualify ourselves before we have the opportunity to be considered. Regardless of where you came from, how you identify, or the path that led you here- you are welcome. If you read this job description and feel passion and excitement, were just as excited about you.



Interested in a career at Citeline?
Join our Talent Network today!

Join our Talent Network