DAB Pipeline
A modular, reproducible pipeline for detecting algorithmic bias in opioid use disorder treatment data.
What is the DAB Pipeline?
The Detecting Algorithmic Bias (DAB) Pipeline evaluates whether machine learning models for predicting opioid use disorder (OUD) treatment outcomes perform consistently across demographic groups. It uses the CTN-0094 clinical trial dataset and supports multiple modeling strategies and outcome definitions.
We define algorithmic bias as a disparity in model performance between different demographic groups — for example, a model that predicts relapse accurately for White patients but not for Black or Hispanic patients.
Quick Start
Key Features
-
Fairness-First Design
Every evaluation includes the demographic composition of the training cohort, making bias auditing a first-class output.
-
Clinical Outcomes
8 pre-defined OUD outcomes across binary, count, and time-to-event endpoint types, derived from the CTN-0094 trial.
-
Modular Architecture
Swap out models or outcome definitions without touching the core pipeline logic. External datasets must conform to the CTN-0094 schema required by PSM.
-
Propensity Score Matching
Constructs balanced cohorts via PSM across a spectrum of majority/minority demographic ratios.
Built On
This pipeline is described in Odom et al. (2026, Drug and Alcohol Review) and extends the work of Luo et al. (2024), replicating their OUD return-to-use prediction model and adding a fairness analysis layer to evaluate how model performance shifts across demographic compositions.
Team
| Role | Name | Institution |
|---|---|---|
| Clinical Lead | Prof. Laura Brandt | City College of New York |
| Computational Lead | Prof. Gabriel Odom | Florida International University |
| Data Scientist | Ganesh Jainarain | City College of New York |
| Data Scientist | Aaron Marker | Stony Brook University |
| Funding | NIH AIM-AHEAD | 1OT2OD032581-02-267 |
Citation
If you use this pipeline in your research, please cite: