BBJ Lab at the University of Chicago

Selected Publications

Prototype Learning to Create Refined Interpretable Digital Phenotypes from ECGs

Sahil Sethi*, David Chen*, Michael C. Burkhart, Nipun Bhandari, Bashar Ramadan, Brett K Beaulieu-Jones. 31st Pacific Symposium on Biocomputing (oral) (2025). *Equal contribution.

Prototype-based deep learning models generate interpretable predictions by comparing inputs to representative training examples. We applied ProtoECGNet—a model trained for ECG classification on PTB-XL—to the MIMIC-IV clinical database without retraining. Individual prototypes showed stronger associations with discharge diagnoses than predicted classes or automated report concepts, suggesting that these learned waveform patterns may serve as transferable physiologic markers aligned with downstream clinical conditions.

ProtoECGNet: Case-Based Interpretable Deep Learning for Multi-Label ECG Classification with Contrastive Learning

Sahil Sethi, David Chen, Thomas Statchen, Michael C. Burkhart, Nipun Bhandari, Bashar Ramadan, Brett K Beaulieu-Jones. 10th Machine Learning for Healthcare Conference (MLHC), Proceedings of Machine Learning Research 298 (2025)

ProtoECGNet is a prototype-based deep learning model for multi-label ECG classification that mirrors clinical interpretation by learning distinct prototypes for rhythm, morphology, and global abnormalities. It introduces a contrastive loss to structure the prototype space based on diagnostic co-occurrence, achieving near state-of-the-art performance on PTB-XL while providing faithful, case-based explanations grounded in real ECG segments from the training set. ProtoECGNet delivers transparent reasoning across all 71 labels, spanning a comprehensive range of cardiac abnormalities in 12-lead ECGs.

Synthetic Data Distillation Enables the Extraction of Clinical Information at Scale

Elizabeth Geena Woo*, Michael C Burkhart*, Emily Alsentzer, Brett Beaulieu-Jones. npj Digital Medicine (2025)
*co-first authors

Our team demonstrated that synthetic data distillation can fine-tune smaller, open-source large-language models (LLMs) to achieve performance similar to larger models in extracting clinical information. This smaller model outperforms its base version and sometimes even the larger model. This approach will enable more scalable and cost-efficient clinical information extraction, improving tasks like patient phenotyping.

Advancing Healthcare AI Governance: A Comprehensive Maturity Model Based on Systematic Review

Rowan Hussein, Anna Zink, Bashar Ramadan, Frederick M Howard, Maia Hightower, Sachin Shah, Brett K Beaulieu-Jones. Preprint (2024)

Our systematic analysis of healthcare AI governance frameworks revealed significant gaps in addressing diverse organizational needs, leading to the development of HAIRA - a novel, resource-aware maturity model spanning seven critical domains. This adaptive framework provides actionable governance pathways across five organizational levels, from small practices to major medical centers, enabling healthcare institutions to systematically assess and advance their AI governance capabilities based on available resources.

Disease progression strikingly differs in research and real-world Parkinson’s populations

Brett K Beaulieu-Jones, Francesca Frau, Sylvie Bozzi, Karen J Chandross, M Judith Peterschmitt, Caroline Cohen, Catherine Coulovrat, Dinesh Kumar, Mark J Kruger, Scott L Lipnick, Lane Fitzsimmons, Isaac S Kohane, Clemens R Scherzer. npj Parkinson's disease (2024)

Our team compared Parkinson's disease (PD) progression across research and real-world populations, utilizing real-world data (RWD) and large language models for detailed characterization. It finds that patients in real-world settings are diagnosed later and start treatment later than those in research populations, with faster motor and cognitive progression in real-world cohorts. The study highlights the differences between research and real-world populations, emphasizing the need to use diverse data sources and account for biases in clinical trial design and analyses.

Predicting seizure recurrence after an initial seizure-like episode from routine clinical notes using large language models: a retrospective cohort study

Beaulieu-Jones, Brett K., Mauricio F Villamar, Phil Scordis, Ana Paula Bartmann, Waqar Ali, Benjamin D Wissel, Emily Alsentzer, Johann de Jong, Arijit Patra, Isaac Kohane. Lancet Digital Health (2023)

Our team demonstrated machine learning models, particularly large language models pre-trained on domain-specific data, are highly effective in predicting seizure recurrence in children after an initial seizure-like event. These models outperformed traditional structured data approaches and indicate that clinical notes contain significant information useful for the prediction of seizure recurrence.

Phenotypic overlap between rare disease patients and variant carriers in a large population cohort informs biological mechanisms

Lane Fitzsimmons, Undiagnosed Diseases Network, Brett Beaulieu-Jones*, Shilpa Nadimpalli Kobren* Preprint (in press) (2024)
*co-corresponding

The biological mechanisms causing extreme symptoms in rare disease patients are complex and often elusive. This study analyzes genotype and phenotype data from the UK Biobank to understand the pathways leading to seizures in undiagnosed patients from the Undiagnosed Diseases Network. By examining milder, related symptoms in UK Biobank participants with similar genetic variants, the study aims to shed light on the molecular mechanisms behind these rare conditions

Characterizing the connection between Parkinson's disease progression and healthcare utilization

Lane Fitzsimmons, Francesca Frau, Sylvie Bozzi, Karen Chandross, Brett Beaulieu-Jones Preprint (2024)

This study analyzed Parkinson's disease (PD) progression by examining clinical events across different Hoehn & Yahr (H&Y) stages extracted using natural language processing. It provides a view of healthcare utilization at different H&Y stages, models expected H&Y progression and demonstrates the potential value for a therapeutic which would slow progression.

Machine learning for patient risk stratification: standing on, or looking over, the shoulders of clinicians?

Beaulieu-Jones, Brett K., William Yuan, Gabriel A. Brat, Andrew L. Beam, Griffin Weber, Marshall Ruffin, and Isaac S. Kohane. NPJ digital medicine (2021)

We trained deep learning models on clinician-initiated administrative data for 42.9 million admissions and found performance close to full EMR-based benchmarks for inpatient outcomes. These models rely heavily on clinical behavior, and should not be used for individualized clinical decision making. For meaningful clinical guidance, models should outperform these benchmarks using data sources that capture patient state rather than clinician actions (i.e., looking over their shoulder).

Examining the use of real‐world evidence in the regulatory process

Beaulieu‐Jones, Brett K., Samuel G. Finlayson, William Yuan, Russ B. Altman, Isaac S. Kohane, Vinay Prasad, and Kun‐Hsing Yu. Clinical Pharmacology & Therapeutics (2020)

The 21st Century Cures Act requires the US FDA to create guidelines for using real-world evidence (RWE) in the regulatory process. While RWE has led to crucial medical findings, it faces challenges in proving treatment efficacy compared to randomized controlled trials. In this review article, we summarized the advantages and limitations of RWE, identified the key opportunities for RWE, and pointed the way forward to maximize the potential of RWE for regulatory purposes.

Privacy-preserving generative deep neural networks support clinical data sharing

Beaulieu-Jones, Brett K., Zhiwei Steven Wu, Chris Williams, Ran Lee, Sanjeev P. Bhavnani, James Brian Byrd, and Casey S. Greene. Circulation: Cardiovascular Quality and Outcomes (2019)

Our team has developed a method using deep neural networks to generate synthetic data that closely resembles real participants from the SPRINT trial, ensuring privacy while maintaining the utility of the data for research. This technique allows for the sharing of clinical data with researchers for secondary analysis without risking patient privacy.

Reproducibility of computational workflows is automated using continuous analysis

Beaulieu-Jones, Brett K., and Casey S. Greene. Nature biotechnology (2017)

Continuous analysis is a workflow that integrates Docker container technology with continuous integration to automatically rerun computational analyses upon any changes in source code or data. This approach facilitates effortless reproducibility of research results for peers and provides an audit trail for data analyses, enhancing transparency and reliability in scientific studies.

Semi-supervised learning of the electronic health record for phenotype stratification

Beaulieu-Jones, Brett K., and Casey S. Greene. Journal of biomedical informatics (2016)

We developed a semi-supervised learning technique to improve the extraction of phenotypes from electronic health records, aiding in the identification of disease subtypes and genetic associations. This method has shown promise in enhancing classification accuracy and predicting patient outcomes, even with limited high-quality data.

Primary Research Interests

Selected Publications

People

Brett Beaulieu-Jones, PhD

Bashar Ramadan, MBBS

Geena Woo, BA

Michael Burkhart, PhD

Rowan Hussein, BA

Tom Statchen, BS

Sahil Sethi, BS

Luke Solo, BS

Inhyeok (Daniel) Lee, BS

S’Khaja Charles, BS

Hanna Hieromnimon, PhD

Alumni, Close Collaborators & Thesis Examinees

Anna Zink, PhD

David Chen, MS

Ming-Chieh (Eddie) Liu, MS

Sylvia Edoigiawerie, PhD

Lane Fitzsimmons, BS

Temidayo Adeluwa, MS

Jessica De Freitas, PhD

Yidi Huang, MS

Mohammed Saqib, BS

Open Positions

Active / Recent Funding and Support

Get in touch