We are a multidisciplinary research lab of data scientists, epidemiologists, software engineers, and clinicians working at the intersection of medicine and computer science. Our mission is to improve human health and healthcare by analyzing electronic health records, administrative data, disease registries, and genomic resources.
Our lab analyzes electronic health records from millions of individuals using advanced statistical approaches in order to develop a better understanding of the causes and mechanisms of disease. Our research can guide clinicians, policy makers, and researchers on how to formulate differential diagnoses, allocate resources, and target research priorities.
Recent research examples:
Raw EHRs require substantial preprocessing before they can be transformed into research-ready datasets and analyzed statistically to answer clinically meaningful questions. Our lab develops computational algorithms for defining, validating, and ascertaining multi-modal disease phenotypes in EHR data. The resulting phenotypes are stored in the open-access HDR UK Phenotype Library.
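As a rough illustration of what ascertaining a phenotype from coded records can look like, the sketch below flags patients whose records contain a diagnosis code from a code list and keeps the earliest matching date. The record layout, the function name `ascertain_phenotype`, and the one-entry code list are assumptions made for this example; they are not taken from the HDR UK Phenotype Library or from the lab's own algorithms, which also involve validation steps not shown here.

```python
from datetime import date

# Illustrative code list only (ICD-10 E11 = type 2 diabetes mellitus).
# Not a validated phenotype definition.
T2D_ICD10_PREFIXES = ("E11",)


def ascertain_phenotype(records, code_prefixes):
    """Return {patient_id: earliest date of a diagnosis code matching the list}."""
    first_seen = {}
    for rec in records:
        # str.startswith accepts a tuple, so any prefix in the list matches.
        if rec["code"].startswith(code_prefixes):
            pid = rec["patient_id"]
            if pid not in first_seen or rec["date"] < first_seen[pid]:
                first_seen[pid] = rec["date"]
    return first_seen


# Hypothetical toy records: one row per coded diagnosis event.
records = [
    {"patient_id": "p1", "code": "E11.9", "date": date(2015, 3, 2)},
    {"patient_id": "p1", "code": "E11.2", "date": date(2012, 7, 19)},
    {"patient_id": "p2", "code": "J45.0", "date": date(2018, 1, 5)},
]

cases = ascertain_phenotype(records, T2D_ICD10_PREFIXES)
print(cases)
```

A real phenotype would typically combine several coding systems and data sources (diagnoses, prescriptions, test results) and be checked against reference standards before use.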
Recent research examples:
A growing body of evidence from observational and interventional research suggests that complex diseases, such as type 2 diabetes, asthma, and chronic obstructive pulmonary disease (COPD), are composed of distinct sub-phenotypes with different risk factor and prognostic profiles. Our lab develops and evaluates unsupervised machine learning algorithms to identify, describe, and evaluate disease subtypes, which can inform the development of personalized treatments.
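To make the idea of unsupervised subtype discovery concrete, here is a minimal k-means sketch in plain Python. K-means is used only as a familiar stand-in: the text does not say which clustering algorithms the lab uses. The two features (age at diagnosis and body mass index) and all patient values are hypothetical.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, iterations=20):
    """Plain k-means with a deterministic start: the first k points are the initial centroids."""
    centroids = [points[i] for i in range(k)]
    for _ in range(iterations):
        # Assignment step: group each point with its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members
        # (an empty cluster keeps its previous centroid).
        centroids = [
            tuple(sum(dim) / len(members) for dim in zip(*members)) if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
    labels = [
        min(range(k), key=lambda c: squared_distance(p, centroids[c])) for p in points
    ]
    return labels, centroids


# Hypothetical patients: (age at diagnosis in years, body mass index).
patients = [(45, 24), (70, 33), (48, 25), (72, 35), (44, 23), (69, 34)]

labels, centroids = kmeans(patients, k=2)
print(labels)
print(centroids)
```

In practice, features would be standardized before clustering, and the number and stability of clusters would need to be evaluated rather than fixed in advance.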
Recent research examples:
Data linkage is the process of identifying and linking individuals across heterogeneous data sources. Working with the Federal University of Bahia, our lab is contributing to the development of scalable probabilistic data linkage methods for linking administrative records of over 140 million participants in Brazil, and to evaluating the quality of the linkage using supervised machine learning.
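For readers unfamiliar with probabilistic linkage, the sketch below scores a pair of records using Fellegi-Sunter style agreement weights, a classic formulation of the idea. It is offered as a stand-in only: the text does not state which model the lab and its collaborators use. The field names and the m/u probabilities are hypothetical values chosen for illustration.

```python
import math

# Hypothetical per-field probabilities:
#   m = P(field agrees | records belong to the same person)
#   u = P(field agrees | records belong to different people)
FIELD_PARAMS = {
    "name": (0.95, 0.01),
    "birth_date": (0.98, 0.005),
    "municipality": (0.90, 0.10),
}


def match_weight(rec_a, rec_b, params=FIELD_PARAMS):
    """Sum of log2 likelihood ratios across fields; higher means more likely the same person."""
    total = 0.0
    for field, (m, u) in params.items():
        if rec_a[field] == rec_b[field]:
            total += math.log2(m / u)              # agreement weight (positive)
        else:
            total += math.log2((1 - m) / (1 - u))  # disagreement weight (negative)
    return total


# Hypothetical records from two data sources.
a = {"name": "maria silva", "birth_date": "1980-02-11", "municipality": "Salvador"}
b = {"name": "maria silva", "birth_date": "1980-02-11", "municipality": "Feira de Santana"}
c = {"name": "joao souza", "birth_date": "1975-09-30", "municipality": "Recife"}

print(match_weight(a, a), match_weight(a, b), match_weight(a, c))
```

Pairs scoring above an upper threshold would be treated as links and those below a lower threshold as non-links, with the uncertain band in between being one natural place to apply a supervised classifier or manual review.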
In response to the current public health emergency caused by the SARS-CoV-2 virus and the COVID-19 pandemic, our lab has been actively engaged in several national research initiatives:
Gurdasani D. et al. Vaccinating adolescents in England: a risk-benefit analysis. Journal of the Royal Society of Medicine 10.1177/01410768211052589
Media coverage: Daily Mail, The Guardian, BMJ, Royal Society of Medicine, The Times, Financial Times
Wilde A. H. et al. The association between mechanical ventilator compatible bed occupancy and mortality risk in intensive care patients with COVID-19: a national retrospective cohort study. BMC Medicine 10.1186/s12916-021-02096-0
Media coverage: Vox, Daily Mail, Independent, Evening Express, BBC News, Yahoo! News.
Eyre M. et al. Impact of baseline cases of cough and fever on UK COVID-19 diagnostic testing rates: estimates from the Bug Watch community cohort study. Wellcome Open Research 10.12688/wellcomeopenres.16304.1
Media coverage: BBC News, The Guardian. Research cited in the Academy of Medical Sciences Preparing for a challenging winter 2020-2021 report.
Banerjee A. et al. Estimating excess 1-year mortality associated with the COVID-19 pandemic according to underlying conditions and age: a population-based cohort study. The Lancet 10.1016/S0140-6736(20)30854-0
Interactive online calculator: OurRisk.Cov
Media coverage: FT, Times, Guardian, BBC, Protagon (GR) and elsewhere.
Lai A. et al. Estimated impact of the COVID-19 pandemic on cancer services and excess 1-year mortality in people with cancer and multimorbidity: near real-time data on cancer care, cancer deaths and a population-based cohort study. BMJ Open 10.1136/bmjopen-2020-043828
Media coverage: The Guardian, Daily Mail, Evening Standard, BMJ.
Using large-scale electronic health records in England, we have developed a simple online tool (OurRisk.Cov) that can calculate and visualize excess deaths over one year from the COVID-19 pandemic based on age, sex, and underlying disease-specific estimates.
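One simple way to think about an excess-death estimate of this kind is a multiplicative relative-risk model: expected extra deaths equal the population at risk, times the proportion infected, times the baseline one-year mortality, times the relative risk minus one. The sketch below assumes that model; it is not confirmed here as the exact method behind OurRisk.Cov, and the function name and all input values are hypothetical.

```python
def excess_deaths(population, baseline_mortality_1y, relative_risk, infection_rate=1.0):
    """Expected extra deaths over one year under an assumed multiplicative relative-risk model."""
    return population * infection_rate * baseline_mortality_1y * (relative_risk - 1.0)


# Hypothetical scenario: 100,000 people in one age, sex, and condition group,
# 5% baseline one-year mortality, 10% infected, relative risk of 1.5.
extra = excess_deaths(
    population=100_000,
    baseline_mortality_1y=0.05,
    relative_risk=1.5,
    infection_rate=0.10,
)
print(round(extra))
```

Running the same function over several relative-risk and infection-rate scenarios gives a range of estimates rather than a single figure, which is usually more honest when the inputs are uncertain.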