David Carrell, PhD, is an assistant investigator who develops and applies technology for extracting rich information from unstructured clinical text, such as physician progress notes. This work uses state-of-the-art clinical natural language processing (NLP) technologies in single- and multi-site settings.
An example of this work is an NLP system to identify women who have been diagnosed with recurrent breast cancer. Despite being a common and consequential clinical diagnosis, recurrent breast cancer cannot be tracked reliably using standard medical codes found in a person’s chart. Supported by a grant from the National Cancer Institute, he and his colleagues used information from clinician progress notes, radiology reports, and pathology reports to classify women by breast cancer recurrence.
Working with teams of researchers inside and outside Kaiser Permanente Washington Health Research Institute, Dr. Carrell has applied similar precision phenotyping methods to identify evidence of carotid artery stenosis, colon polyps, problem use of prescription opioids, and colonoscopy quality.
Dr. Carrell’s current research projects are applying NLP and machine learning methods to improve medication safety surveillance (through the Food and Drug Administration Sentinel Initiative) and to evaluate the impact on drug use disorder diagnosis and treatment of Kaiser Permanente Washington patients screened for unhealthy cannabis and other drug use. His ongoing work also includes development and application of automated algorithms based on electronic health record data to identify patients with particular health conditions (called “patient phenotypes”) for use in genetic and epidemiological research.
Surveillance methods for adverse events associated with medication exposure, including problem use of prescription opioids
Methods for using structured and unstructured electronic health record data to identify patients with (or without) specific clinical conditions or phenotypes for large scale epidemiological and genomic studies
Identifying recurrent breast cancer using EHR text; Colonoscopy quality metrics
Recurrent breast cancer; Colonoscopy quality; Extracting information from clinical text; Automated de-identification of clinical text; Methods for applying NLP methods in multi-site research
Prevention and treatment
Klarin D, Verma SS, Judy R, Dikilitas O, Wolford BN, Paranjpe I, Levin MG, Pan C, Tcheandjieu C, Spin JM, Lynch J, Assimes TL, Nyrønning LÅ, Mattsson E, Edwards TL, Denny J, Larson E, Lee MTM, Carrell D, Zhang Y, Jarvik GP, Gharavi AG, Harley J, Mentch F, Pacheco JA, Hakonarson H, Skogholt AH, Thomas L, Gabrielsen ME, Hveem K, Nielsen JB, Zhou W, Fritsche L, Huang J, Natarajan P, Sun YV, DuVall SL, Rader DJ, Cho K, Chang KM, Wilson PWF, O'Donnell CJ, Kathiresan S, Scali ST, Berceli SA, Willer C, Jones GT, Bown MJ, Nadkarni G, Kullo IJ, Ritchie M, Damrauer SM, Tsao PS. Genetic architecture of abdominal aortic aneurysm in the Million Veteran program. Circulation. 2020 Sep 28. doi: 10.1161/CIRCULATIONAHA.120.047544. [Epub ahead of print]. PubMed
Palmer MR, Kim DS, Crosslin DR, Stanaway IB, Rosenthal EA, Carrell DS, Cronkite DJ, Gordon A, Du X, Li YK, Williams MS, Weng C, Feng Q, Li R, Pendergrass SA, Hakonarson H, Fasel D, Sohn S, Sleiman P, Handelman SK, Speliotes E, Kullo IJ, Larson EB, Ja+++. Loci identified by a genome-wide association study of carotid artery stenosis in the eMERGE network. Genet Epidemiol. 2021 Feb;45(1):4-15. doi: 10.1002/gepi.22360. Epub 2020 Sep 22. PubMed
Carrell DS, Malin BA, Cronkite DJ, Aberdeen JS, Clark C, Li MR, Bastakoty D, Nyemba S, Hirschman L. Resilience of clinical text de-identified with "hiding in plain sight" to hostile reidentification attacks by human readers. J Am Med Inform Assoc. 2020 Jul 1;27(9):1374-1382. doi: 10.1093/jamia/ocaa095. PubMed
Gordon AS, Zouk H, Venner E, Eng CM, Funke BH, Amendola LM, Carrell DS, Chisholm RL, Chung WK, Denny JC, Fedotov A, Hakonarson H, Kullo IJ, Larson EB, Leduc MS, Leppig KA, Lennon NJ, Linder JE, Muzny DM, Prows CA, Rasmussen-Torvik LJ, Rasouly HM, Roden DM, Rosenthal EA, Smith ME, Stanaway IB, Van Dreist SL, Walker K, Wiesner GL, Williams MS, Witkowski L, Crosslin DR, Gibbs RA, Rehm HL, Jarvik GP. Frequency of genomic secondary findings among 21,915 eMERGE network participants. Genet Med. 2020 Jun 17. doi: 10.1038/s41436-020-0810-9. Online ahead of print. PubMed
D Sullivan M, Boudreau D, Ichikawa L, Cronkite D, Albertson-Junkans L, Salgado G, VonKorff M, Carrell DS. Primary care opioid taper plans are associated with sustained opioid dose reduction. J Gen Intern Med. 2020 Mar;35(3):687-695. doi: 10.1007/s11606-019-05445-1. Epub 2020 Jan 6. PubMed
Carrell DS, Albertson-Junkans L, Ramaprasan A, Scull G, Mackwood M, Johnson E, Cronkite DJ, Baer A, Hansen K, Green CA, Hazlehurst BL, Janoff SL, Coplan PM, DeVeaugh-Geiss A, Grijalva CG, Liang C, Enger CL, Lange J, Shortreed SM, Von Korff M. Measuring problem prescription opioid use among patients receiving long-term opioid analgesic treatment: development and evaluation of an algorithm for use in EHR and claims data. J Drug Assess. 2020 Apr 28;9(1):97-105. doi: 10.1080/21556660.2020.1750419. eCollection 2020. PubMed
Schmidt AF, Holmes MV, Preiss D, Swerdlow DI, Denaxas S, Fatemifar G, Faraway R, Finan C, Valentine D, Fairhurst-Hunter Z, Hartwig FP, Horta BL, Hypponen E, Power C, Moldovan M, van Iperen E, Hovingh K, Demuth I, Norman K, Steinhagen-Thiessen E, Demuth J, Bertram L, Lill CM, Coassin S, Willeit J, Kiechl S, Willeit K, Mason D, Wright J, Morris R, Wanamethee G, Whincup P, Ben-Shlomo Y, McLachlan S, Price JF, Kivimaki M, Welch C, Sanchez-Galvez A, Marques-Vidal P, Nicolaides A, Panayiotou AG, Onland-Moret NC, van der Schouw YT, Matullo G, Fiorito G, Guarrera S, Sacerdote C, Wareham NJ, Langenberg C, Scott RA, Luan J, Bobak M, Malyutina S, Pajak A, Kubinova R, Tamosiunas A, Pikhart H, Grarup N, Pedersen O, Hansen T, Linneberg A, Jess T, Cooper J, Humphries SE, Brilliant M, Kitchner T, Hakonarson H, Carrell DS, McCarty CA, Lester KH, Larson EB, Crosslin DR, de Andrade M, Roden DM, Denny JC, Carty C, Hancock S, Attia J, Holliday E, Scott R, Schofield P, O'Donnell M, Yusuf S, Chong M, Pare G, van der Harst P, Said MA, Eppinga RN, Verweij N, Snieder H; Lifelines Cohort authors, Christen T, Mook-Kanamori DO; ICBP Consortium, Gustafsson S, Lind L, Ingelsson E, Pazoki R, Franco O, Hofman A, Uitterlinden A, Dehghan A, Teumer A, Baumeister S, Dörr M, Lerch MM, Völker U, Völzke H, Ward J, Pell JP, Meade T, Christophersen IE, Maitland-van der Zee AH, Baranova EV, Young R, Ford I, Campbell A, Padmanabhan S, Bots ML, Grobbee DE, Froguel P, Thuillier D, Roussel R, Bonnefond A, Cariou B, Smart M, Bao Y, Kumari M, Mahajan A, Hopewell JC, Seshadri S; METASTROKE Consortium of the ISGC, Dale C, Costa RPE, Ridker PM, Chasman DI, Reiner AP, Ritchie MD, Lange LA, Cornish AJ, Dobbins SE, Hemminki K, Kinnersley B, Sanson M, Labreche K, Simon M, Bondy M, Law P, Speedy H, Allan J, Li N, Went M, Weinhold N, Morgan G, Sonneveld P, Nilsson B, Goldschmidt H, Sud A, Engert A, Hansson M, Hemingway H, Asselbergs FW, Patel RS, Keating BJ, Sattar N, Houlston R, Casas JP, Hingorani AD. Phenome-wide association analysis of LDL-cholesterol lowering genetic variants in PCSK9. BMC Cardiovasc Disord. 2019 Oct 29;19(1):240. doi: 10.1186/s12872-019-1187-z. PubMed
Using doctor's notes to learn about drug reactions, dementia, and cannabis use.
Dr. Jennifer Nelson explains how KP scientists are helping the CDC and FDA keep an eye out for rare adverse events.
A Kaiser Permanente-led BCSC study is among the largest ever to evaluate adding MRI surveillance for breast cancer survivors.
Dr. Paula Lozano explains how a Learning Health System project finds Kaiser Permanente Washington members who could benefit most from preventive services.