Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/135208
Citations
Scopus Web of Science® Altmetric
?
?
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMäkinen, V.-P.-
dc.contributor.authorRehn, J.-
dc.contributor.authorBreen, J.-
dc.contributor.authorYeung, D.-
dc.contributor.authorWhite, D.L.-
dc.date.issued2022-
dc.identifier.citationInternational Journal of Molecular Sciences, 2022; 23(9):1-17-
dc.identifier.issn1661-6596-
dc.identifier.issn1422-0067-
dc.identifier.urihttps://hdl.handle.net/2440/135208-
dc.description.abstractRNA sequencing provides a snapshot of the functional consequences of genomic lesions that drive acute lymphoblastic leukemia (ALL). The aims of this study were to elucidate diagnostic associations (via machine learning) between mRNA-seq profiles, independently verify ALL lesions and develop easy-to-interpret transcriptome-wide biomarkers for ALL subtyping in the clinical setting. A training dataset of 1279 ALL patients from six North American cohorts was used for developing machine learning models. Results were validated in 767 patients from Australia with a quality control dataset across 31 tissues from 1160 non-ALL donors. A novel batch correction method was introduced and applied to adjust for cohort differences. Out of 18,503 genes with usable expression, 11,830 (64%) were confounded by cohort effects and excluded. Six ALL subtypes (ETV6::RUNX1, KMT2A, DUX4, PAX5 P80R, TCF3::PBX1, ZNF384) that covered 32% of patients were robustly detected by mRNA-seq (positive predictive value ≥ 87%). Five other frequent subtypes (CRLF2, hypodiploid, hyperdiploid, PAX5 alterations and Ph-positive) were distinguishable in 40% of patients at lower accuracy (52% ≤ positive predictive value ≤ 73%). Based on these findings, we introduce the Allspice R package to predict ALL subtypes and driver genes from unadjusted mRNA-seq read counts as encountered in real-world settings. Two examples of Allspice applied to previously unseen ALL patient samples with atypical lesions are included.-
dc.description.statementofresponsibilityVille-Petteri Mäkinen, Jacqueline Rehn, James Breen, David Yeung, and Deborah L. White-
dc.language.isoen-
dc.publisherMDPI AG-
dc.rights© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/).-
dc.source.urihttp://dx.doi.org/10.3390/ijms23094574-
dc.subjectacute lymphoblastic leukemia-
dc.subjectRNA-seq-
dc.subjectconfounder adjustment-
dc.subjectmachine learning-
dc.subject.meshHumans-
dc.subject.meshBurkitt Lymphoma-
dc.subject.meshOncogene Proteins, Fusion-
dc.subject.meshRNA, Messenger-
dc.subject.meshSequence Analysis, RNA-
dc.subject.meshPrecursor Cell Lymphoblastic Leukemia-Lymphoma-
dc.subject.meshTranscriptome-
dc.titleMulti-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia-
dc.typeJournal article-
dc.identifier.doi10.3390/ijms23094574-
dc.relation.granthttp://purl.org/au-research/grants/nhmrc/1160833-
pubs.publication-statusPublished-
dc.identifier.orcidRehn, J. [0000-0001-5043-6943]-
dc.identifier.orcidBreen, J. [0000-0001-6184-0925]-
dc.identifier.orcidYeung, D. [0000-0002-7558-9927]-
dc.identifier.orcidWhite, D.L. [0000-0003-4844-333X]-
Appears in Collections:Medical Sciences publications

Files in This Item:
File Description SizeFormat 
hdl_135208.pdfPublished version988.4 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.