This website requires JavaScript.
DOI: 10.1101/2023.03.14.23287143

LymphoML: An interpretable artificial intelligence-based method identifies morphologic features that correlate with lymphoma subtype

V.Shankar X. Yang V. Krishna ...+9 P. Rajpurkar
摘要
Lymphomas vary in terms of clinical behavior, morphology, and response to therapies and thus accurate classification is essential for appropriate management of patients. In this study, using a set of 670 cases of lymphoma obtained from a center in Guatemala City, we propose an interpretable machine learning method, LymphoML, for lymphoma subtyping into eight diagnostic categories. LymphoML sequentially applies steps of (1) object segmentation to extract nuclei, cells, and cytoplasm from H&E-stained TMA cores, (2) feature extraction of morphological, textural, and architectural features, and (3) aggregation of per-object features to create patch-level feature vectors for lymphoma classification. LymphoML achieves a diagnostic accuracy of 64.3% (95% CI: [55.8%, 72.9%]) among 8 lymphoma subtypes using only tissue microarray core sections stained with hematoxylin and eosin (H&E), at a level similar to experienced hematopathologists. We find that the best model's set of nuclear and cytoplasmic morphological, textural, and architectural features are most discriminative for diffuse large B-cell lymphoma (F1 score: 78.7%), classic Hodgkin lymphoma (F1 score: 74.5%), and mantle cell lymphoma (F1 score: 71.0%). We find that nuclear shape features provide the highest diagnostic yield, with nuclear texture, cytoplasmic, and architectural features providing smaller gains in accuracy. Finally, we find that combining information from the H&E-based model together with the results of a limited set of immunohistochemical (IHC) stains resulted in a similar diagnostic accuracy (85.3% ([79.1%, 90.7%])) as with a much larger set of IHC stains (86.1% ([79.8%, 91.5%])). Our work suggests a potential way to incorporate machine learning tools into clinical practice to reduce the number of expensive IHC stains while achieving a similar level of diagnostic accuracy.
展开全部
图表提取

暂无人提供速读十问回答

论文十问由沈向洋博士提出,鼓励大家带着这十个问题去阅读论文,用有用的信息构建认知模型。写出自己的十问回答,还有机会在当前页面展示哦。

Q1论文试图解决什么问题?
Q2这是否是一个新的问题?
Q3这篇文章要验证一个什么科学假设?