Skip to main content
Article

LASSO-Based Machine Learning Algorithm for Prediction of Lymph Node Metastasis in T1 Colorectal Cancer

Jeonghyun KangGangnam Severance HospitalYoon Jung ChoiDepartment of Pathology, Yongin Severance Hospital, Yonsei University College of Medicine, Seoul, KoreaIm-kyung KimDepartment of Surgery, Severance Hospital, Yonsei University College of Medicine, Seoul, KoreaHye Sun LeeBiostatistics Collaboration Unit, Yonsei University College of Medicine, Seoul, KoreaHogeun KimDepartment of Pathology, Severance Hospital, Yonsei University College of Medicine, Seoul, KoreaSeung Hyuk BaikDepartment of Surgery, Gangnam Severance Hospital, Yonsei University College of Medicine, Seoul, KoreaNam Kyu KimDepartment of Surgery, Severance Hospital, Yonsei University College of Medicine, Seoul, KoreaKang Young LeeDepartment of Surgery, Severance Hospital, Yonsei University College of Medicine, Seoul, Korea
2020en
ABI

Abstract

PURPOSE: The role of tumor-infiltrating lymphocytes (TILs) in predicting lymph node metastasis (LNM) in patients with T1 colorectal cancer (CRC) remains unclear. Furthermore, clinical utility of a machine learning-based approach has not been widely studied. MATERIALS AND METHODS: Immunohistochemistry for TILs against CD3, CD8, and forkhead box P3 in both center and invasive margin of the tumor were performed using surgically resected T1 CRC slides. Three hundred and sixteen patients were enrolled and categorized into training (n=221) and validation (n=95) sets via random sampling. Using clinicopathologic variables including TILs, the least absolute shrinkage and selection operator (LASSO) regression model was applied for variable selection and predictive signature building in the training set. The predictive accuracy of our model and the Japanese criteria were compared using area under the receiver operating characteristic (AUROC), net reclassification improvement (NRI)/integrated discrimination improvement (IDI), and decision curve analysis (DCA) in the validation set. RESULTS: LNM was detected in 29 (13.1%) and 12 (12.6%) patients in training and validation sets, respectively. Nine variables were selected and used to generate the LASSO model. Its performance was similar in training and validation sets (AUROC, 0.795 vs. 0.765; p=0.747). In the validation set, the LASSO model showed better outcomes in predicting LNM than Japanese criteria, as measured by AUROC (0.765 vs. 0.518, p=0.003) and NRI (0.447, p=0.039)/IDI (0.121, p=0.034). DCA showed positive net benefits in using our model. CONCLUSION: Our LASSO model incorporating histopathologic parameters and TILs showed superior performance compared to conventional Japanese criteria in predicting LNM in patients with T1 CRC.

Identifiers

Citations and references

Cited by 30 references