Skip to main content
AkademIndex

Products

For developers

AkademBasesoonOpen API for the ecosystem
Latin
English
Article

The data retrieval optimization from the perspective of evidence-based medicine

Vladimir DobryninSaint-Petersburg State University, Universitetskaya nab., 7-9, Saint-Petersburg, RussiaYu. Е. BalykinaSaint-Petersburg State University, Universitetskaya nab., 7-9, Saint-Petersburg, RussiaMichael KamalovSaint-Petersburg State University, Universitetskaya nab., 7-9, Saint-Petersburg, RussiaА. С. КолбинPavlov First Saint-Petersburg State Medical University, L'va Tolstogo str., 6/8, Saint-Petersburg, RussiaE. VerbitskayaPavlov First Saint-Petersburg State Medical University, L'va Tolstogo str., 6/8, Saint-Petersburg, RussiaMunira KasimovaTashkent Institute of Postgraduate Medical Education, Parkent str., 51, Tashkent, Uzbekistan
ABI

Abstract

The paper is devoted to classification of MEDLINE abstracts into categories that correspond to types of medical interventions -types of patient treatments. This set of categories was extracted from Clinicaltrials.gov web site. Few classification algorithms were tested including Multinomial Naive Bayes, Multinomial Logistic Regression, and Linear SVM implementations from sklearn machine learning library. Document marking was based on the consideration of abstracts containing links to the Clinicaltrials.gov Web site. As the result of an automatical marking 3534 abstracts were marked for training and testing the set of algorithms metioned above. Best result of multinomial classification was achieved by Linear SVM with macro evaluation precision 70.06%, recall 55.62% and F-measure 62.01%, and micro evaluation precision 64.91%, recall 79.13% and F-measure 71.32%.

Topics

Identifiers

Citations and references

Metrics — AkademScholar · Coming soon