Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Статья

Word Sense Disambiguation Using Clustered Sense Labels

Jeong Yeon ParkDepartment of Computer Science, Chungbuk National University, Cheongju 28644, KoreaHyeong Jin ShinDepartment of Computer Science, Chungbuk National University, Cheongju 28644, KoreaJae Sung LeeDepartment of Computer Science, Chungbuk National University, Cheongju 28644, Korea
2022en
ABI

Аннотация

Sequence labeling models for word sense disambiguation have proven highly effective when the sense vocabulary is compressed based on the thesaurus hierarchy. In this paper, we propose a method for compressing the sense vocabulary without using a thesaurus. For this, sense definitions in a dictionary are converted into sentence vectors and clustered into the compressed senses. First, the very large set of sense vectors is partitioned for less computational complexity, and then it is clustered hierarchically with awareness of homographs. The experiment was done on the English Senseval and Semeval datasets and the Korean Sejong sense annotated corpus. This process demonstrated that the performance greatly increased compared to that of the uncompressed sense model and is comparable to that of the thesaurus-based model.

Перевод пока недоступен

Идентификаторы

Цитирования и источники

Цитирований: 3Использованных источников: 0