Could we automatically reproduce semantic relations of an information retrieval thesaurus?
Электронный научный архив УРФУ
Информация об архиве | Просмотр оригиналаПоле | Значение | |
Заглавие |
Could we automatically reproduce semantic relations of an information retrieval thesaurus?
|
|
Автор |
Panchenko, A.
|
|
Тематика |
THESAURUS
SEMANTIC RELATIONS VECTOR-SPACE MODEL DISTRIBUTIONAL ANALYSIS MULTIWORD EXPRESSIONS |
|
Описание |
A well constructed thesaurus is recognized as a valuable source of semantic information for various applications, especially for Information Retrieval. The main hindrances to using thesaurus-oriented approaches are the high complexity and cost of manual thesauri creation. This paper addresses the problem of automatic thesaurus construction, namely we study the quality of automatically extracted semantic relations as compared with the semantic relations of a manually crafted thesaurus. The vector-space model based on syntactic contexts was used to reproduce relations between the terms of a manually constructed thesaurus. We propose a simple algorithm for representing both single word and multiword terms in the distributional space of syntactic contexts. Furthermore, we propose a method for evaluation quality of the extracted relations. Our experiments show significant difference between the automatically and manually constructed relations: while many of the automatically generated relations are relevant, just a small part of them could be found in the original thesaurus.
|
|
Дата |
2010-11-09T07:30:50Z
2010-11-09T07:30:50Z 2010 |
|
Тип |
Article
Journal article (info:eu-repo/semantics/article) Published version (info:eu-repo/semantics/publishedVersion) |
|
Идентификатор |
Panchenko, A. Could we automatically reproduce semantic relations of an information retrieval thesaurus? / A. Panchenko // IV Российская летняя школа по информационному поиску RuSSIR’2010, 13-18 сентября 2010 г. : труды Четвертой Российской конференции молодых ученых по информационному поиску. — Воронеж : Издательско-полиграфический центр Воронежского государственного университета, 2010. — С. 36-51.
978-5-9273-1728-8 http://elar.urfu.ru/handle/10995/3058 |
|
Язык |
ru
|
|
Связанные ресурсы |
IV Российская летняя школа по информационному поиску RuSSIR’2010, 13-18 сентября 2010 г. : труды Четвертой Российской конференции молодых ученых по информационному поиску
|
|
Формат |
1608830 bytes
application/pdf |
|
Издатель |
Издательско-полиграфический центр Воронежского государственного университета
|
|