Transforming Message Detection
Электронный научный архив УРФУ
Информация об архиве | Просмотр оригиналаПоле | Значение | |
Заглавие |
Transforming Message Detection
|
|
Автор |
Ermakova, L.
|
|
Тематика |
SPAM
TRANSFORMING MESSAGE N-GRAMS SVM DAMERAU-LEVENSHTEIN DISTANCE |
|
Описание |
The majority of existing spam filtering techniques suffers from several serious disadvantages. Some of them provide many false positives. The others are suitable only for email filtering and may not be used in IM and social networks. Therefore content methods seem to be more efficient. One of them is based on signature retrieval. However it is not change resistant. There are enhancements (e.g. checksums) but they are extremely time and resource consuming. That is why the main objective of this research is to develop a transforming message detection method. To this end we have compared spam in various languages, namely English, French, Russian and Italian. For each language the number of examined messages including spam and notspam was about 1000. 135 quantitative features have been retrieved. Almost all these features do not depend on the language. They underlie the first step of the algorithm based on support vector machine. The next stage is to test the obtained results applying N-gram approach. Special attention is paid to word distortion and text alteration. The obtaining results indicate the efficiency of the suggested approach. |
|
Дата |
2011-10-12T09:31:27Z
2011-10-12T09:31:27Z 2011 |
|
Тип |
Article
Journal article (info:eu-repo/semantics/article) Published version (info:eu-repo/semantics/publishedVersion) |
|
Идентификатор |
Ermakova L. Transforming Message Detection / L. Ermakova // Web of Data: The joint RuSSIR/EDBT 2011 Summer School, August 15–19, 2011, Proceedings of the Fifth Russian Young Scientists Conference in Information Retrieval / B. Novikov, P. Braslavsky (Eds.). — St. Petersburg, 2011 — P. 15-29.
978-5-288-05225-5 http://elar.urfu.ru/handle/10995/3708 |
|
Язык |
en
|
|
Связанные ресурсы |
RuSSIR/EDBT2011
|
|
Формат |
437949 bytes
application/pdf |
|
Издатель |
St. Petersburg University Press
|
|