Просмотреть запись

Transforming Message Detection

Электронный научный архив УРФУ

Информация об архиве | Просмотр оригинала
 
 
Поле Значение
 
Заглавие Transforming Message Detection
 
Автор Ermakova, L.
 
Тематика SPAM
TRANSFORMING MESSAGE
N-GRAMS
SVM
DAMERAU-LEVENSHTEIN DISTANCE
 
Описание The majority of existing spam filtering techniques suffers from several serious
disadvantages. Some of them provide many false positives. The others are suitable only for
email filtering and may not be used in IM and social networks. Therefore content methods
seem to be more efficient. One of them is based on signature retrieval. However it is not change resistant. There are enhancements (e.g. checksums) but they are extremely time and resource consuming. That is why the main objective of this research is to develop a transforming message detection method. To this end we have compared spam in various languages, namely English, French, Russian and Italian. For each language the number of examined messages including spam and notspam was about 1000. 135 quantitative features have been retrieved. Almost all these features do not depend on the language. They underlie the first step of the algorithm based on support vector machine. The next stage is to test the obtained results
applying N-gram approach. Special attention is paid to word distortion and text alteration. The obtaining results indicate the efficiency of the suggested approach.
 
Дата 2011-10-12T09:31:27Z
2011-10-12T09:31:27Z
2011
 
Тип Article
Journal article (info:eu-repo/semantics/article)
Published version (info:eu-repo/semantics/publishedVersion)
 
Идентификатор Ermakova L. Transforming Message Detection / L. Ermakova // Web of Data: The joint RuSSIR/EDBT 2011 Summer School, August 15–19, 2011, Proceedings of the Fifth Russian Young Scientists Conference in Information Retrieval / B. Novikov, P. Braslavsky (Eds.). — St. Petersburg, 2011 — P. 15-29.
978-5-288-05225-5
http://elar.urfu.ru/handle/10995/3708
 
Язык en
 
Связанные ресурсы RuSSIR/EDBT2011
 
Формат 437949 bytes
application/pdf
 
Издатель St. Petersburg University Press