ARTICLE
TITLE

A COMPARISON OF DIFFERENT KERNEL FUNCTIONS OF SVM CLASSIFICATION METHOD FOR SPAM DETECTION

SUMMARY

Today, the use of e-mail, especially for formal online communication, is still often done. There is one common problem faced by e-mail users, which is the frequent receiving of spam messages. Spam messages are generally in the form of advertising or promotional messages in bulk to everyone. Of course this will cause inconvenience for people who receive the SPAM message. SPAM e-mails can be interpreted as junk messages or junk mail. So that spam has the nature of sending electronic messages repeatedly to the owner of the e-mail. This is abuse of the messaging system. One way to solve the spam problem is to identify spam messages for automatic message filtering. Several machine learning based methods are used to classify spam messages. In this study, a comparison was made between several kernel functions (i.e., linear, degree 1 polynomial, degree 2 polynomial, degree 3 polynomial, and RBF) of the SVM method to get the best SVM model in identifying spam messages. The evaluation results based on the Kaggle 1100 dataset showed that the best model were the SVM model with a linear kernel function and a degree 1 polynomial, where both models returned Precision = 0.99, Recall = 0.99, and F1-Score = 0.98. On the other hand, the RBF kernel produced lower performance in terms of Precision, Recall, and F1-Score of 0.95, 0.95, and 0.94, respectively.

KEYWORDS

 Articles related

Rizal Adi Saputra    

Image is a spatial dimension contains information, color, and not time-dependent. Nowadays image is very important for recognition system as source/data. In order to obtain certain information (features), image transformed or extracted.  Wavelet is ... see more

Revista: semanTIK

Mustafa Tuncay, Ali Haydar    

Differential Evolution algorithm (DE) is a well-known nature-inspired method in evolutionary computations scope. This paper adds some new features to DE algorithm and proposes a novel method focusing on ranking technique. The proposed method is named as ... see more


Inaam Rikan Hassan    

The current paper studies the performance efficiency of two uninformative priors, namely Bayes-Laplace (Uniform) prior and Jeffrey’s prior for Binomial model. Several performance measures, such as the Bayes estimators under different loss functions, the ... see more


Damir Cavka,Dragan Poljak,Andres Peratta    

Boundary and finite element modeling approach to the assessment of electrostatic field on human head generated by Video Display Units (VDU’s) are compared and discussed. Attention is focused to the field distribution over the surface of the face. The mat... see more


Cherry Galatia Ballangan    

Active contour, or snake, is an energy minimizing spline that is useful in image boundary detection. Active contours are stimulated by internal forces, image forces and external forces which maintain the shape of the contours while attract the contours t... see more