ARTICLE
TITLE

Feature Selection of Network Intrusion Data using Genetic Algorithm and Particle Swarm Optimization

SUMMARY

This paper describes the advantages of using Evolutionary Algorithms (EA) for feature selection on network intrusion datasets. Most current Network Intrusion Detection Systems (NIDS) are unable to detect intrusions in real time because of the high-dimensional data produced during daily operation. Extracting knowledge from data as large as intrusion data requires new approaches. The more complex the dataset, the higher the computation time and the harder it is to interpret and analyze. This paper investigates the performance of feature selection algorithms on network intrusion data. We used Genetic Algorithms (GA) and Particle Swarm Optimization (PSO) as feature selection algorithms. When applied to network intrusion datasets, both GA and PSO significantly reduced the number of features. Our experiments show that GA reduces the number of attributes from 41 to 15, while PSO reduces it from 41 to 9. Using k-Nearest Neighbour (k-NN) as the classifier, the GA-reduced dataset, which consists of 37% of the original attributes, improves accuracy from 99.28% to 99.70%, and its execution time is 4.8 times faster than that of the original dataset. Using the same classifier, the PSO-reduced dataset, which consists of 22% of the original attributes, has the fastest execution time (7.2 times faster than that of the original dataset). However, its accuracy drops slightly, by 0.02 percentage points, from 99.28% to 99.26%. Overall, both GA and PSO are good feature selection techniques: they significantly reduce the number of features while maintaining, and sometimes improving, classification accuracy and reducing computation time.
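The summary does not give implementation details, so the following is only a minimal sketch of how GA-based wrapper feature selection with a k-NN classifier might look. Everything in it is an assumption made for illustration: the toy data generator, the fitness function (accuracy minus a small per-feature penalty), the truncation selection, the one-point crossover, the bit-flip mutation, and all parameter values. It is not the authors' method and does not reproduce their results.

```python
import random


def make_toy_data(n_samples, n_features, n_informative, rng):
    """Hypothetical two-class data: only the first n_informative columns carry signal."""
    X, y = [], []
    for _ in range(n_samples):
        label = rng.randint(0, 1)
        row = [
            rng.gauss(3.0 * label, 1.0) if j < n_informative else rng.gauss(0.0, 1.0)
            for j in range(n_features)
        ]
        X.append(row)
        y.append(label)
    return X, y


def knn_accuracy(X_train, y_train, X_test, y_test, mask, k=3):
    """Accuracy of a k-NN classifier that only sees features where mask[i] == 1."""
    cols = [i for i, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    correct = 0
    for x, label in zip(X_test, y_test):
        # Squared Euclidean distance restricted to the selected columns.
        dists = sorted(
            (sum((x[c] - t[c]) ** 2 for c in cols), yt)
            for t, yt in zip(X_train, y_train)
        )
        votes = [yt for _, yt in dists[:k]]
        pred = max(set(votes), key=votes.count)  # majority vote among k neighbours
        correct += pred == label
    return correct / len(y_test)


def fitness(mask, data, penalty=0.01):
    """Assumed fitness: accuracy minus a small penalty per selected feature."""
    return knn_accuracy(*data, mask) - penalty * sum(mask)


def ga_select(data, n_features, pop_size=12, generations=8, p_mut=0.05, rng=None):
    """Evolve binary feature masks; returns the best mask in the final population."""
    rng = rng or random.Random(0)
    pop = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=lambda m: fitness(m, data), reverse=True)
        elite = ranked[: pop_size // 2]  # truncation selection, elite kept unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            cut = rng.randrange(1, n_features)  # one-point crossover
            child = a[:cut] + b[cut:]
            # Bit-flip mutation with probability p_mut per gene.
            child = [1 - bit if rng.random() < p_mut else bit for bit in child]
            children.append(child)
        pop = elite + children
    return max(pop, key=lambda m: fitness(m, data))


N_FEATURES = 8
X, y = make_toy_data(90, N_FEATURES, 2, random.Random(42))
data = (X[:60], y[:60], X[60:], y[60:])  # simple train/test split

best = ga_select(data, N_FEATURES, rng=random.Random(1))
print("selected feature indices:", [i for i, bit in enumerate(best) if bit])
print("k-NN accuracy on selected features:", knn_accuracy(*data, best))
```

Each candidate solution is a bit mask over the features, so the same encoding could be reused for a binary PSO variant by replacing the crossover and mutation steps with velocity-based bit updates. The per-feature penalty is one simple way to favour smaller subsets; other fitness definitions are equally plausible.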
