ARTICLE
TITLE

WEB NEWS DOCUMENTS CLUSTERING IN INDONESIAN LANGUAGE USING SINGULAR VALUE DECOMPOSITION-PRINCIPAL COMPONENT ANALYSIS (SVDPCA) AND ANT ALGORITHMS

SUMMARY

Ant-based document clustering is a cluster method of measuring text documents similarity based on the shortest path between nodes (trial phase) and determines the optimal clusters of sequence document similarity (dividing phase). The processing time of trial phase Ant algorithms to make document vectors is very long because of high dimensional Document-Term Matrix (DTM). In this paper, we proposed a document clustering method for optimizing dimension reduction using Singular Value Decomposition-Principal Component Analysis (SVDPCA) and Ant algorithms. SVDPCA reduces size of the DTM dimensions by converting freq-term of conventional DTM to score-pc of Document-PC Matrix (DPCM). Ant algorithms creates documents clustering using the vector space model based on the dimension reduction result of DPCM. The experimental results on 506 news documents in Indonesian language demonstrated that the proposed method worked well to optimize dimension reduction up to 99.7%. We could speed up execution time efficiently of the trial phase and maintain the best F-measure achieved from experiments was 0.88 (88%).

 Articles related

Mira Ziveria, Ridha Sefina Samosir, Allysia Amanda Tjoaputri    

Lembaga Alkitab Indonesia (LAI or Indonesia Bible Society) is a nonprofit institution that exists to assist and support churches, organizations and Christians in Indonesia in carrying out the task of communion, witness and ministry through the provision ... see more


Husna Sarirah Husin,Nadiah Ruza    

When users visit a page, the browser records the referrer that sent the user to the page. If no referrer is recorded, it means that the user typed in the url of the page. Referrer can comes from various sites such as search engine site, social media site... see more


Jossandro Balardin Silva,Jacques Nelson Corleta Schreiber,Elpídio Oscar Benitez Nara    

This research was responsible for the development of a method for recommending news in online newspapers. This study takes into consideration that each reader has specific needs and interests when reading online newspapers, and it is a challenge to bring... see more


Iwan Putra Setiawan, Noor Miyono    

Information or news today is a very important thing, wherever and whenever everyone will surely need it. Media to get information and news is a lot, one of which is print media or newspapers. Newspapers are one of the mass media reported the events of ev... see more


Mo Chen    

This research has caught researchers’ wide attention for detecting network topic exactly with the arrival of big data era characterized by semi-structured or unstructured text. This paper proposes a model of network topic detection based on web usage beh... see more