ARTICLE
TITLE

IMPLEMENTATION OF DBSCAN CLUSTERING ALGORITHM WITHIN THE FRAMEWORK OF THE OBJECTIVE CLUSTERING INDUCTIVE TECHNOLOGY BASED ON R AND KNIME TOOLS

SUMMARY

Context. The problem of data clustering within the framework of the objective clustering inductive technology is considered. The resulting hybrid model is implemented in practice through the combined use of R and KNIME tools. The object of the study is a hybrid data clustering model that combines the DBSCAN clustering algorithm with the objective clustering inductive technology.

Objective. The aim of the work is to create a hybrid model of objective clustering based on the DBSCAN clustering algorithm and to implement it in practice through the combined use of R and KNIME tools.

Method. Inductive methods of complex systems modelling were used as the basis for determining the optimal parameters of the DBSCAN clustering algorithm within the framework of the objective clustering inductive technology. The practical implementation of this technology involves: the use of two equal-power subsets, which contain the same number of pairwise similar objects; calculation of the internal and external clustering quality criteria; and calculation of the complex balance criterion, whose maximum value corresponds to the best clustering in terms of the criteria used. The process has two main stages. First, the optimal value of the EPS parameter was determined for each value of minPts within its range of variation. This stage produced a chart of the complex balance criterion versus the EPS value for each minPts value. Then, the intermediate results were analysed to determine the optimal solution, which corresponds both to the maximum value of the complex balance criterion and to the aims of the current clustering.

Results. The developed hybrid model was implemented in KNIME using plugins written in R. The efficiency of the model was tested on different data: low-dimensional data sets from the School of Computing of the University of Eastern Finland; Fisher's iris data set; and gene expression profiles of patients examined for lung cancer.

Conclusions. The results of the simulation have shown high efficiency of the proposed method. The studied objects were distributed into clusters correctly in all cases. The proposed method makes it possible to decrease the reproducibility error, since the decision on the optimal parameters of the clustering algorithm was based both on the clustering results obtained on each equal-power subset separately and on the difference between the clustering results obtained on the two equal-power subsets.
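The procedure described under Method can be sketched as follows. This is a minimal illustration in Python, not the authors' R and KNIME implementation, and it rests on several assumptions that the abstract does not confirm: the two equal-power subsets are built by greedily pairing nearest objects, the internal criterion is a scatter ratio, the external criterion is a Rand-style agreement between the two clusterings, and the complex balance criterion is their geometric mean. All function names are hypothetical.

```python
import math

def dbscan(points, eps, min_pts):
    """Plain DBSCAN; returns one label per point, -1 meaning noise.
    A point counts itself among its neighbours."""
    n = len(points)
    labels = [None] * n
    def neighbors(i):
        return [j for j in range(n) if math.dist(points[i], points[j]) <= eps]
    cluster = -1
    for i in range(n):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1            # provisional noise, may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster   # border point: joins, but is not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nj = neighbors(j)
            if len(nj) >= min_pts:    # core point: keep expanding the cluster
                queue.extend(nj)
    return labels

def split_equal_power(points):
    """Assumed split: closest unused pairs go one to A, one to B,
    so a[k] and b[k] are similar objects. An odd leftover is dropped."""
    pairs = sorted((math.dist(points[i], points[j]), i, j)
                   for i in range(len(points)) for j in range(i + 1, len(points)))
    used, a, b = set(), [], []
    for _, i, j in pairs:
        if i not in used and j not in used:
            used.update((i, j))
            a.append(points[i])
            b.append(points[j])
    return a, b

def internal_quality(points, labels):
    """Assumed internal criterion in [0, 1]: share of non-noise points
    times (1 - within-cluster scatter / total scatter)."""
    clusters = {}
    for p, lab in zip(points, labels):
        if lab != -1:
            clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        return 0.0
    kept = [p for c in clusters.values() for p in c]
    def scatter(ps):
        center = [sum(col) / len(ps) for col in zip(*ps)]
        return sum(math.dist(p, center) ** 2 for p in ps)
    within = sum(scatter(c) for c in clusters.values())
    return len(kept) / len(points) * (1.0 - within / scatter(kept))

def agreement(la, lb):
    """Assumed external criterion: share of object pairs on which both
    clusterings agree about co-membership (noise is treated as one group)."""
    n = len(la)
    same = sum((la[i] == la[j]) == (lb[i] == lb[j])
               for i in range(n) for j in range(i + 1, n))
    return same / (n * (n - 1) / 2)

def sweep(points, eps_grid, min_pts):
    """Return (eps, balance) for each eps at a fixed min_pts."""
    a, b = split_equal_power(points)
    results = []
    for eps in eps_grid:
        la, lb = dbscan(a, eps, min_pts), dbscan(b, eps, min_pts)
        balance = (internal_quality(a, la) * internal_quality(b, lb)
                   * agreement(la, lb)) ** (1 / 3)
        results.append((eps, balance))
    return results

if __name__ == "__main__":
    # Two well-separated toy groups; the data are illustrative only.
    data = [(x, y) for x in range(4) for y in range(4)]
    data += [(x + 50, y + 50) for x in range(4) for y in range(4)]
    for eps, balance in sweep(data, [1.5, 3.0, 6.0, 100.0], min_pts=3):
        print(f"eps={eps:6.1f}  balance={balance:.3f}")
```

In the workflow described in the abstract, a sweep of this kind would be repeated for each minPts value, and the resulting balance-versus-EPS curves would then be compared against the aims of the clustering before the final parameters are chosen.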
