Full Paper View Go Back

An Efficient Cluster Formation Approach for Handling Large Scale Data

Anil 1 , Rajendra Gupta2

Section:Research Paper, Product Type: Journal-Paper
Vol.5 , Issue.1 , pp.1-6, Dec-2018


Online published on Dec 31, 2018


Copyright © Anil, Rajendra Gupta . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
 

View this paper at   Google Scholar | DPI Digital Library


XML View     PDF Download

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Anil, Rajendra Gupta, “An Efficient Cluster Formation Approach for Handling Large Scale Data,” World Academics Journal of Engineering Sciences, Vol.5, Issue.1, pp.1-6, 2018.

MLA Style Citation: Anil, Rajendra Gupta "An Efficient Cluster Formation Approach for Handling Large Scale Data." World Academics Journal of Engineering Sciences 5.1 (2018): 1-6.

APA Style Citation: Anil, Rajendra Gupta, (2018). An Efficient Cluster Formation Approach for Handling Large Scale Data. World Academics Journal of Engineering Sciences, 5(1), 1-6.

BibTex Style Citation:
@article{Gupta_2018,
author = {Anil, Rajendra Gupta},
title = {An Efficient Cluster Formation Approach for Handling Large Scale Data},
journal = {World Academics Journal of Engineering Sciences},
issue_date = {12 2018},
volume = {5},
Issue = {1},
month = {12},
year = {2018},
issn = {2347-2693},
pages = {1-6},
url = {https://www.isroset.org/journal/WAJES/full_paper_view.php?paper_id=1573},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.isroset.org/journal/WAJES/full_paper_view.php?paper_id=1573
TI - An Efficient Cluster Formation Approach for Handling Large Scale Data
T2 - World Academics Journal of Engineering Sciences
AU - Anil, Rajendra Gupta
PY - 2018
DA - 2018/12/31
PB - IJCSE, Indore, INDIA
SP - 1-6
IS - 1
VL - 5
SN - 2347-2693
ER -

147 Views    122 Downloads    86 Downloads
  
  

Abstract :
The basic K-means algorithm is the most widely used clustering algorithm and the best-known of the partitioning-based clustering methods. Choosing the initial centroids randomly results in poor clustering. Instead of optimal clustering (or global optimum), sub-optimal clustering (or local minimum) is obtained. One of the problems associated with the basic K-means algorithm is that empty clusters can be obtained if no points are allocated to a cluster during the assignment step. So implementation in basic k-mean algorithm gives better solution for large scale data analysis. In this paper, the k-mean algorithm and its implementation have done and the obtained results are also discussed. After the analysis, it is found that the implementation in k-mean algorithm provide better outcome.

Key-Words / Index Term :
Basic k-mean, implemented k-mean, Large Scale Data, Fuzzy C4.5

References :
[1] Bhagyashri S. Gandhi ,Leena A. Deshpande , “The Survey on Approaches to Efficient Clustering and Classification Analysis of Large scale data”, International Journal of Engineering Trends and Technology (IJETT) – Volume 36 Number 1- June 2016
[2] V. Estivill-Castro, Why so many clustering algorithms: A position paper, SIGKDD Explorations Newsletter, 4 (2002), pp. 65–75
[3] Sami Ayr ¨ am ¨ o Tommi K ¨ arkk ¨ ainen ¨,” Introduction to partitioning-based clustering methods with a robust example” University of Jyvaskyl ¨ a¨ ISBN 951-39-2467-X, ISSN 1456-4378, 2006
[4] R. Duda and P. Hart, Pattern Classification And Scene Analysis, John Wiley & Sons, Inc., Ny, 1973.
[5] V. Estivill-Castro, Why So Many Clustering Algorithms: A Position Paper, Sigkdd Explorations Newsletter, 4 (2002), Pp. 65–75.
[6] A. K. Jain and R. C. Dubes, Algorithms For Clustering Data, Prentice-Hall, Inc., Upper Saddle River, Nj, Usa, 1988.
[7] M. S. Aldenderfer and R. K. Blashfield, Cluster Analysis, Sage Publications, London, England, 2016.
[8] Tanvir Ahmad, Rafeeq Ahmad, Sarah Masud, FarheenNilofer, “Framework to Extract Context Vectors from Unstructured Data using Large scale data Analytics”, Pages: 1 -6, Ninth International Conference on Contemporary Computing (IC3) ,2016 IEEE
[9] Harleen,Naveen Garg, “Analysis of Hadoop Performance And Unstructured Data Using Zeppelin”, Year: 2016, Pages: 1 -6, International Conference on Research Advances in Integrated Navigation Systems, April 06-07, 2016 IEEE
[10] Radhika K R, Pushpa C N, Thriveni J, Venugopal K R, “EDSC: Efficient Document Subspace Clustering Technique for High-Dimensional Data”, International Conference on Computational Techniques in Information and Communication Technologies (ICCTICT),2016 IEEE.2016, 222
[11] Joe Tekli, “An Overview on XML Semantic Disambiguation from Unstructured Text to Semi-Structured Data: Background, Applications, and Ongoing Challenges”, IEEE Transactions on Knowledge and Data Engineering Year: 2016, Volume: 28, Issue: 6Pages: 1383 -1407,
[12] SarmadIstephan, Mohammad-Reza Siadat, “Extensible Query Framework for Unstructured Medical Data – A Large scale data Approach”, IEEE International Conference on Data Mining Workshop (ICDMW)Year: 2015, Pages: 455 - 462, DOI: 10.1109/ICDMW.2015.67, 2015
[13] I-Jen Chiang, “Agglomerative Algorithm to Discover Semantics From Unstructured Large scale data” , IEEE International Conference on Large scale data (Large scale data)Year: 2015 Pages: 1556 -1563, 2015
[14] Aarti Rahul Salunke, ArunNathaGaikwad, “Review of Unstructured Architecture for Voice and Data Services in Mobile Communication”, Year: 2015,Pages 1-7, DOI: 10.1109/ICECCT.2015.7226190,2015 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT).
[15] Mania Tlili, Tarek M. Hamdani, “Large scale data clustering validity”, 6th International Conference of Soft Computing and Pattern Recognition 348-352, 2014,.
[16] Ganeshayya Shidaganti, S. Prakash, “Feedback analysis of unstructured data from collaborative networking a Large scale data analytics approach”, CIMCA IEEE, International conference on circuits, communication, control and computing.2014, 343-347,

Authorization Required

 

You do not have rights to view the full text article.
Please contact administration for subscription to Journal or individual article.
Mail us at  support@isroset.org or view contact page for more details.

Go to Navigation