Educational Sentiments Web Content Extraction and Analysis for Opinion Mining using Clustering Techniques

Authors

  • Sanjay Singh Bhadoria Faculty of Computer Application, Dr. A.P.J. Abdul Kalam University, Indore, India
  • Dhanraj Verma Faculty of Computer Application, Dr. A.P.J. Abdul Kalam University, Indore, India

Keywords:

Educational Data Mining, Learning Analytics, Sentimental Analysis, Opinion Mining, Classification, Clustering Techniques

Abstract

The number of data produced across the world is increasing and could keep on growing at an accelerating rate for years to come. At organizations total businesses, servers have been overflowing with usage logs, transaction records, message flows and business operations records, detector data along with cell device data. Given such large sizes of text data sets, mining programs, that arrange the writing data sets into structured knowledge, will enhance efficient document access. There`s a demand for a highly effective method to categorize and analyze web reviews being a significant statistics investigation. The task of refining and assessing such collective net information together is called opinion mining, also it is also called Sentiment Analysis. We compare clustering techniques such as EM, K-Means, and Farthest First – in density and distance modes – with a threshold partitioning technique on the resulting two-dimensional data. These clustering techniques are also enhanced with the use of histogram smoothing techniques. We then evaluate our approach using standard accuracy, precision and recall metrics.

 

References

Blei, D. M., Ng, A. Y., and Jordan, M. I. 2001. Latent dirichlet allocation. Advances in neural information processing systems. 601-608.

Calado, P., Cristo, M., Goncalves, M. A., de Moura, E. S., Ribeiro-Neto, B., and Ziviani, N. 2006. Link-based similarity measures for the classification of web documents. Journal of the American Society for Information Science and Technology (57:2), 208-221.

Chakrabarti, S., B. Dom and P. Indyk. 1998. Enhanced hypertext categorization using hyperlinks. Proceedings of ACM SIGMOD 1998.

Chen, Z., Wu, O., Zhu, M., and Hu, W. 2006. A novel web page filtering system by combining texts and images. Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, 732–735. Washington, DC IEEE Computer Society.

Cohen, W. 2002. Improving a page classifier with anchor extraction and link analysis. In S. Becker, S. Thrun, and K. Obermayer (Eds.), Advances in Neural Information Processing Systems (Vol. 15, Cambridge, MA: MIT Press) 1481– 1488.

Dumais, S. T., and Chen, H. 2000. Hierarchical classification of web content. Proceedings of SIGIR`00, 256-263.

V. Vapnik, “Statistical learning theory”, New York: Wiley, 1998.

C. J. C. Burges, “A tutorial on support vector machines for pattern recognition”, Data Mining and Knowledge Discovery, vol. 2, pp.121–167, 1998.

H.G. Jung, P.J. Yoon and J. Kim, “Genetic algorithm-based optimization of SVM-based pedestrian classifier”, In: The 22nd international technical conference on circuits/systems, computers and communications, pp. 783–784, July 2007.

J. Wang ,X. Hong, R. Ren and T.-H. Li, “A real-time intrusion detection system based on PSO-SVM”, In Proc. of the International Workshop on Information Security and Application 2009 (IWISA 2009), pp. 319–321, 2009.

R. Eberhart and J. Kennedy, “A new optimizer using particle swarm theory”, In Proc. of the Sixth International Symposium on Micro Machine and Human Science, Nagoya, Japan, IEEE, pp.39–43, 1995.

P. Hayati, V. Potdar, K. Chai and A. Talevski, “Web Spambot Detection Based on Web Navigation Behaviour”, In Proc. of the 24th IEEE International Conference on Advanced Information Networking and Applications(AINA 2010), Perth, Western Australia, pp. 797–803, 2010.

G. Venter and J. Sobieszczanski-Sobieski, “A Parallel Particle Swarm Optimization Algorithm Accelerated by Asynchronous Evaluations”, in 6th World Congresses of Structural and Multidisciplinary Optimization, Rio de Janeiro, Brazil, pp. 1–10, 30 May – 03 June 2005.

Downloads

Published

2020-04-30

How to Cite

[1]
S. S. Bhadoria and D. Verma, “Educational Sentiments Web Content Extraction and Analysis for Opinion Mining using Clustering Techniques”, Int. J. Sci. Res. Comp. Sci. Eng., vol. 8, no. 2, pp. 55–58, Apr. 2020.

Issue

Section

Review Article

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.