Full Paper View Go Back

Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm

Sanjay S Bhadoria1 , Rajendra Kumar Patel2

Section:Research Paper, Product Type: Isroset-Journal
Vol.2 , Issue.5 , pp.1-4, Sep-2014


Online published on Dec 02, 2014


Copyright © Sanjay S Bhadoria , Rajendra Kumar Patel . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
 

View this paper at   Google Scholar | DPI Digital Library


XML View     PDF Download

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Sanjay S Bhadoria , Rajendra Kumar Patel, “Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm,” International Journal of Scientific Research in Computer Science and Engineering, Vol.2, Issue.5, pp.1-4, 2014.

MLA Style Citation: Sanjay S Bhadoria , Rajendra Kumar Patel "Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm." International Journal of Scientific Research in Computer Science and Engineering 2.5 (2014): 1-4.

APA Style Citation: Sanjay S Bhadoria , Rajendra Kumar Patel, (2014). Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm. International Journal of Scientific Research in Computer Science and Engineering, 2(5), 1-4.

BibTex Style Citation:
@article{Bhadoria_2014,
author = {Sanjay S Bhadoria , Rajendra Kumar Patel},
title = {Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm},
journal = {International Journal of Scientific Research in Computer Science and Engineering},
issue_date = {9 2014},
volume = {2},
Issue = {5},
month = {9},
year = {2014},
issn = {2347-2693},
pages = {1-4},
url = {https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=145},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=145
TI - Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm
T2 - International Journal of Scientific Research in Computer Science and Engineering
AU - Sanjay S Bhadoria , Rajendra Kumar Patel
PY - 2014
DA - 2014/12/02
PB - IJCSE, Indore, INDIA
SP - 1-4
IS - 5
VL - 2
SN - 2347-2693
ER -

3344 Views    3228 Downloads    3134 Downloads
  
  

Abstract :
The Web today contains lots of information about subjects such as people, companies, organizations, products, etc. That may be of wide interest. Text mining is the technique that helps users to find useful information from a large amount of digital text documents on the Web or databases. This paper discusses The naive Bayes classifier algorithm of how to follow the appointed website or web page according to users request and in Internet by extraction on web mining.

Key-Words / Index Term :
Classification, Text Extraction, Link Crawler, Data Mining

References :
[01] Shaun Yin Gang Wang Yaqui Qiu Weiqun Zhang. ‖ Research and Implement of Classification Algorithm on Web Text Mining‖. IEEE.(2007)446-449
[02] Choi, B. and Peng, X., 2004. Dynamic and Hierarchical Classification of Web Pages. Online Information Review, Vol. 28, No. 2, pp. 139-147.
[03] Sam, L. Z., Maarof, M. A. B. and Selamat, A., 2006. Automated Web Pages Classification with Independent Component Analysis. Proceedings of the Postgraduate Annual Research Seminar. Vol. 1, pp. 466-469.
[03]. M. Castellano, G. Mastronardi, A. Aprile, and G. Tarricone ‖A Web Text Mining Flexible Architecture‖. World Academy of Science, Engineering and Technology 32 2007
[04] Catarina Silva, Bernardete Ribeiro ―Margin-based Active Learning and Background Knowledge in Text Mining‖.Proceedings of the Fourth International Conference on Hybrid Intelligent Systems (HIS‘04)IEEE
[05] Weiguo Fan1, Linda Wallace, Stephanie Rich, Zhongju Zhang ―Tapping into the Power of Text Mining‖.
[06] http://tartarus.org/~martin/PorterStemmer
[08] http://www.htmlparser.com
[09] Mahadevan, I., Karuppasamy, S. and Ramasamy, R., 2009. Resource Optimization in Automatic Web Page Classification using Integrated Feature Selection and Machine Learning. International Arab Journal of e-Technology, Vol. 1, No. 1, pp. 19-28.
[10] Zhang, B., Xu, M. and Xiu, L., 2012. A Web Site Classification Approach Based on its Topological Structure. International Journal on Asian Language Processing. Vol. 20, No. 2, pp. 75-86.

Authorization Required

 

You do not have rights to view the full text article.
Please contact administration for subscription to Journal or individual article.
Mail us at  support@isroset.org or view contact page for more details.

Go to Navigation