Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm

Sanjay S Bhadoria and  Rajendra Kumar Patel

Full Paper View Go Back

Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm

Sanjay S Bhadoria¹ , Rajendra Kumar Patel²

Section:Research Paper, Product Type: Isroset-Journal
Vol.2 , Issue.5 , pp.1-4, Sep-2014

Online published on Dec 02, 2014

Copyright © Sanjay S Bhadoria , Rajendra Kumar Patel . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at Google Scholar | DPI Digital Library

XML View PDF Download

How to Cite this Paper

IEEE Citation
MLA Citation
APA Citation
BibTex Citation
RIS Citation

IEEE Style Citation: Sanjay S Bhadoria , Rajendra Kumar Patel, “Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm,” International Journal of Scientific Research in Computer Science and Engineering, Vol.2, Issue.5, pp.1-4, 2014.

MLA Style Citation: Sanjay S Bhadoria , Rajendra Kumar Patel "Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm." International Journal of Scientific Research in Computer Science and Engineering 2.5 (2014): 1-4.

APA Style Citation: Sanjay S Bhadoria , Rajendra Kumar Patel, (2014). Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm. International Journal of Scientific Research in Computer Science and Engineering, 2(5), 1-4.

BibTex Style Citation:
@article{Bhadoria_2014,
author = {Sanjay S Bhadoria , Rajendra Kumar Patel},
title = {Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm},
journal = {International Journal of Scientific Research in Computer Science and Engineering},
issue_date = {9 2014},
volume = {2},
Issue = {5},
month = {9},
year = {2014},
issn = {2347-2693},
pages = {1-4},
url = {https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=145},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=145
TI - Web Text Content Extraction and Classification using Naïve Bayes Classifier Algorithm
T2 - International Journal of Scientific Research in Computer Science and Engineering
AU - Sanjay S Bhadoria , Rajendra Kumar Patel
PY - 2014
DA - 2014/12/02
PB - IJCSE, Indore, INDIA
SP - 1-4
IS - 5
VL - 2
SN - 2347-2693
ER -

3344 Views

3228 Downloads

3134 Downloads

Bar Line

Abstract :
The Web today contains lots of information about subjects such as people, companies, organizations, products, etc. That may be of wide interest. Text mining is the technique that helps users to find useful information from a large amount of digital text documents on the Web or databases. This paper discusses The naive Bayes classifier algorithm of how to follow the appointed website or web page according to users request and in Internet by extraction on web mining.

Key-Words / Index Term :
Classification, Text Extraction, Link Crawler, Data Mining

References :
[01] Shaun Yin Gang Wang Yaqui Qiu Weiqun Zhang. ‖ Research and Implement of Classification Algorithm on Web Text Mining‖. IEEE.(2007)446-449
[02] Choi, B. and Peng, X., 2004. Dynamic and Hierarchical Classification of Web Pages. Online Information Review, Vol. 28, No. 2, pp. 139-147.
[03] Sam, L. Z., Maarof, M. A. B. and Selamat, A., 2006. Automated Web Pages Classification with Independent Component Analysis. Proceedings of the Postgraduate Annual Research Seminar. Vol. 1, pp. 466-469.
[03]. M. Castellano, G. Mastronardi, A. Aprile, and G. Tarricone ‖A Web Text Mining Flexible Architecture‖. World Academy of Science, Engineering and Technology 32 2007
[04] Catarina Silva, Bernardete Ribeiro ―Margin-based Active Learning and Background Knowledge in Text Mining‖.Proceedings of the Fourth International Conference on Hybrid Intelligent Systems (HIS‘04)IEEE
[05] Weiguo Fan1, Linda Wallace, Stephanie Rich, Zhongju Zhang ―Tapping into the Power of Text Mining‖.
[06] http://tartarus.org/~martin/PorterStemmer
[08] http://www.htmlparser.com
[09] Mahadevan, I., Karuppasamy, S. and Ramasamy, R., 2009. Resource Optimization in Automatic Web Page Classification using Integrated Feature Selection and Machine Learning. International Arab Journal of e-Technology, Vol. 1, No. 1, pp. 19-28.
[10] Zhang, B., Xu, M. and Xiu, L., 2012. A Web Site Classification Approach Based on its Topological Structure. International Journal on Asian Language Processing. Vol. 20, No. 2, pp. 75-86.

Authorization Required

Close(X)

You do not have rights to view the full text article.
Please contact administration for subscription to Journal or individual article.
Mail us at support@isroset.org or view contact page for more details.

Full Paper View Go Back

Main Menu

Journals Contents

Information

Download

Publication Certificate

Contact Us

Use full Link