Full Paper View Go Back

Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis

Sunil Kumar1 , Rajendra Gupta2

Section:Research Paper, Product Type: Journal-Paper
Vol.11 , Issue.5 , pp.82-87, Oct-2023


Online published on Oct 31, 2023


Copyright © Sunil Kumar, Rajendra Gupta . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
 

View this paper at   Google Scholar | DPI Digital Library


XML View     PDF Download

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Sunil Kumar, Rajendra Gupta, “Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis,” International Journal of Scientific Research in Computer Science and Engineering, Vol.11, Issue.5, pp.82-87, 2023.

MLA Style Citation: Sunil Kumar, Rajendra Gupta "Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis." International Journal of Scientific Research in Computer Science and Engineering 11.5 (2023): 82-87.

APA Style Citation: Sunil Kumar, Rajendra Gupta, (2023). Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis. International Journal of Scientific Research in Computer Science and Engineering, 11(5), 82-87.

BibTex Style Citation:
@article{Kumar_2023,
author = {Sunil Kumar, Rajendra Gupta},
title = {Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis},
journal = {International Journal of Scientific Research in Computer Science and Engineering},
issue_date = {10 2023},
volume = {11},
Issue = {5},
month = {10},
year = {2023},
issn = {2347-2693},
pages = {82-87},
url = {https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=3289},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=3289
TI - Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis
T2 - International Journal of Scientific Research in Computer Science and Engineering
AU - Sunil Kumar, Rajendra Gupta
PY - 2023
DA - 2023/10/31
PB - IJCSE, Indore, INDIA
SP - 82-87
IS - 5
VL - 11
SN - 2347-2693
ER -

146 Views    256 Downloads    51 Downloads
  
  

Abstract :
The homonym words have the same spelling but have different meanings and these words found in almost every language. The homonyms are a source of noise in most text analysis and are difficult to detect. It essentially understands to make correspond to combinations of identifying / difference in parameters like sound, writing, and meaning, according to how the terms are traditionally used; the combination of same sound, same spelling, but distinct meaning is for homonyms. The paper presents a clustering improvement analysis using concept based document similarity method for homonym recognition based on concept based document similarity, which allows a word to be comprehended based on its context. The results show the proposed method shows better performance in clustering improvement and entropy calculation.

Key-Words / Index Term :
Concept based Document Similarity, Homonym Words, Clustering, Entropy

References :
[1] Müller MC (2017) “Semantic Author Name Disambiguation with Word Embeddings”, In: International Conference on Theory and Practice of Digital Libraries. Springer, pp.300–311, 2017.
[2] Pennington J, Socher R, Manning CD (2014) “Glove: Global Vectors for Word Representation”, In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1532–1543, 2014.
[3] Pittke F, Leopoldh, Mendling J (2015) “Automatic Detection and Resolution of Lexical Ambiguity in Process Models”, IEEE Trans Software Engineering, Vol.41, Issue.6, pp.526–544, 2015.
[4] Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I (2019) “Language Models are Unsupervised Multitask Learners”, Open AI Blog 1(8):9, 2019.
[5] Roll U, Correia RA, Berger-Tal O (2018) “Using Machine Learning to Disentangle Homonyms in Large Text Corpora”, Conservation Biology, Vol.32, Issue.3, pp.716–724, 2018.
[6] Santana AF, Gonçalves MA, Laender AH, Ferreira AA (2017) “Incremental Author Name Disambiguation by Exploiting Domain-Specific Heuristics”, Journal of Association Information Science & Technology, Vol.68, Issue.4, pp.931–945, 2017.
[7] Santos CN, Gatti M (2014) “Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts”, In: COLING, pp.69–78, 2014.
[8] Schiemann T, Leser U, Hakenberg J (2009) “Word Sense Disambiguation in Biomedical Applications: A Machine Learning Method”, In: Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration. IGI Global, pp.142–161, 2009.
[9] Schuemiemj, Kors JA, Monsb “Word Sense Disambiguation in the Biomedical Domain: An Overview”, Journal of Computer Biology Vol.12, Issue.5, pp.554–565, 2015.
[10] Schulz C, Mazloumian A, Petersen AM, Penner O, Helbing D “Exploiting Citation Networks for Large-Scale Author Name Disambiguation”, EPJ Data Science 3(1):11, 2014.
[11] Shaikh T, Deshpande D “A Review On Opinion Mining and Sentiment Analysis”, International Journal of Computer Application, 975:8887, 2016.
[12] Sharma S, Srivastava SK “Review on Text Mining Algorithms”, International Journal of Computer Applications, Vol.134, Issue.8, pp.39–43, 2016.
[13] Shen Q, Wu T, Yang H, Wu Y, Qu H, Cui W “Nameclarifier: A Visual Analytics System for Author Name Disambiguation”, IEEE Trans Vis Computer Graph, Vol.23, Issue.1, pp.141–150, 2016.
[14] Singh T “A Comprehensive Review of Text Mining”, International Journal of Computer Science and Information Technology, Vol.7, Issue.1, pp.167–169, 2016.
[15] Song M, Kim EHJ, Kim HJ “Exploring Author Name Disambiguation on Pubmed-Scale”, Journal of Informetric Vol.9, Issue.4, pp.924–941, 2015.
[16] Songa X, Mina YJ, Da-Xionga L, Fengb WZ, Shua C “Research on Text Error Detection and Repair Method Based on Online Learning Community”, Procedia Computer Science, 154: pp.13–19, 2019.

Authorization Required

 

You do not have rights to view the full text article.
Please contact administration for subscription to Journal or individual article.
Mail us at  support@isroset.org or view contact page for more details.

Go to Navigation