Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis

Sunil Kumar, Rajendra Gupta

Full Paper View Go Back

Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis

Sunil Kumar¹ , Rajendra Gupta²

Section:Research Paper, Product Type: Journal-Paper
Vol.11 , Issue.5 , pp.82-87, Oct-2023

Online published on Oct 31, 2023

Copyright © Sunil Kumar, Rajendra Gupta . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at Google Scholar | DPI Digital Library

XML View PDF Download

How to Cite this Paper

IEEE Citation
MLA Citation
APA Citation
BibTex Citation
RIS Citation

IEEE Style Citation: Sunil Kumar, Rajendra Gupta, “Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis,” International Journal of Scientific Research in Computer Science and Engineering, Vol.11, Issue.5, pp.82-87, 2023.

MLA Style Citation: Sunil Kumar, Rajendra Gupta "Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis." International Journal of Scientific Research in Computer Science and Engineering 11.5 (2023): 82-87.

APA Style Citation: Sunil Kumar, Rajendra Gupta, (2023). Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis. International Journal of Scientific Research in Computer Science and Engineering, 11(5), 82-87.

BibTex Style Citation:
@article{Kumar_2023,
author = {Sunil Kumar, Rajendra Gupta},
title = {Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis},
journal = {International Journal of Scientific Research in Computer Science and Engineering},
issue_date = {10 2023},
volume = {11},
Issue = {5},
month = {10},
year = {2023},
issn = {2347-2693},
pages = {82-87},
url = {https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=3289},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=3289
TI - Clustering Improvement in Homonym Detection using Concept Based Document Similarity with Conceptual Term Frequency Analysis
T2 - International Journal of Scientific Research in Computer Science and Engineering
AU - Sunil Kumar, Rajendra Gupta
PY - 2023
DA - 2023/10/31
PB - IJCSE, Indore, INDIA
SP - 82-87
IS - 5
VL - 11
SN - 2347-2693
ER -

146 Views

256 Downloads

51 Downloads

Bar Line

Abstract :
The homonym words have the same spelling but have different meanings and these words found in almost every language. The homonyms are a source of noise in most text analysis and are difficult to detect. It essentially understands to make correspond to combinations of identifying / difference in parameters like sound, writing, and meaning, according to how the terms are traditionally used; the combination of same sound, same spelling, but distinct meaning is for homonyms. The paper presents a clustering improvement analysis using concept based document similarity method for homonym recognition based on concept based document similarity, which allows a word to be comprehended based on its context. The results show the proposed method shows better performance in clustering improvement and entropy calculation.

Key-Words / Index Term :
Concept based Document Similarity, Homonym Words, Clustering, Entropy

References :
[1] Müller MC (2017) “Semantic Author Name Disambiguation with Word Embeddings”, In: International Conference on Theory and Practice of Digital Libraries. Springer, pp.300–311, 2017.
[2] Pennington J, Socher R, Manning CD (2014) “Glove: Global Vectors for Word Representation”, In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1532–1543, 2014.
[3] Pittke F, Leopoldh, Mendling J (2015) “Automatic Detection and Resolution of Lexical Ambiguity in Process Models”, IEEE Trans Software Engineering, Vol.41, Issue.6, pp.526–544, 2015.
[4] Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I (2019) “Language Models are Unsupervised Multitask Learners”, Open AI Blog 1(8):9, 2019.
[5] Roll U, Correia RA, Berger-Tal O (2018) “Using Machine Learning to Disentangle Homonyms in Large Text Corpora”, Conservation Biology, Vol.32, Issue.3, pp.716–724, 2018.
[6] Santana AF, Gonçalves MA, Laender AH, Ferreira AA (2017) “Incremental Author Name Disambiguation by Exploiting Domain-Specific Heuristics”, Journal of Association Information Science & Technology, Vol.68, Issue.4, pp.931–945, 2017.
[7] Santos CN, Gatti M (2014) “Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts”, In: COLING, pp.69–78, 2014.
[8] Schiemann T, Leser U, Hakenberg J (2009) “Word Sense Disambiguation in Biomedical Applications: A Machine Learning Method”, In: Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration. IGI Global, pp.142–161, 2009.
[9] Schuemiemj, Kors JA, Monsb “Word Sense Disambiguation in the Biomedical Domain: An Overview”, Journal of Computer Biology Vol.12, Issue.5, pp.554–565, 2015.
[10] Schulz C, Mazloumian A, Petersen AM, Penner O, Helbing D “Exploiting Citation Networks for Large-Scale Author Name Disambiguation”, EPJ Data Science 3(1):11, 2014.
[11] Shaikh T, Deshpande D “A Review On Opinion Mining and Sentiment Analysis”, International Journal of Computer Application, 975:8887, 2016.
[12] Sharma S, Srivastava SK “Review on Text Mining Algorithms”, International Journal of Computer Applications, Vol.134, Issue.8, pp.39–43, 2016.
[13] Shen Q, Wu T, Yang H, Wu Y, Qu H, Cui W “Nameclarifier: A Visual Analytics System for Author Name Disambiguation”, IEEE Trans Vis Computer Graph, Vol.23, Issue.1, pp.141–150, 2016.
[14] Singh T “A Comprehensive Review of Text Mining”, International Journal of Computer Science and Information Technology, Vol.7, Issue.1, pp.167–169, 2016.
[15] Song M, Kim EHJ, Kim HJ “Exploring Author Name Disambiguation on Pubmed-Scale”, Journal of Informetric Vol.9, Issue.4, pp.924–941, 2015.
[16] Songa X, Mina YJ, Da-Xionga L, Fengb WZ, Shua C “Research on Text Error Detection and Repair Method Based on Online Learning Community”, Procedia Computer Science, 154: pp.13–19, 2019.

Authorization Required

Close(X)

You do not have rights to view the full text article.
Please contact administration for subscription to Journal or individual article.
Mail us at support@isroset.org or view contact page for more details.

Full Paper View Go Back

Main Menu

Journals Contents

Information

Download

Publication Certificate

Contact Us

Use full Link