Computer Science & Electrical

Computer Science & Electrical

Archive
Join as an Editor/Reviewer

Enhancement of TextRank Algorithm using Coreference Resolution

Volume: 101  ,  Issue: 1 , May    Published Date: 22 May 2022
Publisher Name: IJRP
Views: 413  ,  Download: 351 , Pages: 114 - 123    
DOI: 10.47119/IJRP1001011520223190

Authors

# Author Name
1 Karla Jane C. Patosa
2 Marion James M. Hernandez
3 Vivien A. Agustin
4 Richard Regala
5 Khatalyn E. Mata
6 Dan Michael A. Cortez
7 Leisyl Mahusay
8 Andrew Bitancor
9 Perferinda Caubang

Abstract

TextRank Algorithm is an unsupervised graph-based algorithm by Mihalcea with two primary applications, namely in text summarization and keyword extraction of a text document. This study will focus on the enhancement of the TextRank algorithm on the keyword extraction side. This paper introduces an enhanced version of the algorithm wherein the whole text document is preprocessed with a method known as Coreference Resolution, wherein this method normalizes every referenced entity in a text into a single entity. The application of this method to a document text with a longer sequence, outperforms the Precision, Recall, F1-Measure, and the Mean Average Precision metrics of the original algorithm.

Keywords

  • Natural Language Processing
  • Keyword Extraction
  • TextRank Algorithm
  • Coreference Resolution