FoDoSu: Multi-document summarization exploiting semantic analysis based on social Folksonomy
- Title
- FoDoSu: Multi-document summarization exploiting semantic analysis based on social Folksonomy
- Author
- 이동호
- Keywords
- Multi-document summarization; Folksonomy; Tag cluster; Semantic analysis
- Issue Date
- 2015-01
- Publisher
- ELSEVIER SCI LTD
- Citation
- INFORMATION PROCESSING & MANAGEMENT, v. 51, Page. 212-225
- Abstract
- Multi-document summarization techniques aim to reduce documents into a small set of words or paragraphs that convey the main meaning of the original document. Many approaches to multi-document summarization have used probability-based methods and machine learning techniques to simultaneously summarize multiple documents sharing a common topic. However, these techniques fail to semantically analyze proper nouns and newly-coined words because most depend on an out-of-date dictionary or thesaurus. To overcome these drawbacks, we propose a novel multi-document summarization system called FoDoSu, or Folksonomy-based Multi-Document Summarization, that employs the tag clusters used by Flickr, a Folksonomy system, for detecting key sentences from multiple documents. We first Create a word frequency table for analyzing the semantics and contributions of words using the HITS algorithm. Then, by exploiting tag clusters, we analyze the semantic relationships between words in the word frequency table. Finally, we create a summary of multiple documents by analyzing the importance of each word and its semantic relatedness to others. Experimental results from the TAC 2008 and 2009 data sets demonstrate the improvement of our proposed framework over existing summarization systems. (C) 2014 Elsevier Ltd. All rights reserved.
- URI
- http://hdl.handle.net/20.500.11754/29145https://www.sciencedirect.com/science/article/pii/S0306457314000508?via%3Dihub
- ISSN
- 0306-4573; 1873-5371
- DOI
- 10.1016/j.ipm.2014.06.003
- Appears in Collections:
- COLLEGE OF COMPUTING[E](소프트웨어융합대학) > MEDIA, CULTURE, AND DESIGN TECHNOLOGY(ICT융합학부) > Articles
- Files in This Item:
There are no files associated with this item.
- Export
- RIS (EndNote)
- XLS (Excel)
- XML