193 0

FoDoSu: Multi-document summarization exploiting semantic analysis based on social Folksonomy

Title
FoDoSu: Multi-document summarization exploiting semantic analysis based on social Folksonomy
Author
이동호
Keywords
Multi-document summarization; Folksonomy; Tag cluster; Semantic analysis
Issue Date
2015-01
Publisher
ELSEVIER SCI LTD
Citation
INFORMATION PROCESSING & MANAGEMENT, v. 51, Page. 212-225
Abstract
Multi-document summarization techniques aim to reduce documents into a small set of words or paragraphs that convey the main meaning of the original document. Many approaches to multi-document summarization have used probability-based methods and machine learning techniques to simultaneously summarize multiple documents sharing a common topic. However, these techniques fail to semantically analyze proper nouns and newly-coined words because most depend on an out-of-date dictionary or thesaurus. To overcome these drawbacks, we propose a novel multi-document summarization system called FoDoSu, or Folksonomy-based Multi-Document Summarization, that employs the tag clusters used by Flickr, a Folksonomy system, for detecting key sentences from multiple documents. We first Create a word frequency table for analyzing the semantics and contributions of words using the HITS algorithm. Then, by exploiting tag clusters, we analyze the semantic relationships between words in the word frequency table. Finally, we create a summary of multiple documents by analyzing the importance of each word and its semantic relatedness to others. Experimental results from the TAC 2008 and 2009 data sets demonstrate the improvement of our proposed framework over existing summarization systems. (C) 2014 Elsevier Ltd. All rights reserved.
URI
http://hdl.handle.net/20.500.11754/29145
ISSN
0306-4573; 1873-5371
DOI
10.1016/j.ipm.2014.06.003
Appears in Collections:
COLLEGE OF COMPUTING[E] > MEDIA, CULTURE, AND DESIGN TECHNOLOGY(ICT융합학부) > Articles
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE