
A Block-based Graph Model for Effective Wrapper Maintenance

Title
A Block-based Graph Model for Effective Wrapper Maintenance
Author
최중민
Keywords
Information Extraction; Wrapper Maintenance; Graph Model
Issue Date
2009-07
Publisher
대한전자공학회
Citation
2009 ITC-CSCC: International Technical Conference on Circuits/Systems, Computers and Communications, pp. 886-888
Abstract
A wrapper is a program that automatically extracts information from web pages that have a structured form. Most web pages change frequently and grow constantly, and their structure changes often as well. A wrapper created earlier may therefore fail to extract information accurately from the changed pages: when the structure of a web page changes, the wrapper may fail in practice even though it targets the same information. To solve this problem, we propose a novel method that segments a web page into visual block units and composes them into a graph model. The resulting graph model supports efficient wrapper maintenance. By comparing the graph model of the original page with that of the changed page, the existing wrapper can still extract the target information even if the structure of the web page has changed.
URI
http://www.dbpia.co.kr/Journal/ArticleDetail/NODE01590708
https://repository.hanyang.ac.kr/handle/20.500.11754/104057
Appears in Collections:
COLLEGE OF ENGINEERING SCIENCES[E](공학대학) > COMPUTER SCIENCE AND ENGINEERING(컴퓨터공학과) > Articles
Files in This Item:
There are no files associated with this item.
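
The abstract describes segmenting a page into visual blocks, composing them into a graph, and comparing the old graph with the new one to relocate the target information. The record gives no algorithmic detail, so the following is only a minimal sketch of that general idea and not the paper's method. Everything in it is an assumption made for illustration: the `Block` class, the use of a text label per block, adjacency edges between blocks, the scoring formula in `block_sim`, and the weight and threshold values. Visual segmentation itself is not shown; the blocks are given by hand.

```python
from dataclasses import dataclass, field
from difflib import SequenceMatcher


@dataclass
class Block:
    """A visual block of a page (hypothetical representation)."""
    block_id: str
    label: str  # assumed: some text that characterizes the block
    neighbors: set = field(default_factory=set)  # ids of adjacent blocks


def build_graph(blocks, edges):
    """Build an undirected block graph as a dict: block_id -> Block."""
    graph = {bid: Block(bid, label) for bid, label in blocks}
    for a, b in edges:
        graph[a].neighbors.add(b)
        graph[b].neighbors.add(a)
    return graph


def text_sim(a, b):
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def block_sim(old_graph, old_id, new_graph, new_id, w_label=0.6):
    """Assumed score: weighted label similarity plus Jaccard overlap
    of the labels of neighboring blocks."""
    old, new = old_graph[old_id], new_graph[new_id]
    label_score = text_sim(old.label, new.label)
    old_ctx = {old_graph[n].label.lower() for n in old.neighbors}
    new_ctx = {new_graph[n].label.lower() for n in new.neighbors}
    union = old_ctx | new_ctx
    ctx_score = len(old_ctx & new_ctx) / len(union) if union else 1.0
    return w_label * label_score + (1 - w_label) * ctx_score


def relocate(old_graph, target_id, new_graph, threshold=0.5):
    """Return the id of the best-matching block in the new graph,
    or None if no block scores at least `threshold`."""
    if not new_graph:
        return None
    best_id = max(
        new_graph,
        key=lambda nid: block_sim(old_graph, target_id, new_graph, nid),
    )
    score = block_sim(old_graph, target_id, new_graph, best_id)
    return best_id if score >= threshold else None


# Old page layout: the wrapper was built to extract block "b2".
old_graph = build_graph(
    [("b1", "Site header"), ("b2", "Product price"),
     ("b3", "Product description"), ("b4", "Footer")],
    [("b1", "b2"), ("b2", "b3"), ("b3", "b4")],
)

# Changed page: blocks reordered and one block added.
new_graph = build_graph(
    [("n1", "Site header"), ("n2", "Product description"),
     ("n3", "Product price"), ("n4", "Related items"), ("n5", "Footer")],
    [("n1", "n2"), ("n2", "n3"), ("n3", "n4"), ("n4", "n5")],
)

print(relocate(old_graph, "b2", new_graph))
```

In this toy example the target block moves and gains a new neighbor, yet it is still found because its label and part of its neighborhood are unchanged. A position-based wrapper rule would break under the same change, which is the kind of situation the abstract is concerned with.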

