195 0

Wrapper generation by using XML-based domain knowledge for intelligent information extraction

Title
Wrapper generation by using XML-based domain knowledge for intelligent information extraction
Author
도경구
Issue Date
2002-08
Publisher
SPRINGER-VERLAG BERLIN
Citation
PRICAI 2002: Trends in Artificial Intelligence. PRICAI 2002. Lecture Notes in Computer Science, v. 2417, page. 472-481
Abstract
This paper discusses some of the issues in Web information extraction, focusing on automatic extraction methods that exploit wrapper induction. In particular, we point out the limitations of traditional heuristic-based wrapper generation systems, and as a solution, emphasize the importance of the domain knowledge in the process of wrapper generation. We demonstrate the effectiveness of domain knowledge by presenting our scheme of knowledge-based wrapper generation for semi-structured and labeled documents. Our agent-oriented information extraction system, XTROS, represents both the domain knowledge and the wrappers by XML documents to increase modularity, flexibility, and interoperability. XTROS shows good performance on several Web sites in the domain of real estate, and it is expected to be easily adaptable to different domains by plugging in appropriate XML-based domain knowledge.
URI
https://link.springer.com/chapter/10.1007/3-540-45683-X_51https://repository.hanyang.ac.kr/handle/20.500.11754/157476
ISBN
978-3-540-44038-3; 978-3-540-45683-4
DOI
10.1007/3-540-45683-X_51
Appears in Collections:
ETC[S] > 연구정보
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE