Wrapper Generation by Using XML-Based Domain Knowledge for Intelligent Information Extraction
- Author
- 최중민
- Issue Date
- 2002-08
- Publisher
- SPRINGER-VERLAG BERLIN
- Citation
- PRICAI 2002: Trends in Artificial Intelligence. PRICAI 2002. Lecture Notes in Computer Science, vol. 2417, pp. 472-481
- Abstract
- This paper discusses some of the issues in Web information extraction, focusing on automatic extraction methods that exploit wrapper induction. In particular, we point out the limitations of traditional heuristic-based wrapper generation systems and, as a solution, emphasize the importance of domain knowledge in the process of wrapper generation.
We demonstrate the effectiveness of domain knowledge by presenting our scheme of knowledge-based wrapper generation for semi-structured and labeled documents. Our agent-oriented information extraction system, XTROS, represents both the domain knowledge and the wrappers as XML documents to increase modularity, flexibility, and interoperability. XTROS shows good performance on several Web sites in the real estate domain, and it is expected to be easily adaptable to other domains by plugging in appropriate XML-based domain knowledge.
- URI
- https://link.springer.com/chapter/10.1007/3-540-45683-X_51
- https://repository.hanyang.ac.kr/handle/20.500.11754/157478
- ISBN
- 978-3-540-44038-3; 978-3-540-45683-4
- DOI
- 10.1007/3-540-45683-X_51
- Appears in Collections:
- COLLEGE OF ENGINEERING SCIENCES[E](공학대학) > COMPUTER SCIENCE AND ENGINEERING(컴퓨터공학과) > Articles