411 0

Full metadata record

DC FieldValueLanguage
dc.contributor.author김영훈-
dc.date.accessioned2018-06-12T00:53:17Z-
dc.date.available2018-06-12T00:53:17Z-
dc.date.issued2017-04-
dc.identifier.citationINFORMATION SCIENCES, v. 385, Page. 438-456en_US
dc.identifier.issn0020-0255-
dc.identifier.issn1872-6291-
dc.identifier.urihttps://www.sciencedirect.com/science/article/pii/S002002551730018X-
dc.identifier.urihttps://repository.hanyang.ac.kr/handle/20.500.11754/72026-
dc.description.abstractData integration is the process of identifying pairs of records from different databases that refer to the same entity in the real world. It has been extensively studied with regard to entity resolution, record linkage, duplicate detection or network alignment. With the increasing use of crowdsourcing platforms as a means of assessing queries manually at low cost, many studies have begun to consider ways to exploit crowdsourcing systems for efficient data integration. In this paper, we present an efficient algorithm to integrate two graphs collected from different sources using crowdsourcing systems. Given two graphs, we repeatedly select a query node from a graph and request a human annotator to find its matching node from the other graph, which is considered to be the one indicating the same entity as the query node. The proposed method is to choose the query nodes that would increase the precision the most if it is labeled. By experiments with both the simulated answers and the labels collected by real crowdsourcing, we show that our algorithm finds more accurate graph matches with a smaller cost for crowdsourcing than the baseline algorithms. (C) 2017 Elsevier Inc. All rights reserved.en_US
dc.description.sponsorshipThis research was supported by Next-Generation Information Computing Development Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (2012M3C4 A7033342). This research was supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2016-H8501-16-1013) supervised by the IITP (Institute for Information & communication Technology Promotion). This work was also supported by Korea Institute of Energy Technology Evaluation and Planning (KETEP) granted financial resource from the Ministry of Trade, Industry & Energy, Republic of Korea (No. 20153010011980).en_US
dc.language.isoen_USen_US
dc.publisherELSEVIER SCIENCE INCen_US
dc.subjectGraph integrationen_US
dc.subjectCrowdsourcingen_US
dc.subjectEntity resolutionen_US
dc.subjectENTITY RESOLUTIONen_US
dc.subjectALGORITHMSen_US
dc.subjectDISTANCEen_US
dc.titleIntegration of graphs from different data sources using crowdsourcingen_US
dc.typeArticleen_US
dc.relation.volume385-
dc.identifier.doi10.1016/j.ins.2017.01.006-
dc.relation.page438-456-
dc.relation.journalINFORMATION SCIENCES-
dc.contributor.googleauthorKim, Younghoon-
dc.contributor.googleauthorJung, Woohwan-
dc.contributor.googleauthorShim, Kyuseok-
dc.relation.code2017002699-
dc.sector.campusE-
dc.sector.daehakCOLLEGE OF COMPUTING[E]-
dc.sector.departmentDIVISION OF COMPUTER SCIENCE-
dc.identifier.pidnongaussian-
Appears in Collections:
COLLEGE OF COMPUTING[E](소프트웨어융합대학) > COMPUTER SCIENCE(소프트웨어학부) > Articles
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE