
Full metadata record

DC Field | Value | Language
dc.contributor.author | 최준원 | -
dc.date.accessioned | 2019-10-18T01:45:58Z | -
dc.date.available | 2019-10-18T01:45:58Z | -
dc.date.issued | 2019-05 | -
dc.identifier.citation | 14th Asian Conference on Computer Vision, Page. 90-106 | en_US
dc.identifier.isbn | 978-3-030-20869-1 | -
dc.identifier.isbn | 978-3-030-20870-7 | -
dc.identifier.issn | 0302-9743 | -
dc.identifier.uri | https://link.springer.com/chapter/10.1007%2F978-3-030-20870-7_6 | -
dc.identifier.uri | https://repository.hanyang.ac.kr/handle/20.500.11754/111218 | -
dc.description.abstract | The goal of multi-modal learning is to use the complementary information on the relevant task provided by multiple modalities to achieve reliable and robust performance. Recently, deep learning has led to significant improvements in multi-modal learning by allowing the fusion of high-level features obtained at intermediate layers of deep neural networks. This paper addresses the problem of designing a robust deep multi-modal learning architecture in the presence of modalities degraded in quality. We introduce a deep fusion architecture for object detection which processes each modality with a separate convolutional neural network (CNN) and constructs joint feature maps by combining the intermediate features obtained by the CNNs. To facilitate robustness to degraded modalities, we employ a gated information fusion (GIF) network which weights the contribution of each modality according to the input feature maps to be fused. The combining weights are determined by applying convolutional layers followed by a sigmoid function to the concatenated intermediate feature maps. The whole network, including the CNN backbones and GIF, is trained in an end-to-end fashion. Our experiments show that the proposed GIF network offers additional architectural flexibility to achieve robust performance in handling degraded modalities. | en_US
dc.description.sponsorship | This work was supported by an Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (2016-0-00564, Development of Intelligent Interaction Technology Based on Context Awareness and Human Intention Understanding). | en_US
dc.language.iso | en | en_US
dc.publisher | Springer | en_US
dc.subject | Object detection | en_US
dc.subject | Multi-modal fusion | en_US
dc.subject | Sensor fusion | en_US
dc.subject | Gated information fusion | en_US
dc.title | Robust Deep Multi-modal Learning Based on Gated Information Fusion Network | en_US
dc.type | Article | en_US
dc.identifier.doi | 10.1007/978-3-030-20870-7_6 | -
dc.relation.page | 90-106 | -
dc.contributor.googleauthor | Kim, Jaekyum | -
dc.contributor.googleauthor | Koh, Junho | -
dc.contributor.googleauthor | Kim, Yecheol | -
dc.contributor.googleauthor | Choi, Jaehyung | -
dc.contributor.googleauthor | Hwang, Youngbae | -
dc.contributor.googleauthor | Choi, Jun Won | -
dc.relation.code | 20190171 | -
dc.sector.campus | S | -
dc.sector.daehak | COLLEGE OF ENGINEERING[S] | -
dc.sector.department | DIVISION OF ELECTRICAL AND BIOMEDICAL ENGINEERING | -
dc.identifier.pid | junwchoi | -
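The abstract above describes the gated information fusion (GIF) mechanism: per-modality gating weights are obtained by applying convolutional layers and a sigmoid to the concatenated intermediate feature maps, and the weighted features are combined into joint feature maps. The following is a minimal sketch of that idea, assuming a PyTorch implementation with two modalities; the module name, layer sizes, and the concatenate-then-1x1-convolution mixing step are illustrative assumptions, not the authors' exact architecture.

```python
# Minimal sketch of gated information fusion as outlined in the abstract.
# Assumptions: PyTorch, two modalities, illustrative channel counts.
import torch
import torch.nn as nn


class GatedInformationFusion(nn.Module):
    """Fuses two intermediate feature maps with input-dependent gating weights."""

    def __init__(self, channels: int):
        super().__init__()
        # One gate map per modality, computed from the concatenated features.
        self.gate_a = nn.Conv2d(2 * channels, 1, kernel_size=3, padding=1)
        self.gate_b = nn.Conv2d(2 * channels, 1, kernel_size=3, padding=1)
        # 1x1 convolution mixing the weighted features into a joint feature map
        # (an assumed combination step, for illustration only).
        self.mix = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, feat_a: torch.Tensor, feat_b: torch.Tensor) -> torch.Tensor:
        concat = torch.cat([feat_a, feat_b], dim=1)
        w_a = torch.sigmoid(self.gate_a(concat))  # weight for modality A
        w_b = torch.sigmoid(self.gate_b(concat))  # weight for modality B
        fused = torch.cat([feat_a * w_a, feat_b * w_b], dim=1)
        return self.mix(fused)


if __name__ == "__main__":
    gif = GatedInformationFusion(channels=256)
    a = torch.randn(1, 256, 38, 38)  # e.g. an intermediate RGB feature map
    b = torch.randn(1, 256, 38, 38)  # e.g. an intermediate thermal feature map
    print(gif(a, b).shape)           # torch.Size([1, 256, 38, 38])
```

In this sketch the gates let the network down-weight a modality whose feature maps are uninformative (e.g. degraded input), which is the robustness behavior the abstract attributes to GIF.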
Appears in Collections:
COLLEGE OF ENGINEERING[S](공과대학) > ELECTRICAL AND BIOMEDICAL ENGINEERING(전기·생체공학부) > Articles
Files in This Item:
There are no files associated with this item.