
Full metadata record

DC Field | Value | Language
dc.contributor.author | 최준원 (Choi, Jun Won) | -
dc.date.accessioned | 2022-10-17T05:48:28Z | -
dc.date.available | 2022-10-17T05:48:28Z | -
dc.date.issued | 2021-01 | -
dc.identifier.citation | 25th International Conference on Pattern Recognition (ICPR), pp. 4505-4512 | en_US
dc.identifier.issn | 1051-4651 | en_US
dc.identifier.uri | https://ieeexplore.ieee.org/document/9412795 | en_US
dc.identifier.uri | https://repository.hanyang.ac.kr/handle/20.500.11754/175472 | -
dc.description.abstract | Convolutional neural networks (CNNs) have led us to achieve significant progress in object detection research. To detect objects of various sizes, object detectors often exploit the hierarchy of the multiscale feature maps called feature pyramids, which are readily obtained by the CNN architecture. However, the performance of these object detectors is limited because the bottom-level feature maps, which experience fewer convolutional layers, lack the semantic information needed to capture the characteristics of the small objects. To address such problems, various methods have been proposed to increase the depth for the bottom-level features used for object detection. While most approaches are based on the generation of additional features through the top-down pathway with lateral connections, our approach directly fuses multi-scale feature maps using bidirectional long short-term memory (biLSTM) in an effort to leverage the gating functions and parameter-sharing in generating deeply fused semantics. The resulting semantic information is redistributed to the individual pyramidal feature at each scale through the channel-wise attention model. We integrate our semantic combining and attentive redistribution feature network (ScarfNet) with the baseline object detectors, i.e., Faster R-CNN, single-shot multibox detector (SSD), and RetinaNet. Experimental results show that our method offers a significant performance gain over the baseline detectors and outperforms the competing multiscale fusion methods in the PASCAL VOC and COCO detection benchmarks. [A minimal illustrative sketch of this fusion-and-redistribution step follows the record below.] | en_US
dc.description.sponsorship | This work was supported in part by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) (No. 2020-0-01373, Artificial Intelligence Graduate School Program (Hanyang University)) and the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2020R1A2C2012146). | en_US
dc.language.iso | en | en_US
dc.publisher | IEEE | en_US
dc.title | ScarfNet: Multi-scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection | en_US
dc.type | Article | en_US
dc.identifier.doi | 10.1109/ICPR48806.2021.9412795 | en_US
dc.relation.page | 4505-4512 | -
dc.contributor.googleauthor | Yoo, Jin Hyeok | -
dc.contributor.googleauthor | Kum, Dongsuk | -
dc.contributor.googleauthor | Choi, Jun Won | -
dc.relation.code | 20210142 | -
dc.sector.campus | S | -
dc.sector.daehak | COLLEGE OF ENGINEERING[S] | -
dc.sector.department | SCHOOL OF ELECTRICAL AND BIOMEDICAL ENGINEERING | -
dc.identifier.pid | junwchoi | -
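
The abstract describes two mechanisms: a biLSTM that fuses the multiscale feature maps across the scale axis, and a channel-wise attention model that redistributes the fused semantics back to each pyramid level. The following is a minimal PyTorch sketch of that idea only, assuming pyramid levels that share a channel count and nearest-neighbor resizing; the class name ScarfBlock, the hidden size, and the residual projection are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch (assumed names, not the paper's code): a biLSTM runs over
# the pyramid levels as a length-L sequence per spatial location, and a
# channel-wise attention block redistributes the fused semantics per level.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ScarfBlock(nn.Module):
    """Fuse a feature pyramid with a biLSTM over scales, then re-attend."""

    def __init__(self, channels: int, hidden_dim: int = 128):
        super().__init__()
        # biLSTM treats the L pyramid levels as a sequence for each pixel.
        self.bilstm = nn.LSTM(
            input_size=channels, hidden_size=hidden_dim,
            batch_first=True, bidirectional=True,
        )
        # Channel-wise attention: squeeze fused semantics to a gating vector.
        self.attn = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * hidden_dim, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        # Project fused semantics back to the pyramid's channel count.
        self.proj = nn.Conv2d(2 * hidden_dim, channels, kernel_size=1)

    def forward(self, pyramid):
        # pyramid: list of (B, C, Hi, Wi) maps, finest level first.
        b, c, h, w = pyramid[0].shape  # finest level as reference resolution
        # 1) Resize every level to the reference resolution.
        resized = [F.interpolate(p, size=(h, w), mode="nearest")
                   for p in pyramid]
        # 2) Stack scales as a sequence: (B*H*W, L, C).
        seq = torch.stack(resized, dim=1)                  # (B, L, C, H, W)
        seq = seq.permute(0, 3, 4, 1, 2).reshape(b * h * w, len(pyramid), c)
        fused, _ = self.bilstm(seq)                        # (B*H*W, L, 2*hid)
        outputs = []
        for i, p in enumerate(pyramid):
            # 3) Recover a fused semantic map for level i ...
            f = fused[:, i, :].reshape(b, h, w, -1).permute(0, 3, 1, 2)
            f = F.interpolate(f, size=p.shape[-2:], mode="nearest")
            # 4) ... and redistribute it via channel-wise attention,
            #    plus a projected residual of the fused semantics.
            outputs.append(p * self.attn(f) + self.proj(f))
        return outputs


if __name__ == "__main__":
    # Toy 256-channel pyramid at three scales (e.g., strides 8/16/32).
    levels = [torch.randn(1, 256, 64, 64),
              torch.randn(1, 256, 32, 32),
              torch.randn(1, 256, 16, 16)]
    out = ScarfBlock(channels=256)(levels)
    print([tuple(o.shape) for o in out])  # output shapes match the inputs
```

Treating the scale axis as a sequence lets the LSTM gates decide, per location, how much semantics to pass between coarse and fine levels; note that running the LSTM over every spatial position is memory-hungry, and a practical implementation would likely pool or tile before fusion.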
Appears in Collections:
COLLEGE OF ENGINEERING[S](공과대학) > ELECTRICAL AND BIOMEDICAL ENGINEERING(전기·생체공학부) > Articles
Files in This Item:
There are no files associated with this item.
