Full metadata record

DC Field | Value | Language
dc.contributor.author | 김기범 | -
dc.date.accessioned | 2024-04-25T00:47:37Z | -
dc.date.available | 2024-04-25T00:47:37Z | -
dc.date.issued | 2022-09-20 | -
dc.identifier.citation | MULTIMEDIA TOOLS AND APPLICATIONS, v. 82, no. 9, pp. 13401-13430 | en_US
dc.identifier.issn | 1380-7501 | en_US
dc.identifier.issn | 1573-7721 | en_US
dc.identifier.uri | https://information.hanyang.ac.kr/#/eds/detail?an=edssjs.86A3EB14&dbId=edssjs | en_US
dc.identifier.uri | https://repository.hanyang.ac.kr/handle/20.500.11754/190002 | -
dc.description.abstract | Recent advances in vision technologies have substantially improved multi-object recognition and scene understanding. Scene-understanding tasks are a demanding part of several technologies, such as augmented-reality-based scene integration, robotic navigation, autonomous driving, and tourist guide applications. By incorporating visual information into contextually unified segments, super-pixel-based approaches significantly mitigate the clutter that is common in pixel-wise frameworks during scene understanding. Super-pixels yield patches of connected components with customized shapes and variable sizes. Furthermore, the computational time of these segmentation approaches can be significantly decreased because of the reduced number of super-pixel target clusters. Hence, super-pixel-based approaches are widely used in robotics, computer vision, and other intelligent systems. In this paper, we propose a Maximum Entropy scaled Super-Pixels (MEsSP) segmentation method that encapsulates super-pixel segmentation based on an entropy model and utilizes local energy terms to label the pixels. Initially, after acquisition and pre-processing, the image is segmented by two different methods: Fuzzy C-Means (FCM) and MEsSP. Then, dynamic geometrical, fast Fourier transform (FFT), blob, Maximally Stable Extremal Regions (MSER), and KAZE features are extracted from the segmented objects using the bag-of-features approach. Multiple kernel learning is then applied to categorize the objects. Finally, a deep belief network (DBN) assigns the relevant labels to the scenes based on the categorized objects, intersection-over-union scores, and the dice similarity coefficient. Experimental results on multi-object recognition accuracy, precision, recall, and F1 score over the PASCAL VOC, Caltech 101, and UIUC Sports datasets show remarkable performance. In addition, the proposed scene recognition method outperforms state-of-the-art (SOTA) methods on these benchmark datasets. | en_US
dc.description.sponsorship | This research was supported by the Ministry of Culture, Sports and Tourism and Korea Creative Content Agency (Project Number: R2021040093). | en_US
dc.language | en_US | en_US
dc.publisher | SPRINGER | en_US
dc.relation.ispartofseries | v. 82, no. 9; 13401-13430 | -
dc.subject | Bag of features | en_US
dc.subject | Deep belief network | en_US
dc.subject | Entropy-scaled segmentation | en_US
dc.subject | Super-pixels | en_US
dc.title | Maximum entropy scaled super pixels segmentation for multi-object detection and scene recognition via deep belief network | en_US
dc.type | Article | en_US
dc.relation.no | 9 | -
dc.relation.volume | 82 | -
dc.identifier.doi | 10.1007/s11042-022-13717-y | en_US
dc.relation.page | 13401-13430 | -
dc.relation.journal | MULTIMEDIA TOOLS AND APPLICATIONS | -
dc.contributor.googleauthor | Rafique, Adnan Ahmed | -
dc.contributor.googleauthor | Gochoo, Munkhjargal | -
dc.contributor.googleauthor | Jalal, Ahmad | -
dc.contributor.googleauthor | Kim, Kibum | -
dc.relation.code | 2023037732 | -
dc.sector.campus | E | -
dc.sector.daehak | COLLEGE OF COMPUTING[E] | -
dc.sector.department | SCHOOL OF MEDIA, CULTURE, AND DESIGN TECHNOLOGY | -
dc.identifier.pid | kibum | -
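
The abstract above describes an entropy-guided super-pixel pipeline. As a minimal illustrative sketch only (not the paper's implementation), the Python snippet below uses scikit-image's SLIC as a stand-in super-pixel generator and scores each super-pixel by the Shannon entropy of its intensity histogram; the library choice, the parameters, and the region_entropy helper are all assumptions made for illustration.

    # Illustrative sketch: SLIC stands in for the paper's MEsSP super-pixels,
    # and per-region Shannon entropy loosely approximates its entropy model.
    import numpy as np
    from skimage import data, segmentation, color

    img = data.astronaut()            # sample RGB image bundled with scikit-image
    gray = color.rgb2gray(img)        # intensity channel used for the entropy score
    labels = segmentation.slic(img, n_segments=200, compactness=10, start_label=0)

    def region_entropy(values, bins=32):
        # Shannon entropy (bits) of the intensity histogram of one super-pixel
        counts, _ = np.histogram(values, bins=bins, range=(0.0, 1.0))
        p = counts[counts > 0] / counts.sum()
        return float(-(p * np.log2(p)).sum())

    scores = {lab: region_entropy(gray[labels == lab]) for lab in np.unique(labels)}
    print("highest-entropy super-pixels:",
          sorted(scores, key=scores.get, reverse=True)[:5])

Note that in the paper itself the entropy model drives the segmentation and local energy terms assign the pixel labels, whereas this sketch only computes entropy after a conventional segmentation.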
Appears in Collections:
COLLEGE OF COMPUTING[E](소프트웨어융합대학) > MEDIA, CULTURE, AND DESIGN TECHNOLOGY(ICT융합학부) > Articles
Files in This Item:
There are no files associated with this item.

