Repository at Hanyang University: Semantic Recognition of Human-Object Interactions via Gaussian-Based Elliptical Modeling and Pixel-Level Labeling

Browse

My Repository

Repository at Hanyang UniversityCOLLEGE OF COMPUTING[E](소프트웨어융합대학)MEDIA, CULTURE, AND DESIGN TECHNOLOGY(ICT융합학부)Articles

173 89

Full metadata record

DC Field	Value	Language
dc.contributor.author	김기범	-
dc.date.accessioned	2023-04-25T01:10:29Z	-
dc.date.available	2023-04-25T01:10:29Z	-
dc.date.issued	2021-07	-
dc.identifier.citation	IEEE ACCESS, v. 9.0, Page. 111249-111266	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	https://ieeexplore.ieee.org/document/9502603	en_US
dc.identifier.uri	https://repository.hanyang.ac.kr/handle/20.500.11754/179184	-
dc.description.abstract	Human-Object Interaction (HOI) recognition, due to its significance in many computer vision-based applications, requires in-depth and meaningful details from image sequences. Incorporating semantics in scene understanding has led to a deep understanding of human-centric actions. Therefore, in this research work, we propose a semantic HOI recognition system based on multi-vision sensors. In the proposed system, the de-noised RGB and depth images, via Bilateral Filtering (BLF), are segmented into multiple clusters using a Simple Linear Iterative Clustering (SLIC) algorithm. The skeleton is then extracted from segmented RGB and depth images via Euclidean Distance Transform (EDT). Human joints, extracted from the skeleton, provide the annotations for accurate pixel-level labeling. An elliptical human model is then generated via a Gaussian Mixture Model (GMM). A Conditional Random Field (CRF) model is trained to allocate a specific label to each pixel of different human body parts and an interaction object. Two semantic feature types that are extracted from each labeled body part of the human and labelled objects are: Fiducial points and 3D point cloud. Features descriptors are quantized using Fisher's Linear Discriminant Analysis (FLDA) and classified using K-ary Tree Hashing (KATH). In experimentation phase the recognition accuracy achieved with the Sports dataset is 92.88%, with the Sun Yat-Sen University (SYSU) 3D HOI dataset is 93.5% and with the Nanyang Technological University (NTU) RGB+D dataset it is 94.16%. The proposed system is validated via extensive experimentation and should be applicable to many computer-vision based applications such as healthcare monitoring, security systems and assisted living etc.	-
dc.description.sponsorship	This work was supported in part by the Basic Science Research Program through the National Research Foundation of Korea (NRF) under Grant 2018R1D1A1A02085645, in part by Korea Medical Device Development Fund Grant through Korean Government (the Ministry of Science and ICT; the Ministry of Trade, Industry and Energy; the Ministry of Health and Welfare; and the Ministry of Food and Drug Safety) under Grant 202012D05-02, and in part by Hanyang University under Grant 201800000000647.	-
dc.language	en	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.subject	Semantics	-
dc.subject	Image segmentation	-
dc.subject	Labeling	-
dc.subject	Feature extraction	-
dc.subject	Three-dimensional displays	-
dc.subject	Sensors	-
dc.subject	Biological system modeling	-
dc.subject	3D point cloud	-
dc.subject	fiducial points	-
dc.subject	human-object interaction	-
dc.subject	pixel labeling	-
dc.subject	semantic segmentation	-
dc.subject	super-pixels	-
dc.subject	K-ary tree hashing	-
dc.title	Semantic Recognition of Human-Object Interactions via Gaussian-Based Elliptical Modeling and Pixel-Level Labeling	-
dc.type	Article	-
dc.relation.volume	9.0	-
dc.identifier.doi	10.1109/ACCESS.2021.3101716	-
dc.relation.page	111249-111266	-
dc.relation.journal	IEEE ACCESS	-
dc.contributor.googleauthor	Khalid, Nida	-
dc.contributor.googleauthor	Ghadi, Yazeed Yasin	-
dc.contributor.googleauthor	Gochoo, Munkhjargal	-
dc.contributor.googleauthor	Jalal, Ahmad	-
dc.contributor.googleauthor	Kim, Kibum	-
dc.sector.campus	E	-
dc.sector.daehak	소프트웨어융합대학	-
dc.sector.department	ICT융합학부	-
dc.identifier.pid	kibum	-

Appears in Collections:: COLLEGE OF COMPUTING[E](소프트웨어융합대학) > MEDIA, CULTURE, AND DESIGN TECHNOLOGY(ICT융합학부) > Articles

Files in This Item:: 81672_김기범.pdf Download

Export: RIS (EndNote); XLS (Excel); XML

Show simple item record

한양대학교 리포지터리는 국립중앙도서관 OAK 보급사업으로 구축되었습니다. Feedback 개인정보처리방침

Hanyang University repository

Browse

My Repository

BROWSE