Repository at Hanyang University: Tracking Persons-of-Interest via Adaptive Discriminative Features

Browse

My Repository

Repository at Hanyang UniversityCOLLEGE OF ENGINEERING[S](공과대학)COMPUTER SCIENCE(컴퓨터소프트웨어학부)Articles

Full metadata record

DC Field	Value	Language
dc.contributor.author	임종우	-
dc.date.accessioned	2018-11-05T02:09:31Z	-
dc.date.available	2018-11-05T02:09:31Z	-
dc.date.issued	2016-09	-
dc.identifier.citation	Computer Vision - 14th European Conference, ECCV 2016, Proceedings. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 2016), v. 9909, Page. 415-433	en_US
dc.identifier.isbn	978-3-31-946453-4	-
dc.identifier.isbn	978-3-319-46454-1	-
dc.identifier.issn	1611-3349	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	https://link.springer.com/chapter/10.1007%2F978-3-319-46454-1_26	-
dc.identifier.uri	https://repository.hanyang.ac.kr/handle/20.500.11754/79117	-
dc.description.abstract	Multi-face tracking in unconstrained videos is a challenging problem as faces of one person often appear drastically different in multiple shots due to significant variations in scale, pose, expression, illumination, and make-up. Low-level features used in existing multi-target tracking methods are not effective for identifying faces with such large appearance variations. In this paper, we tackle this problem by learning discriminative, video-specific face features using convolutional neural networks (CNNs). Unlike existing CNN-based approaches that are only trained on large-scale face image datasets offline, we further adapt the pre-trained face CNN to specific videos using automatically discovered training samples from tracklets. Our network directly optimizes the embedding space so that the Euclidean distances correspond to a measure of semantic face similarity. This is technically realized by minimizing an improved triplet loss function. With the learned discriminative features, we apply the Hungarian algorithm to link tracklets within each shot and the hierarchical clustering algorithm to link tracklets across multiple shots to form final trajectories. We extensively evaluate the proposed algorithm on a set of TV sitcoms and music videos and demonstrate significant performance improvement over existing techniques.	en_US
dc.description.sponsorship	The work is partially supported by National Basic Research Program of China (973 Program, 2015CB351705), NSFC (61332018), Office of Naval Research (N0014-16-1-2314), R&D programs by NRF (2014R1A1A2058501) and MSIP/IITP (IITP-2016-H8601-16-1005) of Korea, NSF CAREER (1149783) and gifts from Adobe and NVIDIA.	en_US
dc.language.iso	en	en_US
dc.publisher	ECCV/Springer	en_US
dc.subject	Convolutional Neural Network	en_US
dc.subject	Music Video	en_US
dc.subject	Multiple Shot	en_US
dc.subject	Visual Constraint	en_US
dc.subject	Data Association Problem	en_US
dc.title	Tracking Persons-of-Interest via Adaptive Discriminative Features	en_US
dc.type	Article	en_US
dc.identifier.doi	https://doi.org/10.1007/978-3-319-46454-1_26	-
dc.relation.page	415-433	-
dc.contributor.googleauthor	Zhang, Shun	-
dc.contributor.googleauthor	Gong, Yihong	-
dc.contributor.googleauthor	Huang, Jia-Bin	-
dc.contributor.googleauthor	Lim, Jongwoo	-
dc.contributor.googleauthor	Wang, Jinjun	-
dc.contributor.googleauthor	Ahuja, Narendra	-
dc.contributor.googleauthor	Yang, Ming-Hsuan	-
dc.relation.code	20160042	-
dc.sector.campus	S	-
dc.sector.daehak	COLLEGE OF ENGINEERING[S]	-
dc.sector.department	DEPARTMENT OF COMPUTER SCIENCE	-
dc.identifier.pid	jlim	-

Appears in Collections:: COLLEGE OF ENGINEERING[S](공과대학) > COMPUTER SCIENCE(컴퓨터소프트웨어학부) > Articles

Files in This Item:

Export: RIS (EndNote); XLS (Excel); XML

Show simple item record

한양대학교 리포지터리는 국립중앙도서관 OAK 보급사업으로 구축되었습니다. Feedback 개인정보처리방침

Hanyang University repository

Browse

My Repository

BROWSE