502 0

Full metadata record

DC FieldValueLanguage
dc.contributor.author권보경-
dc.date.accessioned2022-05-23T06:51:38Z-
dc.date.available2022-05-23T06:51:38Z-
dc.date.issued2022-02-
dc.identifier.citationCOMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION; FEB 3 2022, 15p.en_US
dc.identifier.issn03610918-
dc.identifier.issn15324141-
dc.identifier.urihttps://www.tandfonline.com/doi/full/10.1080/03610918.2022.2037636-
dc.identifier.urihttps://repository.hanyang.ac.kr/handle/20.500.11754/171066-
dc.description.abstractModern computing and communication technologies can make data collection procedures very efficient, while our ability to analyze and extract information from large data sets is hard-pressed to keep up with our capacity for data collection. If we are interested in learning a classification/prediction rule that was not considered in the original data collection procedure, then there is usually a lack of label information in the original data. Thus, how to start with minimum number of labeled data and aggressively selected most informative samples for being labeled becomes an important issue. The thoughts of active learning, with subjects to be selected sequentially without using label information, is a possible outlet for this situation. In addition, if we can identify variables, from the lengthy variable list of the data set, for constructing an effective classification rule with good interpretation ability will be better. Here, we propose an active learning procedure targeted the maximum area under the receiver operating characteristic curve via a commonly use logistic model, which can sequentially select the effective informative subjects and variables, simultaneously. We discuss the asymptotic properties of the proposed procedure and illustrate it with some synthesized and real data sets.en_US
dc.description.sponsorshipA part of this research is supported by the National Natural Science Foundation of China (No. 11971457), the Anhui Provincial Natural Science Foundation (No. 1908085MA06), Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (No. 2021-0-01343, Artificial Intelligence Convergence Research Center (Hanyang University ERICA), 2021), and the Ministry of Science and Technology, Taiwan (MOST 108-2118-M-001-004-MY3). A part of this research is supported by the National Natural Science Foundation of China (No. 11971457), the Anhui Provincial Natural Science Foundation (No. 1908085MA06), Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (No. 2021-0-01343, Artificial Intelligence Convergence Research Center (Hanyang University ERICA), 2021), and the Ministry of Science and Technology, Taiwan (MOST 108-2118-M-001-004-MY3).en_US
dc.language.isoenen_US
dc.publisherTAYLOR & FRANCIS INCen_US
dc.subjectActive learningen_US
dc.subjectArea under ROC curveen_US
dc.subjectClassificationen_US
dc.subjectLocal optimal designen_US
dc.subjectShrinkage estimateen_US
dc.subjectStopping timeen_US
dc.titleActive learning with logistic models featuring simultaneous variable and subject selectionen_US
dc.typeArticleen_US
dc.identifier.doi10.1080/03610918.2022.2037636-
dc.relation.page1-15-
dc.relation.journalCOMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION-
dc.contributor.googleauthorWang, Zhanfeng-
dc.contributor.googleauthorKwon, Amy M.-
dc.contributor.googleauthorChang, Yuan-chin Ivan-
dc.relation.code2022039288-
dc.sector.campusE-
dc.sector.daehak[E]-
dc.identifier.pidamykwon-
Appears in Collections:
COLLEGE OF SPORTS AND ARTS[E](예체능대학) > ETC
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE