265 0

클래스 불균형 문제가 있는 다중클래스 텍스트 분류에서의 특징 선택 방법

Title
클래스 불균형 문제가 있는 다중클래스 텍스트 분류에서의 특징 선택 방법
Other Titles
Feature Selection Method from Multiclass Text with Class Imbalance Problem
Author
허선
Keywords
Text Classification; Class Imbalance; Multi-Class Text Data; Feature Selection
Issue Date
2019-04
Publisher
대한산업공학회
Citation
대한산업공학회지, v. 45, No. 2, Page. 93-100
Abstract
A text classification model in which one of the class variables is biased to the majority class typically classifiesmost documents into the majority class to enhance the overall classification accuracy. It is called a classimbalance problem. This study proposes a feature selection method based on simplified chi-square statistics toselect features in each class for developing a robust model to the problem. Proposed method and typical featureselection methods are compared by Reuter21578 data. Experiment shows that the proposed method is superior totypical feature selection methods in terms of naïve Bayes and support vector machine which are robust to theclass imbalance problem.
URI
http://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE08000390&language=ko_KRhttps://repository.hanyang.ac.kr/handle/20.500.11754/112975
ISSN
1225-0988
DOI
10.7232/JKIIE.2019.45.2.093
Appears in Collections:
COLLEGE OF ENGINEERING SCIENCES[E](공학대학) > INDUSTRIAL AND MANAGEMENT ENGINEERING(산업경영공학과) > Articles
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE