142 0

Machine learning approaches applied to biological property prediction

Title
Machine learning approaches applied to biological property prediction
Author
Jin Man HONG
Alternative Author(s)
홍진만
Advisor(s)
노미나
Issue Date
2022. 8
Publisher
한양대학교
Degree
Master
Abstract
The use of quantitative structure-activity relationship modeling is drawing attention to predicting the internal influence of chemical substances as an alternative to in vivo experiments. This method is mainly used in machine learning models by using three types of molecular structure information: the type of atom in a molecule, the coordinates of each atom, and chemical bonds. However, using all extracted features may reduce the performance due to interference of features. The disadvantage is that the model size increases, which needs more computational resources and running time. In this study, we used a method of selecting the top five feature groups to improve performance by reducing interference among features. The classification of ChemDes was used to group features. Tox21 data challenge 2014 and five DeepChem datasets were used to confirm the effectiveness of feature selection. CatBoost, Extra Trees, SVM, and Tabular RNN were selected as comparative models. The AUC-ROC score was higher when feature selection was applied in all four models. Based on the experimental results, we suggest that the classification performance can be improved by using the feature selection method.
URI
http://hanyang.dcollection.net/common/orgView/200000626802https://repository.hanyang.ac.kr/handle/20.500.11754/174205
Appears in Collections:
GRADUATE SCHOOL[S](대학원) > COMPUTER SCIENCE(컴퓨터·소프트웨어학과) > Theses (Master)
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE