Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 노미나 | - |
dc.contributor.author | Jin Man HONG | - |
dc.date.accessioned | 2022-09-27T16:03:06Z | - |
dc.date.available | 2022-09-27T16:03:06Z | - |
dc.date.issued | 2022. 8 | - |
dc.identifier.uri | http://hanyang.dcollection.net/common/orgView/200000626802 | en_US |
dc.identifier.uri | https://repository.hanyang.ac.kr/handle/20.500.11754/174205 | - |
dc.description.abstract | The use of quantitative structure-activity relationship modeling is drawing attention to predicting the internal influence of chemical substances as an alternative to in vivo experiments. This method is mainly used in machine learning models by using three types of molecular structure information: the type of atom in a molecule, the coordinates of each atom, and chemical bonds. However, using all extracted features may reduce the performance due to interference of features. The disadvantage is that the model size increases, which needs more computational resources and running time. In this study, we used a method of selecting the top five feature groups to improve performance by reducing interference among features. The classification of ChemDes was used to group features. Tox21 data challenge 2014 and five DeepChem datasets were used to confirm the effectiveness of feature selection. CatBoost, Extra Trees, SVM, and Tabular RNN were selected as comparative models. The AUC-ROC score was higher when feature selection was applied in all four models. Based on the experimental results, we suggest that the classification performance can be improved by using the feature selection method. | - |
dc.publisher | 한양대학교 | - |
dc.title | Machine learning approaches applied to biological property prediction | - |
dc.type | Theses | - |
dc.contributor.googleauthor | 홍진만 | - |
dc.contributor.alternativeauthor | 홍진만 | - |
dc.sector.campus | S | - |
dc.sector.daehak | 대학원 | - |
dc.sector.department | 컴퓨터·소프트웨어학과 | - |
dc.description.degree | Master | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.