420 0

Full metadata record

DC FieldValueLanguage
dc.contributor.author남진우-
dc.date.accessioned2021-05-14T02:06:44Z-
dc.date.available2021-05-14T02:06:44Z-
dc.date.issued2020-03-
dc.identifier.citationCOMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, v. 18, page. 814-820en_US
dc.identifier.issn2001-0370-
dc.identifier.urihttps://www.sciencedirect.com/science/article/pii/S2001037019304039?via%3Dihub-
dc.identifier.urihttps://repository.hanyang.ac.kr/handle/20.500.11754/162038-
dc.description.abstractThe Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-Cas systems, including dead Cas9 (dCas9), Cas9, and Cas12a, have revolutionized genome engineering in mammalian somatic cells. Although computational tools that assess the target sites of CRISPR-Cas systems are inevitably important for designing efficient guide RNAs (gRNAs), they exhibit generalization issues in selecting features and do not provide optimal results in a comprehensive manner. Here, we introduce a Comprehensive Guide Designer (CGD) for four different CRISPR systems, which utilizes the machine learning algorithm, Elastic Net Logistic Regression (ENLOR), to autonomously generalize the models. CGD contains specific models trained with public datasets generated by CRISPRi, CRISPRa, CRISPR-Cas9, and CRISPR-Cas12a (designated as CGDi, CGDa, CGD9, and CGD12a, respectively) in an unbiased manner. The trained CGD models were benchmarked to other regression-based machine learning models, such as ElasticNet Linear Regression (ENLR), Random Forest and Boruta (RFB), and Extreme Gradient Boosting (Xgboost) with inbuilt feature selection. Evaluation with independent test datasets showed that CGD models outperformed the pre-existing methods in predicting the efficacy of gRNAs. All CGD source codes and datasets are available at GitHub (https://gitub.com/vipinmenon1989/CGD), and the CGD webserver can be accessed at http://big.hanyang.ac.kr:2195/CGD.en_US
dc.description.sponsorshipThis work was supported by the Bio and Medical Technology Development Program and the Basic Science Research Program through the National Research Foundation (NRF), funded by the Ministry of Science and ICT, South Korea (grant numbers 2014M3C9A3063541 and 2018R1A2B2003782).en_US
dc.language.isoenen_US
dc.publisherELSEVIERen_US
dc.subjectCRISPR systemen_US
dc.subjectCas9en_US
dc.subjectCas12aen_US
dc.subjectdCas9en_US
dc.subjectgRNA designen_US
dc.subjectMachine learningen_US
dc.subjectLogistic regressionen_US
dc.titleCGD: Comprehensive guide designer for CRISPR-Cas systemsen_US
dc.typeArticleen_US
dc.relation.volume18-
dc.identifier.doi10.1016/j.csbj.2020.03.020-
dc.relation.page814-820-
dc.relation.journalCOMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL-
dc.contributor.googleauthorMenon, A Vipin-
dc.contributor.googleauthorSohn, Jang-il-
dc.contributor.googleauthorNam, Jin-Wu-
dc.relation.code2020049734-
dc.sector.campusS-
dc.sector.daehakCOLLEGE OF NATURAL SCIENCES[S]-
dc.sector.departmentDEPARTMENT OF LIFE SCIENCE-
dc.identifier.pidjwnam-
Appears in Collections:
COLLEGE OF NATURAL SCIENCES[S](자연과학대학) > LIFE SCIENCE(생명과학과) > Articles
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE