331 226

Full metadata record

DC FieldValueLanguage
dc.contributor.author남진우-
dc.date.accessioned2019-11-24T16:28:36Z-
dc.date.available2019-11-24T16:28:36Z-
dc.date.issued2017-04-
dc.identifier.citationGENOME RESEARCH, v. 27, no. 6, page. 1050-1062en_US
dc.identifier.issn1088-9051-
dc.identifier.issn1549-5469-
dc.identifier.urihttps://genome.cshlp.org/content/27/6/1050-
dc.identifier.urihttps://repository.hanyang.ac.kr/handle/20.500.11754/113735-
dc.description.abstractThe advent of high-throughput RNA sequencing (RNA-seq) has led to the discovery of unprecedentedly immense transcriptomes encoded by eukaryotic genomes. However, the transcriptome maps are still incomplete partly because they were mostly reconstructed based on RNA-seq reads that lack their orientations (known as unstranded reads) and certain boundary information. Methods to expand the usability of unstranded RNA-seq data by predetermining the orientation of the reads and precisely determining the boundaries of assembled transcripts could significantly benefit the quality of the resulting transcriptome maps. Here, we present a high-performing transcriptome assembly pipeline, called CAFE, that significantly improves the original assemblies, respectively assembled with stranded and/or unstranded RNA-seq data, by orienting unstranded reads using the maximum likelihood estimation and by integrating information about transcription start sites and cleavage and polyadenylation sites. Applying large-scale transcriptomic data comprising 230 billion RNA-seq reads from the ENCODE, Human BodyMap 2.0, The Cancer Genome Atlas, and GTEx projects, CAFE enabled us to predict the directions of about 220 billion unstranded reads, which led to the construction of more accurate transcriptome maps, comparable to the manually curated map, and a comprehensive lncRNA catalog that includes thousands of novel lncRNAs. Our pipeline should not only help to build comprehensive, precise transcriptome maps from complex genomes but also to expand the universe of noncoding genomes.en_US
dc.description.sponsorshipWe thank Drs. Youngsoo Song (Hanyang University) and Jiwon Shim (Hanyang University) for helpful comments. We also thank all members of the BIG lab for helpful comments and discussions, and Daehyun Baek and Jinman Park for providing resources. The results shown here are in part based upon data generated by TCGA, GTEx, and FANTOM consortia, and ENCODE and Human BodyMap 2.0 Projects. This work was supported by the Basic Science Research Program through NRF, funded by the Ministry of Science, ICT & Future Planning (NRF-2014M3C9A3063541 and NRF-2012M3A9D1054516), supported by the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), which was funded by the Ministry of Health and Welfare (grant number: HI15C3224 and HI15C1578), and supported by the Program for Agriculture Science & Technology Development (Project No. PJ01045303) of the Rural Development Administration.en_US
dc.language.isoen_USen_US
dc.publisherCOLD SPRING HARBOR LAB PRESSen_US
dc.subjectRNA-SEQ READSen_US
dc.subjectINTEGRATIVE ANALYSISen_US
dc.subjectSPLICE JUNCTIONSen_US
dc.subjectHUMAN GENOMEen_US
dc.subjectLANDSCAPEen_US
dc.subjectRECONSTRUCTIONen_US
dc.subjectANNOTATIONen_US
dc.subjectEXPRESSIONen_US
dc.subjectREVEALSen_US
dc.subjectQUANTIFICATIONen_US
dc.titleHigh-Confidence Coding and Noncoding Transcriptome Mapsen_US
dc.typeArticleen_US
dc.identifier.doi10.1101/gr.214288.116-
dc.relation.page1-13-
dc.relation.journalGENOME RESEARCH-
dc.contributor.googleauthorYou, Bo-Hyun-
dc.contributor.googleauthorYoon, Sang-Ho-
dc.contributor.googleauthorNam, Jin-Wu-
dc.relation.code2017002187-
dc.sector.campusS-
dc.sector.daehakCOLLEGE OF NATURAL SCIENCES[S]-
dc.sector.departmentDEPARTMENT OF LIFE SCIENCE-
dc.identifier.pidjwnam-
dc.identifier.orcidhttp://orcid.org/0000-0003-0047-3687-


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE