253 134

Full metadata record

DC FieldValueLanguage
dc.contributor.author고인송-
dc.date.accessioned2021-02-17T02:28:23Z-
dc.date.available2021-02-17T02:28:23Z-
dc.date.issued2019-12-
dc.identifier.citationGenomics & Informatics, v. 17, no. 4, article no. e40en_US
dc.identifier.issn2234-0742-
dc.identifier.urihttps://genominfo.org/journal/view.php?doi=10.5808/GI.2019.17.4.e40-
dc.identifier.urihttps://repository.hanyang.ac.kr/handle/20.500.11754/158534-
dc.description.abstractWhile studies aimed at detecting and analyzing indels or single nucleotide polymorphisms within human genomic sequences have been actively conducted, studies on detecting long insertions/deletions are not easy to orchestrate. For the last 10 years, the availability of long read data of human genomes from PacBio or Nanopore platforms has increased, which makes it easier to detect long insertions/deletions. However, because long read data have a critical disadvantage due to their relatively high cost, many next generation sequencing data are produced mainly by short read sequencing machines. Here, we constructed programs to detect so-called unmapped regions (UMRs, where no reads are mapped on the reference genome), scanned 40 Korean genomes to select UMR long deletion candidates, and compared the candidates with the long deletion break points within the genomes available from the 1000 Genomes Project (1KGP). An average of about 36,000 UMRs were found in the 40 Korean genomes tested, 284 UMRs were common across the 40 genomes, and a total of 37,943 UMRs were found. Compared with the 74,045 break points provided by the 1KGP, 30,698 UMRs overlapped. As the number of compared samples increased from 1 to 40, the number of UMRs that overlapped with the break points also increased. This eventually reached a peak of 80.9% of the total UMRs found in this study. As the total number of overlapped UMRs could probably grow to encompass 74,045 break points with the inclusion of more Korean genomes, this approach could be practically useful for studies on long deletions utilizing short read data.en_US
dc.description.sponsorshipThis research was supported by the Collaborative Genome Program for Fostering New Post-Genome Industry of the National Research Foundation (NRF) funded by the Ministry of Science and ICT (MSIT) (NRF-2017M3C9A6047623).en_US
dc.language.isoenen_US
dc.publisher한국유전체학회en_US
dc.subjectdeletionen_US
dc.subjectKoreanen_US
dc.subjectstructural variationen_US
dc.subjectunmapped regionen_US
dc.subjectwhole genome sequencingen_US
dc.titleAnalysis of unmapped regions associated with long deletions in Korean whole genome sequences based on short read dataen_US
dc.typeArticleen_US
dc.relation.no4-
dc.relation.volume17-
dc.identifier.doi10.5808/GI.2019.17.4.e40-
dc.relation.page40-40-
dc.relation.journalGenomics & Informatics-
dc.contributor.googleauthorLee, Yuna-
dc.contributor.googleauthorPark, Kiejung-
dc.contributor.googleauthorKoh, Insong-
dc.relation.code2019034966-
dc.sector.campusS-
dc.sector.daehakCOLLEGE OF MEDICINE[S]-
dc.sector.departmentDEPARTMENT OF MEDICINE-
dc.identifier.pidinsong-
dc.identifier.orcidhttps://orcid.org/0000-0001-6896-9748-


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE