科學家實現染色體尺度下、單體型解析度的人類基因組裝配
作者:
小柯機器人發布時間:2020/12/9 13:24:14
美國哈佛醫學院Heng Li等研究人員合作實現染色體尺度下、單體型解析度的人類基因組裝配。相關論文於2020年12月7日在線發表在《自然—生物技術》雜誌上。
研究人員提出了一種稱為二倍體組裝(DipAsm)的方法,該方法對單個個體使用長而準確的讀長和長距離構象數據,可在1天之內生成染色體規模的定相組裝。DipAsm被應用於四個公共人類基因組,PGP1、HG002、NA12878和HG00733,DipAsm產生了單體型分辨的程序集,具有最小重疊群長度,可覆蓋50%的已知基因組(NG50)直至25 Mb,並在大約90.5%的雜合位點達到定相98-99%的準確性,且在連續性和定相完整性方面均優於其他方法。
研究人員證明了染色體規模的定相裝配對於發現結構變異(SV)的重要性,包括數千個新的轉座子插入,以及高度多態性和醫學上重要的區域,例如人白細胞抗原(HLA)和殺傷細胞免疫球蛋白樣受體(KIR)區域。DipAsm將促進高質量的精準醫療以及個體單體型變異和種群多樣性的研究。
據了解,單體型分辨或定相的基因組組裝可提供基因組及其複雜遺傳變異的完整圖譜。但是,當前用於定相裝配的算法要麼無法生成染色體尺度的定相,要麼需要譜系信息,這限制了它們的應用。
附:英文原文
Title: Chromosome-scale, haplotype-resolved assembly of human genomes
Author: Shilpa Garg, Arkarachai Fungtammasan, Andrew Carroll, Mike Chou, Anthony Schmitt, Xiang Zhou, Stephen Mac, Paul Peluso, Emily Hatas, Jay Ghurye, Jared Maguire, Medhat Mahmoud, Haoyu Cheng, David Heller, Justin M. Zook, Tobias Moemke, Tobias Marschall, Fritz J. Sedlazeck, John Aach, Chen-Shan Chin, George M. Church, Heng Li
Issue&Volume: 2020-12-07
Abstract: Haplotype-resolved or phased genome assembly provides a complete picture of genomes and their complex genetic variations. However, current algorithms for phased assembly either do not generate chromosome-scale phasing or require pedigree information, which limits their application. We present a method named diploid assembly (DipAsm) that uses long, accurate reads and long-range conformation data for single individuals to generate a chromosome-scale phased assembly within 1day. Applied to four public human genomes, PGP1, HG002, NA12878 and HG00733, DipAsm produced haplotype-resolved assemblies with minimum contig length needed to cover 50% of the known genome (NG50) up to 25Mb and phased ~99.5% of heterozygous sites at 98–99% accuracy, outperforming other approaches in terms of both contiguity and phasing completeness. We demonstrate the importance of chromosome-scale phased assemblies for the discovery of structural variants (SVs), including thousands of new transposon insertions, and of highly polymorphic and medically important regions such as the human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptor (KIR) regions. DipAsm will facilitate high-quality precision medicine and studies of individual haplotype variation and population diversity.
DOI: 10.1038/s41587-020-0711-0
Source: https://www.nature.com/articles/s41587-020-0711-0