來自美國麻省理工學院和哈佛大學Broad研究院的研究人員,利用安捷倫(Agilent Technologies)卓越的寡核苷酸序列合成技術,合成大量生物素標記的RNA"誘餌"用於靶向序列的捕獲,成功應用於高通量平行靶向測序。這一重大科學進展作為封面文章發表在最新一期的"自然o生物科技"雜誌上。
下一代測序技術(Next Generation Sequencing)是對傳統測序技術的革命性變革,可以一次完成數十萬到數百萬條DNA分子的序列測定,使得在極短時間內對人類轉錄組和基因組進行細緻研究成為可能。2003年,"人類基因組計劃"獲得首張人類全基因組圖譜,為了測定人類基因組的30億對鹼基,數以千計的科學家們花費了超過30億美元和13年的時間。儘管與傳統測序技術相比,下一代測序技術的成本大幅降低,但目前測序一套個人基因組圖譜的成本仍然高達數十萬美元。所以,在現階段,如何實現對大批量樣本的高通量平行靶向測序,對許多科學和醫學研究課題具有重大意義。
為了實現這一目標,研究人員從生物晶片技術得到靈感,隨機選擇了1,900個人類基因,針對這些基因的15,565個編碼蛋白的外顯子序列(﹥2.5 Mb)以及另外4個基因組大片段序列(﹥1.7 Mb),根據tiling方法設計並且利用安捷倫的"SurePrint"技術合成了22,000條200-mers寡核苷酸探針。再經PCR和反轉錄後得到足夠量的生物素標記的RNA"誘餌",巧妙建立RNA-DNA的液相雜交系統,對全基因組DNA中的目的序列進行成功捕獲,序列數據經由Illumina公司的測序儀讀取。
測序結果表明:1)極佳的特異性:高達90%的讀出序列來源於RNA"誘餌"捕獲,其中和靶向序列完全重合的超過50%;2)優秀的均一性:對基因組大片段序列,至少和捕獲序列重合一半以上的RNA"誘餌"超過80%,即使對小片段的外顯子序列,這一數字也超過60%;3)卓越的重現性:2組技術重複試驗間的差異率不到10-5;4)可以精確地檢測SNP。綜上所述,實驗結果完美驗證了基於液相序列捕獲的高通量平行靶向測序技術的特異性,準確性,重現性和廣泛的應用前景。(生物谷Bioon.com)
生物谷推薦原始出處:
Nature Biotechnology. 2009 Feb;27(2):182-9. Epub 2009 Feb 1.
Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing
Andreas Gnirke1, Alexandre Melnikov1, Jared Maguire1, Peter Rogov1, Emily M LeProust2, William Brockman1,5, Timothy Fennell1, Georgia Giannoukos1, Sheila Fisher1, Carsten Russ1, Stacey Gabriel1, David B Jaffe1, Eric S Lander1,3,4 & Chad Nusbaum1
Targeting genomic loci by massively parallel sequencing requires new methods to enrich templates to be sequenced. We developed a capture method that uses biotinylated RNA 'baits' to fish targets out of a 'pond' of DNA fragments. The RNA is transcribed from PCR-amplified oligodeoxynucleotides originally synthesized on a microarray, generating sufficient bait for multiple captures at concentrations high enough to drive the hybridization. We tested this method with 170-mer baits that target 415,000 coding exons (2.5 Mb) and four regions (1.7 Mb total) using Illumina sequencing as read-out. About 90% of uniquely aligning bases fell on or near bait sequence; up to 50% lay on exons proper. The uniformity was such that B60% of target bases in the exonic 'catch', and B80% in the regional catch, had at least half the mean coverage. One lane of Illumina sequence was sufficient to call high-confidence genotypes for 89% of the targeted exon space.
1 Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA.
2 Agilent Technologies Inc., 5301 Stevens Creek Blvd., Santa Clara, California 95051, USA.
3 Department of Biology, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, Massachusetts 02139, USA.
4 Department of Systems Biology, Harvard Medical School, 200 Longwood Ave., Boston, Massachusetts 02115, USA. 5Present address: Google, Inc.,
5 Cambridge Center, Cambridge, Massachusetts 02142, USA.