昆蟲基因組文獻解讀|綠盲蝽基因組

2021-02-20 顫抖吧小蟲子
昆蟲基因組文獻解讀|綠盲蝽基因組

2020年8月28日MER在線發表中國農科院植保所王桂榮課題組關於綠盲蝽基因組測序的文章。本文主要從基因組層面揭示綠盲蝽的多食性以及食葉特性。

綠盲蝽基因組1 | Abstract

綠盲蝽屬於半翅目盲蝽科昆蟲,基因組大小為1.02Gb,contigN50為785kb,scaffold N50為68Mb,1016Mb的contig組裝成17個大的scaffold,對應17條染色體。其巨大的基因組與轉座子的擴增有關。綠盲蝽屬於雜食性昆蟲,主要取食葉肉。與消化,化學感受以及解毒相關的基因家族的擴增與其雜食性的特性有關。研究人員發現其唾液腺中分泌一種特異性的果膠酶(polygalacturonase),這與其主要取食葉片的習性可能相關。

2 | MATERIALS AND METHODS2.1 | Insect rearing and genomic sequencing

inbred strain:12 generations

Illumina sequencing:a female adult

PacBio sequencing:100 female siblings

2.2 | Assembly and polishing of contigs

contig assembly

Canu:(ovsMethod = sequential genomeSize = 1g):18,403 contigs of total length ~1.97 Gb, contig N50 of 259 Kb.

Three rounds of contig polishing

The first round:contigs were polished using PacBio reads with the Arrow consensus caller in smrt-link . The original bam files generated from PacBio Sequel were aligned with contig assembly by pbalign. Then, using arrow , we polished the assembly.The second and third rounds: filter out adaptors and low quality sequences in raw Illumina reads. The clean data were mapped to the contigs using bwa and the assembly errors were corrected using pilon.

To filter haplotypic duplication

purge _ dups on the polished assembly: purged primary assembly of total length 1.03 Gb, contig N50 of 785 kb and a haplotig assembly of total length 936 Mb, contig N50 of 88 kb.

To assess the completeness of genome assembly, we run busco using the insecta database (OrthoDB version 9).

2.3 | Filtering contamination contigs

we used clean Illumina data to filter possible contaminations in assembly. We used bwa to align clean Illumina data with the assembly and if any contig had an Illumina coverage rate lower than 5%, it was removed.

2.4 | Scaffolding with LACHESIS

After quality control, clean Hi-C paired-end reads were first mapped to the contig assembly by bowtie2, and then hic-pro used the alignment to detect valid alignments and filter multiple hits and singletons. Finally, lachesis  was used to cluster, order and orient the contigs.

2.5 | Transcriptome library preparation and sequencing

The 20 samples included spawn, eight tissues (antenna, mouthpart, salivary gland, head, gut, leg, wing, body) from third instar nymphs, and 11 tissues (male antenna, female antenna, mouthpart, salivary gland, head, gut, leg, wing, male genital, female genital, body) from adults.

RNA samples from the whole body of A. lucorum in six different developmental stages including first to fifth nymph, and adult were also prepared for full-length transcriptome sequencing using the PacBio Iso-Seq protocol.

2.6 | Genome annotation

基因結構注釋

Tandem repeats were identified by tandem repeats finder

Transposable elements (TEs)

searching against the TE database (dfam 3.0, RepBase) by repeatmasker and searching against the TE protein database by repeatproteinmask.

constructing a de novo repeat library by repeatmodeler, followed by repeatmasker to find TE repeats.

The gene models in A. lucorum were predicted using augustus on the TE soft-masked genome, integrating evidence from RNA sequencing alignments, Isoform sequencing alignments and protein homology searches.

RNA sequencing alignments

20 paired-end datasets from different tissues were aligned with the genome using star. After filtering by filterBam in augustus, the sorted bam file was transferred to a hints file by bam2hints in augustus.

Isoform sequencing alignments:

we used Iso-Seq to assist in gene prediction. gmap was used to align Iso-Seq sequences with the genome and blat2hints.pl in augustus was used to generate a hints file.

Protein homology evidence

all Hemiptera proteins in NCBI RefSeq were download. We aligned the proteins with the genome by tblastn using 1e−5 as cutoff and filtered those with less than 50% identity.

We used exonerate to align the remaining proteins with the genome and used exonerate2hints.pl in augustus to generate a hints file.

Finally, we combined all hints files from RNA-Seq, Iso-Seq and protein homology, and used augustus with the combined hints file to predict gene models, resulting in 23,106 gene models.

To get accurate gene sets, we filtered genes with less than 35 amino acids. We aligned protein sequences of gene models with the NR database in diamond blastp using 1e−5 as a cutoff, and 16,187 gene models had homologous proteins in NR.

We also aligned 20 RNA-Seq data sets with coding sequences of gene models in bwa and 17,953 gene models had a coverage rate higher than 95%. We retained genes that either had homologous proteins in NR or had RNA support, resulting in 20,386 genes. After that, we detected 33 genes with two or more errors in start codons, stop codons or nontriplet length. We filtered those wrong genes and got the final official gene set (OGS) including 20,353 gene models.

基因功能注釋

we aligned the protein sequences of genes with kegg, eggnog, nr, swiss-prot databases by diamond, using 1e−5 as a cutoff and got the best hit. We also used interproscan to search interpro databases to find motifs and domains. Taken together, ==18,721 (91.98%) genes had homologous information in those databases==, indicating that the OGS is reasonably accurate.

Moreover, trnascan-se was used to find tRNAs with default parameters.

2.7 | Evolutionary analysis

Nine sequenced hemipteran insects and Drosophlia melanogaster as an outgroup were used to infer gene

orthology in OrthoFinder.

Phylogenetic tree and gene orthology results were displayed and annotated using Evolview.

Expanded orthologous groups in A. lucorum were determined using a rank sum test compared to other eight insects in Hemiptera.

Protein sequences of single copy genes from each species were aligned in muscle, then concatenated into one super-sequence. PhyML was used to reconstruct the phylogenetic tree based on the concatenated super-sequence with the LG + I + G + F model. Divergence times among species were calculated in mcmctree (paml package). Calibration times were set according to a previous paper, minimum = 320 Ma and maximum = 390 Ma for D. melanogaster and A. lucorum.

GO (Gene Ontology) annotation results were obtained from Interpro. GO enrichment analysis was performed using the OmicShare tools.

The reciprocal BLAST best hit was used to calculate the synonymous mutation rate (Ks) by kaks _ calculator 2.0 with default parameters.

Duplicate_gene_classifier in MCscanX was implemented to classify the origins of the duplicated genes into different types.

2.8 | Analysis of the digestive enzyme, chemosensory receptor, and detoxification enzyme genes

A set of described Hemiptera odorant receptors (ORs) and gustatory receptors (GRs) was used to search the A. lucorum gene sets by blastp with the cutoff e-value 1e−5.

Multiple PSI-BLASTP searches were initiated with divergent ORs and GRs to find any additional annotated proteins that might belong to these families, and up to four iterations were used. Finally, some ORs and GRs were corrected manually.

Ionotropic receptors (IRs), digestive enzymes, and detoxification enzymes were annotated using diamond results compared to the nr database, uniprot database and kegg database with e-value 1e−5 and confirmed by InterProScan or eggNOG.

To get a complete gene family set, reannotation of the gene family was performed. First, all digestive enzyme, chemosensory receptor, and detoxification enzyme genes got from former gene set was mapped to eight Hemiptera genomes by exonerate with identity >35%, and exonerate2hints.pl was used to generate a hints file.

Then, the region where these genes can map was used to predict gene models by augustus with the hints file, and short gene models (less than 200 bp) were filtered. The predicted gene model that doesn't exist in former gene set was added to the gene family sets.

3 | RESULTS3.1 | Chromosome-level genome assembly and recent expansion of DNA and LINE TEsMajor indicators of the Apolygus lucorum genomeIn A. lucorum, LTR (98 Mb), LINE (73 Mb) and DNA (88 Mb) elements are the major types of TEs, and LTR is considerably in excess of that from other compared insects.The genome landscape of Apolygus lucorum3.2 | Gene expansion and recent gene burst promote environmental adaptability

The phylogeny showed that A. lucorum diverged from C. lectularius about 168 million years ago (Ma) and from A. pisum about 275 MYA.

Gene ontology analyses observed significant enriched GO terms involved in odorant recognition, including sensory perception of smell (GO: 0007608) and sensory perception of chemical stimulus (GO:0007606; Figure S9), which provided clues for the extremely broad host plant ranges of A. lucorum.

enriched GO terms associated with digestion in A. lucorum were also observed, such as hydrolase activity, acting on glycosyl bonds (GO: 001698), hydrolase activity, hydrolysing O-glycosyl compounds (GO: 0004553) and polygalacturonase (PG) activity (GO: 0004650).

PG is an essential enzyme for digestion, which hydrolyse spectin substances and then destroys plant cell walls.

[x] These expanded genes could play an important role in the severe damage on a wide range of plants, as PGs can hydrolyse the pectin substances and then destroy the plant cell walls and ORs could promote the pest search for host.[x] Novel genes were mostly generated from gene duplication, which is recognized as a driving force of evolution.[x] Using a within-genome reciprocal best blast hit, 2,609 paralogue pairs were identified, and distribution of synonymous distances (Ks values) showed that 1,502 (58%) paralogue pairs had a Ks value smaller than 0.3, suggesting that most gene duplications possibly occurred in a recent period.[x] recent duplicated genes in A. lucorum are mostly derived from small local scale gene duplications, instead of whole genome duplications.05-01_orthologues05-02_paraloguesGenome evolution of Apolygus lucorum3.3 | Expansion of digestive enzyme genes promotes processing of diverse foods

A. lucorum had a comprehensive digestive enzyme spectrum, with a unique group of polygalacturonase (PGs) and a significantly expanded group of serine proteases(SPs).

PG is a group of plant cell wall-degrading enzyme, ubiquitous in fungi, bacteria, and plants. It is also found in Hemiptera and Coleoptera, predicted to be horizontally transferred from fungi.

The expression profile showed that 55 PGs were specifically expressed in salivary gland with high expression levels, indicating that the salivary gland of A. lucorum has a very high ability to synthesize PGs.

SPs are involved in various physiological processes of insects, such as digestion, development and innate immunity.

The expansion of SPs in A. lucorum can improve its digestive capacity and may contribute to its omnivorous

feeding habit, mainly phytophagous with prey to complement.

Miridae-specific polygalacturonases (PGs) and expansion of serine proteases (SPs) elucidate omnivorousness of Apolygus lucorum.3.4 | Rapid evolution of chemosensory receptors expands the range of host plants

A large number of chemosensory receptors containing 135 ORs, 57 GRs and 33 IRs were identified in the A. lucorum genome.

The phylogenetic analysis that showed 40% of the OR genes (55) were contained in several clades with high protein sequence identity (>80%), indicating that OR has experienced recent gene replication.

Expression profile analysis showed that GRs exhibit different expression patterns and most GRs are expressed in various tissues, which is consistent with the previous reports in C. lectularius and O. fasciatus, indicating the diversity of GR functions.

Expression profile studies showed that 88% of IRs are expressed in antennae, suggesting that most IRs have olfactory functions. However, some IRs were highly expressed in tissues other than antennae, reflecting the diversity of IR functions.

Rapid evolution of ORs in Apolygus lucorum.3.5 | Expansion of detoxification enzymes contributes to degrading toxin

Agricultural pests usually employ an efficient detoxification system containing various enzymes to overcome numerous toxins in food sources or the environment.

GST is a superfamily of multifunctional isoenzymes involved in the cellular detoxification of various physiological and xenobiotic substances, which is highly related to insecticide resistance in insects, as it can directly detoxify the insecticides.

Phylogenetic analysis of the GSTs of four true bugs showed three A. lucorum-specific branches, which contained 58% GST genes, suggesting the GSTs experienced a recent species-specific expansion in A. lucorum, enabling better detoxification of toxic substances and adaptation to the environment.

P450s constitute the largest and most functionally diverse class of insect detoxification enzymes, including four distinct clades CYP2, CYP3, CYP4 and CYPMito.

The phylogenetic tree of P450s exhibited four A.lucorum-specific branches, suggesting that P450s

experienced similar species-specific expansion as GSTs, enhancing the detoxification activity in A. lucorum.

Expansion of P450 and GST in Apolygus lucorumReference

DOI: 10.1111/1755-0998.13253

相關焦點

  • 綠盲蝽該怎麼治?
    一、綠盲蝽的危害和習性綠盲蝽的危害部位不同,造成的危害現象也不盡相同。嫩葉危害,會產生一個褐紅色的小點,逐漸擴大,隨著時間的推移,點會變得中空,後期葉片會皺縮;花蕾危害,花蕾上會出現小紅點,後期花蕾直接脫落,難以形成果實;幼果危害,綠盲蝽頭部有一根針,它會刺入幼果內吸食汁液,這樣幼果就會流出紅褐色的膠質物,後期,被綠盲蝽叮咬的地方,會凹陷長出鏽疤或者瘤子,影響果實外觀。
  • 開春果園被忽視的小蟲——綠盲蝽
    一、綠盲蝽的危害習性及防治難點綠盲蝽是蝽象類,由於蟲體顏色與葉片顏色相差無幾,蟲體小,隱蔽性強,一般很難發現。二、綠盲蝽的防治措施(1)物理防治。落葉後清除果園落葉雜草。(2)化學防治。①開春兩次清園,要對樹上樹下及周圍雜草,全面噴藥清除蟲卵及若蟲,可選擇毒死蜱或者龍麗樂。
  • 果園殺手綠盲蝽為害嚴重,隱蔽性很強
    綠盲蝽的寄主有100多種,在全國各地均有分布,最初主要在棉花、牧草上為害,隨著農業結構的調整,如今綠盲蝽在蔬菜和果樹上均有為害。在山東省,1990年之後,綠盲蝽開始逐漸為害果樹並逐年加重,2004年出現爆發,全省冬棗和葡萄受害嚴重。近幾年,綠盲蝽已經成為葡萄、蘋果、桃等果樹生產管理上的重要害蟲之一,危害程度不容小覷。
  • 苗床上種出天敵煙盲蝽
    原來,這是該基地今年從市植保站新引進的天敵昆蟲——煙盲蝽,它能吃粉蝨,還能吃斜紋夜蛾等害蟲的小幼蟲。雖然現在沒粉蝨,但這個天敵比較特別,現在釋放它不是為了讓它吃害蟲,而是為了能「種」出更多的天敵來,等到苗子定植到大棚後,它們就可以防害蟲了。   煙盲蝽是一種雜食性昆蟲   據市植保站天敵專家介紹,煙盲蝽是近年來在我國生物防治領域引進的一種雜食性天敵昆蟲。
  • 葡萄樹綠盲蝽和蚜蟲咋防治
    2018-02-27 12:19:22 來源: 關注農民 舉報    答:綠盲蝽又稱為別名盲蝽蟓
  • 果樹上的綠盲蝽蟲小危害大,花果期這樣防治,一次解決問題
    綠盲蝽是梨、蘋果、桃、葡萄、棗等果樹開花結果期危害最嚴重的害蟲之一,一旦防治不力,常常造成大量花和幼果被害,造成大量的落花落果,導致嚴重減產。 由於開花結果期,果樹對各種藥劑最敏感的時期,一旦使用不當,發生藥害,造成的損失更大。今天給大家介紹一個防治綠盲蝽最有效的方法。
  • 蘋果樹患上綠盲蝽和桑田牛後,我們該怎樣去進行防治呢?
    蘋果樹患上綠盲蝽和桑田牛後,我們該怎樣去進行防治呢?蘋果綠盲蝽:形態特徵成蟲體卵圓形,黃綠色,體長5毫米左右寬2.2毫米;觸角綠色;前翅基部革質、綠色,端部膜質、色、半透明。若蟲體綠色,有黑色細毛,翅芽端部黑色。
  • 新疆:南疆一團盲蝽嚴重 棉葉棉桃遭侵蝕
    從調查來看,80%的落鈴都有瘤斑痕跡,即棉盲蝽危害明顯症狀。並在棉株上未脫落的中上部小鈴苞葉內發現大量的棉盲蝽若蟲和成蟲,破孔葉片較多的點片,一個小鈴上平均可見1-3頭若蟲。由於若蟲跑的較快,沒有拍到清晰的圖片。從若蟲蟲體來看,小齡若蟲顏色稍淡,而大齡若蟲呈草綠色,由於臨近棗園地,懷疑為綠盲蝽。
  • 黑人、白人、黃種人是同一物種,正如稻綠蝽有全綠型、點斑型、黃肩型
    黑人、白人、黃種人是同一物種,正如稻綠蝽有全綠型、點斑型、黃肩型,這就是我這個昆蟲觀察者的觀點。*此圖來源於嘎嘎昆蟲網(臺灣),可以見到全綠型的稻綠蝽與點斑型的稻綠蝽在交配。稻綠蝽,為害的植物種類不算少,有水稻、玉米、花生、棉花、豆科、十字花科蔬菜、油菜、芝麻、茄子、辣椒、馬鈴薯、桃、李、梨、蘋果、菸葉等。
  • 益蝽研究小記:同樣是蝽,它卻是吃蝽的蝽
    蝽類昆蟲比較,圖自編為什麼認為蝽都是吃「素」的,那就要看——蝽的危害大多數蝽象為植食性,是各種農作物以及林木業的重要害蟲。茶翅蝽美國分布,圖自維基2、稻綠蝽/Nezara viridula英文通常叫southern green shield bug,中文翻譯為南方綠蝽象
  • 全球首個藥用昆蟲基因組圖譜繪製完成-光明日報-光明網
    四川大學生命科學院教授嶽碧松指出,美洲大蠊是當今世界最古老的昆蟲之一,最早收錄於《神農本草經》,後《本草綱目》《本草經梳》等多有記載。研究發現,美洲大蠊提取物製成的溶液劑,具有良好的創面修復功能,在治療胃潰瘍、十二指腸潰瘍、潰瘍性結腸炎、口腔潰瘍及燒燙傷等創面相關疾病療效顯著。
  • 益蝽研究小記:同樣是蝽,它卻是吃蝽的蝽
    蝽類昆蟲比較,圖自編為什麼認為蝽都是吃「素」的,那就要看——/Nezara viridula英文通常叫southern green shield bug,中文翻譯為南方綠蝽象,從譯名可以看出,應該是起源於南方,但現在幾乎所有溫帶地區都能見到,它也是我在天台觀察到最多的蝽象。
  • 轉基因棉田裡的新麻煩:棉鈴蟲走了盲蝽象上位
    在長期的耐心監測過程中,科學家們發現了一種叫盲蝽象的害蟲正悄然上位。吳孔明介紹,盲蝽象這種雜食性害蟲在中國的棉田裡一直屬於次要害蟲,以前噴灑農藥在殺死棉鈴蟲的同時,順便也就幹掉了盲蝽象,基本無需專門防治。  但隨著Bt棉花的大面積種植,用藥量下降了30%—40%,而Bt棉花只對標靶害蟲棉鈴蟲等鱗翅目生物有防治效果,對半翅目的非標靶害蟲盲蝽象不具防治意義。
  • 全球首個藥用昆蟲基因組圖譜發布
    全球首個藥用昆蟲基因組圖譜發布     四川擬在昆蟲類藥品引入全基因組測序    本報訊(記者 寇敏芳)12月10日,好醫生藥業集團和四川大學共同在北京發布了美洲大蠊全基因組研究的階段性成果
  • 碧鳳蝶染色體水平基因組公布
    近日,中國科學院昆明動物研究所李學燕副研究員帶領的昆蟲研究團隊運用三代長讀長測序技術,結合高通量染色體構象捕獲(Hi-C)技術,成功地組裝了碧鳳蝶染色體水平的基因組,這是首個利用Hi-C技術完成的染色體水平的蝴蝶基因組。
  • 綠盲蝽有4個防治時期最關鍵
    春 2月節氣:立春(2月4日),雨水(2月19日) 氣候條件:氣溫回升,陽氣上升、萬物蘇萌 葡萄2月物候期:(露天葡萄)傷流期;(促早栽培葡萄)出葉、花期 2月主要病害:灰黴病、穗軸褐枯病、霜黴病等 2月主要蟲害:綠盲蝽
  • 全新AutoML工具實現基因組全自動建模「寶藏技術」解讀生命天書
    目前全世界科學家可以解讀的遺傳密碼不超過3%,還有97%的遺傳密碼猶如一座科學尚未突破的巨塔,而AI或許就是攀登這座巨塔的「寶藏技術」。慧眼解讀「生命天書」此前,由於基因組數據的複雜性,主流的基於圖像和文本的AI模型不能很好地對基因組數據進行建模。
  • NG|66個水稻泛基因組文獻分享
    13個基因組數據兩個馴化物種達到了參考基因組水平的組裝(IR 8和N22),7個野生物種((Oryza rufipogon, Oryza nivara, Oryza barthii, O. glumaepatula, Oryza meridionalis, Oryza punctata 和L. perrieri
  • Nat Commun:沃爾巴克氏菌泛基因組研究
    教學視頻|重要文獻|實驗方法|生物軟體方案設計|生信分析|數據挖掘|寫作作圖
  • 幹掉綠盲蝽象,你得「細、勤、準、群」
    連續幾年綠盲蝽象對我區棗園造成很大的摧殘,尤其是今年,由於去年暖冬,加之長期乾旱,致使今年害蟲爆發量大大超出預計情形,第一代綠盲蝽象大面積爆發,棗園葉片因管理差異而五花八門,稍有鬆懈棗樹就會遍體鱗傷!個別棗園的花蕾已被綠盲蝽象幾乎吞噬乾淨,實在令人慘不忍睹!