SHANG Fang-jian1, 2, SHI Zhe-fang1, 2, WANG Cong1, 2, LIU Qi1, 2
1.Yunnan Provincial Key Laboratory of Entomological Biopharmaceutical R&D (Dali University),Dali 671000,China; 2. Integrated Lab of Pathology Biology, College of Basic Medical, Dali University,Dali 671000,China;
Abstract:This work aimed to research the codon usage preference of SARS-CoV-2 and the codon clustering relationship of epidemic strains in different countries. CodonW, EMBOSS, SigmaPlot14.0 and SPSS 22.0 were used to analyze the codon usage preference of SARS-CoV-2 and the codon clustering relationship of epidemic strains in different countries. The ENC value of SARS-CoV-2 was found to be between 26.60 and 57.81. Approximately 84.98% of the codon preference involved codons ending with A/U. ACA, ACU, AGA, AUU, CCU, CUU, GCU, GGU, GUU, UCA, UCU, UUA were the high-frequency codons used in most proteins, and ORF10 had no high-frequency codons. ENC-Plot, neutrality and PR2 analyses showed that the codon usage preference of SARS-CoV-2 was affected by different factors. The main factor was natural selection, followed by mutation. According to cluster analysis, the codon preference of SARS-CoV-2 in 20 countries has changed significantly. Spain, France, South Korea, the United States, and Vietnam clustered separately. Cluster analysis of S and ORF1ab indicated that the usage bias of SARS-CoV-2 strains in China and in the United States belonged to different clusters. Thus, the main factor affecting the codon usage preference of SARS-CoV-2 is natural selection. The codon usage preference of SARS-CoV-2 has changed significantly, possibly because of cross-species transmission. Dynamic monitoring of SARS-CoV-2 codon usage must be strengthened, and further study the significance of codon changes is needed.
尚方建, 石哲芳, 王聪, 刘奇. 新型冠状病毒(SARS-CoV-2)的密码子偏爱性分析[J]. 中国人兽共患病学报, 2021, 37(1): 15-21.
SHANG Fang-jian, SHI Zhe-fang, WANG Cong, LIU Qi. Analysis of SARS-CoV-2 codon usage preference. Chinese Journal of Zoonoses, 2021, 37(1): 15-21.
[1] Lan J, Ge J, Yu J, et al.Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor[J]. Nature, 2020,581(7807):215-220. DOI:10.1038/s41586-020-2180-5 [2] Dhama K, Khan S, Tiwari R, et al.Coronavirus Disease 2019-COVID-19[J]. Clin Microbiol Rev, 2020,33(4):e00028-20. DOI:10.1128/CMR.00028-20 [3] WHO. WHO Coronavirus Disease (COVID-19) Dashboard[EB/OL]. (2020-07-25)[2020-7-25]. https://covid19.who.int/ [4] Quax TE, Claassens NJ, Soll D, et al.Codon Bias as a Means to Fine-Tune Gene Expression[J]. Mol Cell, 2015,59(2):149-161. DOI:10.1016/j.molcel.2015.05.035 [5] Jitobaom K, Phakaratsakul S, Sirihongthong T, et al.Codon usage similarity between viral and some host genes suggests a codon-specific translational regulation[J]. Heliyon, 2020,6(5):e3915. DOI:10.1016/j.heliyon.2020.e03915 [6] Vabret N, Bailly-Bechet M, Najburg V, et al.The biased nucleotide composition of HIV-1 triggers type I interferon response and correlates with subtype D increased pathogenicity[J]. PLoS One, 2012,7(4):e33502. DOI:10.1371/journal.pone.0033502 [7] Berkhout B, van Hemert F. On the biased nucleotide composition of the human coronavirus RNA genome[J]. Virus Res, 2015,202:41-47. DOI:10.1016/j.virusres.2014.11.031 [8] Belalov IS, Lukashev AN.Causes and implications of codon usage bias in RNA viruses[J]. PLoS One, 2013,8(2):e56642. DOI:10.1371/journal.pone.0056642 [9] Jenkins GM, Holmes EC.The extent of codon usage bias in human RNA viruses and its evolutionary origin[J]. Virus Res, 2003,92(1):1-7. DOI:10.1016/s0168-1702(02)00309-x [10] Sharp PM, Li WH.Codon usage in regulatory genes in Escherichia coli does not reflect selection for 'rare' codons[J]. Nucleic Acids Res, 1986,14(19):7737-7749. DOI:10.1093/nar/14.19.7737 [11] Chen Y.A comparison of synonymous codon usage bias patterns in DNA and RNA virus genomes: quantifying the relative importance of mutational pressure and natural selection[J]. Biomed Res Int, 2013,2013:406342. DOI:10.1155/2013/406342 [12] Sueoka N.Directional mutation pressure and neutral molecular evolution[J]. Proc Natl Acad Sci U S A, 1988,85(8):2653-2657. DOI:10.1073/pnas.85.8.2653 [13] Sueoka N.Intrastrand parity rules of DNA base composition and usage biases of synonymous codons[J]. J Mol Evol, 1995,40(3):318-325. DOI:10.1007/BF00163236 [14] He W, Wang N, Tan J, et al.Comprehensive codon usage analysis of porcine deltacoronavirus[J]. Mol Phylogenet Evol, 2019,141:106618.DOI:10.1016/j.ympev.2019.106618 [15] Yan Y, Shin WI, Pang YX, et al.The first 75 days of novel coronavirus (SARS-CoV-2) outbreak: recent advances, prevention, and treatment[J]. Int J Environ Res Public Health, 2020,17(7):2323. DOI:10.3390/ijerph17072323 [16] Chen Y.A comparison of synonymous codon usage bias patterns in DNA and RNA virus genomes: quantifying the relative importance of mutational pressure and natural selection[J]. Biomed Res Int, 2013,2013:406342. DOI:10.1155/2013/406342 [17] Moriyama EN, Powell JR.Codon usage bias and tRNA abundance in Drosophila[J]. J Mol Evol, 1997,45(5):514-523. DOI:10.1007/pl00006256 [18] Holmquist GP, Filipski J.Organization of mutations along the genome: a prime determinant of genome evolution[J]. Trends Ecol Evol, 1994,9(2):65-69. DOI:10.1016/0169-5347(94)90277-1 [19] Wang M, Zhang J, Zhou JH, et al.Analysis of codon usage in bovine viral diarrhea virus[J]. Arch Virol, 2011,156(1):153-160. DOI:10.1007/s00705-010-0848-0 [20] Kandeel M, Ibrahim A, Fayez M, et al.From SARS and MERS CoVs to SARS-CoV-2: moving toward more biased codon usage in viral structural and nonstructural genes[J]. J Med Virol, 2020,92(6):660-666. DOI:10.1002/jmv.25754 [21] 田明明, 魏雪玲, 杨兴, 等. 云南新现蝙蝠SARS样冠状病毒密码子偏性及其聚类分析[J]. 中国人兽共患病学报, 2018,34(12):1079-1086. DOI:10.3969/j.issn.1002-2694.2018.00.203 [22] Tort FL, Castells M, Cristina J.A comprehensive analysis of genome composition and codon usage patterns of emerging coronaviruses[J]. Virus Res, 2020,283:197976. DOI:10.1016/j.virusres.2020.197976 [23] Fauver JR, Petrone ME, Hodcroft EB, et al.Coast-to-coast spread of SARS-CoV-2 during the early epidemic in the United States[J]. Cell, 2020,181(5):990-996. DOI:10.1016/j.cell.2020.04.021