9
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Article: not found

      A 336-Nucleotide In-Frame Deletion in ORF7a Gene of SARS-CoV-2 Identified in Genomic Surveillance by Next-Generation Sequencing

      letter

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Public health laboratories (PHLs) across the US have been taking a central role in the response to the coronavirus disease 2019 (COVID-19) pandemic [1]. Currently, PHLs have increased their capacity to sequence whole genomes of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of COVID-19, which enhances surveillance. SARS-CoV-2 genomes are known to use non-canonical translation mechanisms such as leaky scanning, ribosomal frameshifting, and alternative initiation, as is commonly observed in the coronavirus family 2, 3, 4, 5, 6. Here, we report the longest in-frame deletion in ORF7a gene identified in a patient using Next-Generation Sequencing (NGS) during baseline surveillance. A nasal sample was collected on July 7th, 2021, from a previously tested SARS-CoV-2 positive pediatric asymptomatic female in San Mateo County Public Health Laboratory, California, USA. The presence of SARS-CoV-2 in a specimen was confirmed by the CDC 2019-Novel Coronavirus Real-Time RT-PCR Diagnostic Panel, that targets two unique regions in the SARS-CoV-2 nucleocapsid gene (N1 and N2) and the human RNase P gene (RP) [7], with the cycle threshold values of 15.4, 15.6, and 26.8, respectively. Subsequently, the total nucleic acids from the SARS-CoV-2 positive specimen were used for NGS library preparation utilizing the Respiratory Pathogen ID/AMR Enrichment Panel (RPIP) Kit (Illumina). The concentration and distribution size of the prepared library was checked on a Qubit 4 fluorometer (Life Technologies) and 4200 TapeStation (Agilent Technologies), respectively. Paired-end NGS was performed on an Illumina MiniSeq instrument using the 150-cycle MiniSeq high output reagent kit (Illumina). Sequence data were assembled using the Explify RPIP Analysis App v.3.2.8 (IDbyDNA) and validated by the viral-ngs v.2.1.8 workflows (Broad Viral Genomics) on the Terra platform (app.terra.bio) [8]. Briefly, the SARS-CoV-2 genome was assembled using a reference-based assembly approach of quality-filtered raw reads with the reference genome Wuhan-Hu-1 (NC_045512.2) [9]. A total of 8,363,544 raw paired-end reads were generated. The genome consensus of the isolate, designated CA-SMCPHL-072321.3, was called from 2,071,958 SARS-CoV-2 reads using the minimap2 aligner [10]. This resulted in the mean read depth coverage of 5,066 reads. The final genome consensus was 29,548 nucleotides (nt) long and was deposited in GISAID under the accession number EPI_ISL_6159211. The genome was classified as Nextstrain clade 20I [11] from Pango lineage B.1.1.7 [12]. During the annotation of this genome, we noticed a 336-nt deletion in the gene-encoding accessory protein ORF7a (27,418-27,753) (Figure 1 ). Although numerous studies have been performed for some other SARS-CoV-2 viral proteins 13, 14, 15, 16, 17, studies on putative activity and role of ORF7a is just starting to arise. It has been demonstrated that SARS-CoV-2 ORF7a protein inhibits type I interferon (IFN-1) signaling [18, 19], interacts with CD14+ monocytes [20], induces the nuclear factor kappa B (NF-κB) pathway [21], and thus triggers expression of proinflammatory cytokines, including interleukin-6 (IL-6) and tumor necrosis factor alpha (TNF-α). This forms the basis of a likely mechanism through which ORF7a mediates the potentially fatal cytokine storm progression, indicating that ORF7a may be a key viral factor for clinical severity of COVID-19 [22]. FIG 1 Coverage plot across hCoV-19/USA/CA-SMCPHL-072321.3/2021 (EPI_ISL_6159211) whole-genome assembly. Coverage depth values were obtained by mapping Illumina reads to the reference Wuhan-Hu-1 genome NC_045512.2 and expressed as the number of sequence reads at each nucleotide position along the length of the CA-SMCPHL-072321.3 genome, which shows a deletion in the ORF7a gene (around 27.5 Kbp). Mean sequencing depth was 5,066 reads. FIG 1 Previous studies reported complete ORF7a gene loss [23, 24], multiple length in-frame and frame-shift deletions in ORF7a 25, 26, 27, 28, 29, 30, 31, as well as large ORF7a∆370, ORF7a∆227, and ORF7a∆392 that resulted in the fusion of ORF7a with ORF6, ORF7b, and ORF8 genes, respectively [32, 33]. However, the ORF7a∆336 reported here is currently the longest detected in-frame deletion within the ORF7a gene, which makes the gene only 30 nt long (Figure 2 ). ORF7a deletions are found to impact SARS-CoV-2 pathogenicity [31]. In vitro viral challenge experiments demonstrated that the C-terminal truncation of ORF7a results in a replication defect, which was found to be associated with elevated IFN response to SARS-CoV-2 [31]. It was also shown that complete deletion of ORF7a reduces viral replication [34]. This suggests that strains with deletions in ORF7a are more likely to emerge in immunocompromised patients. Hence, further experiments are needed to determine the functional outcomes of different deletions. FIG 2 Complete SARS-CoV-2 genome organization diagram (A) and alignment of ORF7a gene sequences (B). To avoid sequences with gaps due to poor sequencing coverage, only sequences containing truncations and deletions in the ORF7a gene, that had high sequencing coverage, were downloaded from GISAID (Table S1). The multiple amino acid alignments of the ORF7a gene sequences (n = 47) with reference genome Wuhan-Hu-1 NC_045512.2 were performed by MUSCLE algorithm in MEGA software v. 7.0.26. Frequency-based differences coloring of aligned ORF7a proteins were visualized in NCBI MSA Viewer v. 1.20.1. The GISAID accession number of the hCoV-19/USA/CA-SMCPHL-072321.3/2021 (EPI_ISL_6159211) from this study is shown in red. The 336-nt (112-amino acid) deletion in CA-SMCPHL-072321.3 ORF7a was the longest observed deletion within the ORF7a gene sequences available from GISAID and GenBank as of October 2021. FIG 2 Unfortunately, a variety of deletions in the ORF7a region can be under-investigated similarly to ORF8 deletions that have been shown to often not be reported by the standard data analysis pipelines, which are frequently simply represented by stretch ambiguous bases or as gaps in the consensus sequence due to poor NGS coverage [35]. Thus, non-canonical genes are generally excluded from genomic and clinical analyses despite their importance for understanding SARS-CoV-2 evolution and replication dynamics, which have vital implications in vaccine development and control strategies for COVID-19 [6, 36, 37]. These findings highlight the necessity of submission of the raw sequencing reads in public databases in order to assess the spread of deletion strains. FUNDING This work was supported by the CDC ELC CARES and Cooperative Agreement #NU60OE000103. Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper

          Related collections

          Most cited references33

          • Record: found
          • Abstract: found
          • Article: not found

          Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China

          Summary Background A recent cluster of pneumonia cases in Wuhan, China, was caused by a novel betacoronavirus, the 2019 novel coronavirus (2019-nCoV). We report the epidemiological, clinical, laboratory, and radiological characteristics and treatment and clinical outcomes of these patients. Methods All patients with suspected 2019-nCoV were admitted to a designated hospital in Wuhan. We prospectively collected and analysed data on patients with laboratory-confirmed 2019-nCoV infection by real-time RT-PCR and next-generation sequencing. Data were obtained with standardised data collection forms shared by WHO and the International Severe Acute Respiratory and Emerging Infection Consortium from electronic medical records. Researchers also directly communicated with patients or their families to ascertain epidemiological and symptom data. Outcomes were also compared between patients who had been admitted to the intensive care unit (ICU) and those who had not. Findings By Jan 2, 2020, 41 admitted hospital patients had been identified as having laboratory-confirmed 2019-nCoV infection. Most of the infected patients were men (30 [73%] of 41); less than half had underlying diseases (13 [32%]), including diabetes (eight [20%]), hypertension (six [15%]), and cardiovascular disease (six [15%]). Median age was 49·0 years (IQR 41·0–58·0). 27 (66%) of 41 patients had been exposed to Huanan seafood market. One family cluster was found. Common symptoms at onset of illness were fever (40 [98%] of 41 patients), cough (31 [76%]), and myalgia or fatigue (18 [44%]); less common symptoms were sputum production (11 [28%] of 39), headache (three [8%] of 38), haemoptysis (two [5%] of 39), and diarrhoea (one [3%] of 38). Dyspnoea developed in 22 (55%) of 40 patients (median time from illness onset to dyspnoea 8·0 days [IQR 5·0–13·0]). 26 (63%) of 41 patients had lymphopenia. All 41 patients had pneumonia with abnormal findings on chest CT. Complications included acute respiratory distress syndrome (12 [29%]), RNAaemia (six [15%]), acute cardiac injury (five [12%]) and secondary infection (four [10%]). 13 (32%) patients were admitted to an ICU and six (15%) died. Compared with non-ICU patients, ICU patients had higher plasma levels of IL2, IL7, IL10, GSCF, IP10, MCP1, MIP1A, and TNFα. Interpretation The 2019-nCoV infection caused clusters of severe respiratory illness similar to severe acute respiratory syndrome coronavirus and was associated with ICU admission and high mortality. Major gaps in our knowledge of the origin, epidemiology, duration of human transmission, and clinical spectrum of disease need fulfilment by future studies. Funding Ministry of Science and Technology, Chinese Academy of Medical Sciences, National Natural Science Foundation of China, and Beijing Municipal Science and Technology Commission.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            A new coronavirus associated with human respiratory disease in China

            Emerging infectious diseases, such as severe acute respiratory syndrome (SARS) and Zika virus disease, present a major threat to public health 1–3 . Despite intense research efforts, how, when and where new diseases appear are still a source of considerable uncertainty. A severe respiratory disease was recently reported in Wuhan, Hubei province, China. As of 25 January 2020, at least 1,975 cases had been reported since the first patient was hospitalized on 12 December 2019. Epidemiological investigations have suggested that the outbreak was associated with a seafood market in Wuhan. Here we study a single patient who was a worker at the market and who was admitted to the Central Hospital of Wuhan on 26 December 2019 while experiencing a severe respiratory syndrome that included fever, dizziness and a cough. Metagenomic RNA sequencing 4 of a sample of bronchoalveolar lavage fluid from the patient identified a new RNA virus strain from the family Coronaviridae, which is designated here ‘WH-Human 1’ coronavirus (and has also been referred to as ‘2019-nCoV’). Phylogenetic analysis of the complete viral genome (29,903 nucleotides) revealed that the virus was most closely related (89.1% nucleotide similarity) to a group of SARS-like coronaviruses (genus Betacoronavirus, subgenus Sarbecovirus) that had previously been found in bats in China 5 . This outbreak highlights the ongoing ability of viral spill-over from animals to cause severe disease in humans.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation

              Structure of the nCoV trimeric spike The World Health Organization has declared the outbreak of a novel coronavirus (2019-nCoV) to be a public health emergency of international concern. The virus binds to host cells through its trimeric spike glycoprotein, making this protein a key target for potential therapies and diagnostics. Wrapp et al. determined a 3.5-angstrom-resolution structure of the 2019-nCoV trimeric spike protein by cryo–electron microscopy. Using biophysical assays, the authors show that this protein binds at least 10 times more tightly than the corresponding spike protein of severe acute respiratory syndrome (SARS)–CoV to their common host cell receptor. They also tested three antibodies known to bind to the SARS-CoV spike protein but did not detect binding to the 2019-nCoV spike protein. These studies provide valuable information to guide the development of medical counter-measures for 2019-nCoV. Science, this issue p. 1260
                Bookmark

                Author and article information

                Journal
                J Clin Virol
                J Clin Virol
                Journal of Clinical Virology
                Published by Elsevier B.V.
                1386-6532
                1873-5967
                14 February 2022
                14 February 2022
                : 105105
                Affiliations
                [0001]San Mateo County Public Health Laboratory, 225 37th Avenue, San Mateo, CA 94403, USA
                Author notes
                [* ]Corresponding author.
                Article
                S1386-6532(22)00041-5 105105
                10.1016/j.jcv.2022.105105
                8842577
                5b7da784-161b-4c2d-a860-07d5694f8c2d
                © 2022 Published by Elsevier B.V.

                Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.

                History
                : 13 November 2021
                : 9 February 2022
                Categories
                Correspondence

                Microbiology & Virology
                covid-19,sars-cov-2,orf7a gene,ngs,wgs,sequencing
                Microbiology & Virology
                covid-19, sars-cov-2, orf7a gene, ngs, wgs, sequencing

                Comments

                Comment on this article