10
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A putative new SARS-CoV protein, 3c, encoded in an ORF overlapping ORF3a

      brief-report
      1 , *
      The Journal of General Virology
      Microbiology Society
      SARS-CoV, coronavirus, overlapping gene, ORF3c, 3c, sarbecovirus

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Identification of the full complement of genes in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a crucial step towards gaining a fuller understanding of its molecular biology. However, short and/or overlapping genes can be difficult to detect using conventional computational approaches, whereas high-throughput experimental approaches – such as ribosome profiling – cannot distinguish translation of functional peptides from regulatory translation or translational noise. By studying regions showing enhanced conservation at synonymous sites in alignments of SARS-CoV-2 and related viruses (subgenus Sarbecovirus) and correlating the results with the conserved presence of an open reading frame (ORF) and a plausible translation mechanism, a putative new gene – ORF3c – was identified. ORF3c overlaps ORF3a in an alternative reading frame. A recently published ribosome profiling study confirmed that ORF3c is indeed translated during infection. ORF3c is conserved across the subgenus Sarbecovirus, and encodes a 40–41 amino acid predicted transmembrane protein.

          Related collections

          Most cited references25

          • Record: found
          • Abstract: found
          • Article: not found

          Basic local alignment search tool.

          A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score. Recent mathematical results on the stochastic properties of MSP scores allow an analysis of the performance of this method as well as the statistical significance of alignments it generates. The basic algorithm is simple and robust; it can be implemented in a number of ways and applied in a variety of contexts including straightforward DNA and protein sequence database searches, motif searches, gene identification searches, and in the analysis of multiple regions of similarity in long DNA sequences. In addition to its flexibility and tractability to mathematical analysis, BLAST is an order of magnitude faster than existing sequence comparison tools of comparable sensitivity.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            MUSCLE: a multiple sequence alignment method with reduced time and space complexity

            Background In a previous paper, we introduced MUSCLE, a new program for creating multiple alignments of protein sequences, giving a brief summary of the algorithm and showing MUSCLE to achieve the highest scores reported to date on four alignment accuracy benchmarks. Here we present a more complete discussion of the algorithm, describing several previously unpublished techniques that improve biological accuracy and / or computational complexity. We introduce a new option, MUSCLE-fast, designed for high-throughput applications. We also describe a new protocol for evaluating objective functions that align two profiles. Results We compare the speed and accuracy of MUSCLE with CLUSTALW, Progressive POA and the MAFFT script FFTNS1, the fastest previously published program known to the author. Accuracy is measured using four benchmarks: BAliBASE, PREFAB, SABmark and SMART. We test three variants that offer highest accuracy (MUSCLE with default settings), highest speed (MUSCLE-fast), and a carefully chosen compromise between the two (MUSCLE-prog). We find MUSCLE-fast to be the fastest algorithm on all test sets, achieving average alignment accuracy similar to CLUSTALW in times that are typically two to three orders of magnitude less. MUSCLE-fast is able to align 1,000 sequences of average length 282 in 21 seconds on a current desktop computer. Conclusions MUSCLE offers a range of options that provide improved speed and / or alignment accuracy compared with currently available programs. MUSCLE is freely available at .
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              EMBOSS: The European Molecular Biology Open Software Suite

                Bookmark

                Author and article information

                Journal
                J Gen Virol
                J Gen Virol
                jgv
                jgv
                The Journal of General Virology
                Microbiology Society
                0022-1317
                1465-2099
                October 2020
                15 July 2020
                15 July 2020
                : 101
                : 10
                : 1085-1089
                Affiliations
                [ 1] departmentDivision of Virology , Department of Pathology, Addenbrooke’s Hospital, University of Cambridge , Cambridge, UK
                Author notes
                *Correspondence: Andrew E. Firth, aef24@ 123456cam.ac.uk
                Author information
                https://orcid.org/0000-0002-7986-9520
                Article
                001469
                10.1099/jgv.0.001469
                7660454
                32667280
                63a0957e-f024-4374-9de3-d777dfef44b3
                © 2020 The Authors

                This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.

                History
                : 13 May 2020
                : 25 June 2020
                Funding
                Funded by: H2020 European Research Council
                Award ID: 646891
                Award Recipient : Andrew Firth
                Funded by: Wellcome Trust
                Award ID: 106207
                Award Recipient : Andrew Firth
                Categories
                Short Communication
                Animal
                Positive-strand RNA Viruses
                Custom metadata
                0

                Microbiology & Virology
                sars-cov,coronavirus,overlapping gene,orf3c,3c,sarbecovirus
                Microbiology & Virology
                sars-cov, coronavirus, overlapping gene, orf3c, 3c, sarbecovirus

                Comments

                Comment on this article