Systematic discovery of recombinases for efficient integration of large DNA sequences into the human genome

Abstract

Large serine recombinases (LSRs) are DNA integrases that facilitate the site-specific integration of mobile genetic elements into bacterial genomes. Only a few LSRs, such as Bxb1 and PhiC31, have been characterized to date, with limited efficiency as tools for DNA integration in human cells. In this study, we developed a computational approach to identify thousands of LSRs and their DNA attachment sites, expanding known LSR diversity by >100-fold and enabling the prediction of their insertion site specificities. We tested their recombination activity in human cells, classifying them as landing pad, genome-targeting or multi-targeting LSRs. Overall, we achieved up to seven-fold higher recombination than Bxb1 and genome integration efficiencies of 40–75% with cargo sizes over 7 kb. We also demonstrate virus-free, direct integration of plasmid or amplicon libraries for improved functional genomics applications. This systematic discovery of recombinases directly from microbial sequencing data provides a resource of over 60 LSRs experimentally characterized in human cells for large-payload genome insertion without exposed DNA double-stranded breaks.

Abstract

Screening recombinases identifies tools for inserting large sequences into the human genome.

Author and article information

Contributors

Michael C. Bassik:

ORCID: http://orcid.org/0000-0001-5185-8427

bassik@stanford.edu

Lacramioara Bintu:

ORCID: http://orcid.org/0000-0001-5443-6633

lbintu@stanford.edu

Ami S. Bhatt:

ORCID: http://orcid.org/0000-0001-8099-2975

asbhatt@stanford.edu

Patrick D. Hsu:

ORCID: http://orcid.org/0000-0002-9380-2648

pdhsu@berkeley.edu

Journal

Journal ID (nlm-ta): Nat Biotechnol

Journal ID (iso-abbrev): Nat Biotechnol

Title: Nature Biotechnology

Publisher: Nature Publishing Group US (New York)

ISSN (Print): 1087-0156

ISSN (Electronic): 1546-1696

Publication date (Electronic): 10 October 2022

Publication date PMC-release: 10 October 2022

Publication date (Print): 2023

Volume: 41

Issue: 4

Pages: 488-499

Affiliations

[1] Arc Institute, Palo Alto, CA, USA

[2] Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA (GRID grid.47840.3f; ISNI 0000 0001 2181 7878)

[3] Department of Genetics, Stanford University, Stanford, CA, USA (GRID grid.168010.e; ISNI 0000 0004 1936 8956)

[4] University of California, Berkeley–University of California, San Francisco Graduate Program in Bioengineering, Berkeley, CA, USA (GRID grid.47840.3f; ISNI 0000 0001 2181 7878)

[5] Department of Bioengineering, Stanford University, Stanford, CA, USA (GRID grid.168010.e; ISNI 0000 0004 1936 8956)

[6] Cancer Biology Program, Stanford University, Stanford, CA, USA (GRID grid.168010.e; ISNI 0000 0004 1936 8956)

[7] Laboratory of Molecular and Cell Biology, Salk Institute for Biological Studies, La Jolla, CA, USA (GRID grid.250671.7; ISNI 0000 0001 0662 7144)

[8] Department of Medicine (Hematology), Stanford University, Stanford, CA, USA (GRID grid.168010.e; ISNI 0000 0004 1936 8956)

[9] Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA (GRID grid.47840.3f; ISNI 0000 0001 2181 7878)

[10] Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA (GRID grid.47840.3f; ISNI 0000 0001 2181 7878)

Author information

Peter P. Du http://orcid.org/0000-0003-2652-0541

Peter Lotfy http://orcid.org/0000-0003-0809-7073

Michael C. Bassik http://orcid.org/0000-0001-5185-8427

Lacramioara Bintu http://orcid.org/0000-0001-5443-6633

Ami S. Bhatt http://orcid.org/0000-0001-8099-2975

Patrick D. Hsu http://orcid.org/0000-0002-9380-2648

Article

Publisher ID: 1494

DOI: 10.1038/s41587-022-01494-w

PMC ID: 10083194

PubMed ID: 36217031

SO-VID: 2eb424e1-690e-40bb-af08-c8bde667dccc

License:

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

History

Date received: 9 November 2021

Date accepted: 1 September 2022

Funding

Funded by: FundRef https://doi.org/10.13039/100000001, National Science Foundation (NSF)

Award ID: DGE-1656518

Award ID: 2019284848

Award Recipient: Matthew G. Durrant, Alison Fanton

Funded by: FundRef https://doi.org/10.13039/100000002, U.S. Department of Health & Human Services | National Institutes of Health (NIH)

Award ID: F99DK126120

Award ID: 5UM1HG009436-02

Award ID: 1DP2HD084a06901

Award ID: R01AI143757

Award ID: R01AI148623

Award ID: DP5OD021369

Award ID: R01GM131073

Award Recipient: Josh Tycko, Michael C. Bassik, Ami S. Bhatt, Patrick D. Hsu

Funded by: FundRef https://doi.org/10.13039/100000057, U.S. Department of Health & Human Services | NIH | National Institute of General Medical Sciences (NIGMS)

Award ID: R35M128947

Award Recipient: Lacramioara Bintu

Custom metadata

ScienceOpen disciplines: Biotechnology

Keywords: gene delivery, genetics, genetic engineering, mobile elements
