Main > NUCLEIC ACID > DNA > Shuffling Procedures > Recombination > of Insertion modified nucleic acids

Product USA. M

PATENT NUMBER This data is not available for free
PATENT GRANT DATE April 2, 2002
PATENT TITLE Recombination of insertion modified nucleic acids

PATENT ABSTRACT Methods of modulating, tuning and improving hybridization properties and recombination properties of molecules for use in nucleic acid shuffling procedures, relates recombination mixtures and methods of modulating, tuning, improving and evolving splicing of RNAs and proteins are provided.
PATENT INVENTORS This data is not available for free
PATENT ASSIGNEE This data is not available for free
PATENT FILE DATE March 3, 2000
PATENT REFERENCES CITED Holford, et al., "Adding `spice` to protein engineering," Structure, Aug. 15, 1998, 6:951-956.
Affholter & Arnold Engineering a revolution. Chemistry in Britain 35:48-51.
Affholter & Arnold, Engineering a revolution. Chemtech 29:34-39.
Affholter and Stemmer (1998) Directed evolution of proteins and pathways by DNA shuffling Book of Abstracts, 216.sup.th ACS National Meeting Boston, Aug. 23-27, BTEC-042.
Carlo (1996) "An intron splicing enhancer containing a G-righ repeat facilitates inclusion of a vertebrate micro-exon." RNA 2:342-353.
Crameri et al., (1993) 10.sub.20 --Fold aptamer library amplification without gel purification Nuc. Acid Res. 21(18):4410.
Howard, (1998) Chemistry of the future: Exploitation of the power of biology. Book of Abstracts, 216.sup.th ACS National Meeting Boston, Aug. 23-27, BTEC-045.
Leong et al., Maximizing the genetic diversity by molecular breeding: evolution of DNA vaccine vectors and adjuvant cytokines. 1999 Winter Biotechnology Conference: Molecular Approaches to Vaccine Design. Dec. 2-5 (1999).
Merz A., et al., Improving the catalytic activity of a thermophilic enzyme at low temperatures. Biochemistry 39(5) 880-889.
Minshull and Stemmer (1999) Protein evolution by molecular breeding Current Opinion in Chem. Biol. 3:284-290.
Patten et al., (1997) Applications of DNA shuffling to pharmaceuticals and vaccines. Current Opinion in Biotech. 8:724-733.
Punnonen et al., (1998) Evolution of genetic vaccines by DNA shuffling Keystone Symposium on Molecular Aspect of Viral Immunity Tamarron, CO, Feb. 16-22, 1998.
Punnonen et al., Evolution of DNA vaccine vectors, antigens, and adjuvant cytokines by DNA shuffling Keystone Symposium on DNA vaccines, Snowbird, UT, Apr. 12-17, 1999.
Punnonen J., Molecular breeding of allergy vaccines and antiallergic cytokines. International Archives of Allergy and Immunology 121:173-182.
Punnonen, et al., (1997) Evolution of DNA vaccine vectors by DNA shuffling The First Gordon Conference on Genetic Vaccines/DNA Vaccines Plymouth State College, Plymouth, NH, Jul. 20-25, 1997.
Soong et al., (1998) Directed evolution of novel retroviral tropisms by DNA shuffling Abstract #97, Programs & Abstracts, 1.sup.st Annual Meeting of the American Society of Gene Therapy May 28-31, 1998, Seattle, WA.
Soong et al., "Directed evolution of novel retroviral tropisms by DNA shuffling" Abstract, Retroviruses, 1998 Meeting, May 26-31, 1998 Cold Spring Harbor Laboratory, Cold Spring Harbor, NY.
Soong et al., "DNA shuffling as a tool to evolve desired retroviral phenotypes" Abstract, p. 228 Gene Therapy, 1998 Meeting Sep. 23-27, 1998 Cold Spring Harbor Laboratory, Cold Spring Harbor, NY.
Stemmer "Directed evolution of enzymes and pathways by DNA shuffling" FASEB Journal 13(7):A1431.
Stemmer et al., (1999) "Molecular Breeding of viruses for targeting and other clinical properties" Tumor Targeting 4:1-4.
Stemmer et al., Molecular evolution of genes and pathways by DNA shuffling. FASEB Journal 11(9):A1124.
Stemmer, Directed evolution of enzymes and pathways by DNA shuffling. Book of Abstracts, 217.sup.th ACS National Meeting, Anaheim, CA Mar. 21-25, BIOT-080.
Stemmer, Directed evolution of proteins, pathways, episomes and viruses by DNA shuffling FASEB Journal 12(8):A1303 .
Stemmer, DNA sequence evolution by sexual PCR. Experientia (Basel) 52(ABSTR): A25.
Tobin et al., Colorless green ideas . . . Nature Biotechnology 17:333-334.
Welch et al., DNA shuffling of diverse natural genes to produce industrial enzymes with novel properties. Abstracts of the General Meeting of the American Society for Microbiology 99:507-508 Meeting info: 99.sup.th General Meeting of the American Society for Microbiology, Chicago, IL May 30-Jun. 3, 1999.
Wright et al., Evolution of viruses and vectors by DNA shuffiling: Applications in vaccination and gene therapy. 1999 Winter Biotechnology Conference: Molecular Approaches to Vaccine Design. Dec. 2-5 (1999).
Crameri et al., (1993) "10(20)-Fold aptamer library amplification without gel purification," Nuc. Acids Res. 21(18):4410.
Patten, P.A. et al., (1997) "Application of DNA Shuffling to Pharmaceuticals and Vaccines." Current Opinion in Biotechnology 8:724-733.
Chang et al., (1999) "Evolution of a cytokine using DNA family shuffling" Nature Biotechnology 17:793-797.
Christians et al., (1999) "Directed evolution of thymidine kinase for AZT phosphorylation using DNA family shuffling" Nature Biotechnology 17:259-264.
Colston and Davis (1994) Mol. Micobiol., 12:359-363.
Cook et al., "Photochemically initiated protein splicing" (1995) Angew. Chem. Int. Ed. Engel 34, 1620-1630.
Crameri and Stemmer (1995) "Combinatorial multiple cassette mutagenesis creates all the permutations of mutant and wildtype cassettes" Bio Techniques 18:194-195.
Crameri et al., (1996) "Construction and evolution of antibody-phage libraries by DNA shuffling" Nature Medicine 2:100-103.
Crameri et al., (1996) "Improved green fluorescent protein by molecular evolution using DNA shuffling" Nature Biotechnology 14:315-319.
Crameri et al., (1997) "Molecular evolution of an arsenate detoxification pathway by DNA shuffling" Nature Biotechnology 15:436-438.
Crameri et al., (1998) "DNA shuffling of a family of genes from diverse species accelerates directed evolution" Nature 391:288-291.
Davis et al., (1992) "Novel structure of the recA Locus of Mycobacterium . . . " J. Bacteriol, 173:5653-5662.
Davis et al., "Protein splicing in the maturation of M. Tuberculosis RecA Protein: A mechanism . . . " (1992) Cell 71:201-210.
Gates et al., (1996) "Affinity selective isolation of ligands from peptide libraries through display on a lac repressor headpiece dimer" Journal of Molecular Biology 255:373-386.
Hirtata et al., (1990) J. Biol Chem. 265:6726-6733.
Hodges et al., "Protein splicing removes intervening sequences in an archaea DNA polymerase" (1992) Nucleic Acids Res. 20:6153-6157.
Kane et al., (1990) "Protein splicing converts the yeast TFP1 Gene Product to the 69-kD Subunit . . . " Science 250:651-657.
Modefferi and Black (1997) Mol Cell Biol. 17:6537-6545.
Ness et al., (1999) DNA shuffling of subgenomic sequences of subtillsin Nature Biotechnology 17:893-896.
Ostermeier et al., "A combinatorial approach to hybrid enzymes independent of DNA homology" Nature Biology (1999) vol. 17 Dec. 1205-1208.
Perler et al., (1994) Nucleic Acid Research 22:1125-1127.
Stemmer (1994) "DNA shuffling by random fragmentation and reassembly: In vitro recombination for molecular evolution" Proceedings of the National Academy of Sciences, USA 91:10747-10751.
Stemmer (1994) "Rapid evolution of a protein in vitro by DNA shuffling" Nature 370:389-391.
Stemmer (1995) "Searching Sequence Space" Bio/Technology 13:549-553.
Stemmer (1995) "The Evolution of Molecular Computation" Science 270:1215.
Stemmer (1996) "Sexual PCR and Assembly PCR" The Encyclopedia of Molecular Bioogy pp 447-457.
Stemmer et al., (1995) "Single-step assembly of a gene and entire plasmid form large numbers of oligodeoxyribonucleotides" Gene 164:49-53.
Xu et al., (1993) Cell 75:1371-1377.
Zhang et al., (1997) "Directed evolution of an effective fucosidase from a galactosidease by DNA shuffling and screening" Proceedings of the National Academy of Sciences, U.S.A. 94:4504-4509.
Carlo et al., An intron splicing enhancer containing a G-rich repeat facilitates inclusion of a vertebrate micro-exon. RNA, 2, 343-353, 1996.

PATENT PARENT CASE TEXT This data is not available for free
PATENT CLAIMS What is claimed is:

1. A method of recombining a first and a second target DNA, the method comprising:

providing a first and a second target DNA, wherein at least one of the first and second target DNAs comprises a plurality of homologous or non-homologous insertion DNA sequences, wherein the plurality of insertion DNA sequences encode one or more intein; and,

recombining the first and second target DNAs, thereby providing a recombinant DNA.

2. The method of claim 1, wherein the method occurs in vitro.

3. The method of claim 2, wherein the recombinant DNA encodes a protein subsequence, which protein subsequence is spliced to a second protein subsequence to produce an active protein.

4. The method of claim 3, wherein the protein subsequence and the second protein subsequence are spliced in vivo.

5. The method of claim 3, wherein the protein subsequence and the second protein subsequence are spliced in cis.

6. The method of claim 3, wherein the protein subsequence and the second protein subsequence are spliced in trans.

7. The method of claim 3, wherein the protein subsequence and the second protein subsequence are spliced in a spontaneous splicing reaction.

8. The method of claim 3, wherein the protein subsequence and the second protein subsequence are spliced in a controlled splicing reaction.

9. The method of claim 1, wherein the plurality of insertion DNA sequences are present in both the first and second target DNAs.

10. The method of claim 1, wherein the plurality of insertion DNA sequences at least partially comprise at least one DNA subsequence which encodes at least one trans-splicing intein.

11. The method of claim 1, wherein the first or second target DNA comprises at least about 10 mini exteins.

12. The method of claim 1, wherein the first or second target DNA comprises at least about 10 insertion DNA sequences.

13. The method of claim 1, wherein the insertion DNA sequences modulate a recombination frequency between the first and second target DNAs.

14. The method of claim 1, wherein the insertion DNA sequences modulate an expression level or expression pattern of the first target DNA, the second target DNA, or the recombinant DNA in one or more cells.

15. The method of claim 1, wherein the insertion DNA sequences are recombined with one or more parental DNAs to produce the first or second target DNA.

16. The method of claim 1, the method further comprising:

providing a first parental DNA sequence and a second parental DNA sequence, which first and second parental DNA sequences are homologous or non-homologous; and,

inserting a plurality of insertion DNA sequences into one or more of the first and second parental DNA sequences, wherein the plurality of insertion DNA sequences encode one or more intein, thereby providing the first and the second target DNAs.

17. The method of claim 16, wherein the step of inserting the plurality of insertion DNA sequences into one or more of the first and second parental DNA sequences is performed by physically joining a plurality of subsequences of the first or second parental DNA sequences to the plurality of insertion DNA sequences.

18. The method of claim 1, the method further comprising selecting the recombinant DNA for a desired trait or property.

19. The method of claim 1, further comprising expressing the recombinant DNA in a cell.

20. The method of claim 1, further comprising expressing the recombinant DNA in a cell, thereby producing a protein, which protein is proteolytically cleaved to produce an active protein or to remove the intein.

21. The method of claim 1, the first and second target DNAs each comprising a plurality of insertion DNAs, wherein, during recombination of the first and second target DNAs, the crossover frequency between the insertion sequences in the first and second DNAs is higher than the crossover frequency of non-insertion sequences in the first and second target DNAs.

22. The method of claim 1, wherein the recombinant DNA encodes a molecule which does not comprise or encode a translated insertion sequence.

23. The method of claim 22, wherein the molecule is selected from a DNA, an RNA, an mRNA, a viral RN,A a sn RNA, a tRNA, an rRNA, a gRNA, a protein, and a proteolytically cleaved protein.

24. The method of claim 1, wherein the recombinant DNA encodes a protein with an activity selected from an insulin protein activity, a peptide hormone activity, a cytokine activity, an epidermal growth factor activity, a fibroblast growth factor activity, a hepatocyte growth factor activity, an insulin-like growth factor activity, an interferon activity, an interleukin activity, a keratinocyte growth factor activity, a leukemia inhibitory factor activity, an oncostatin M activity, a PD-ECSF activity, a pleiotropin activity, an SCF activity, a c-kit ligand activity, a VEGF activity, a G-CSF activity, an oncogene activity, a tumor suppressor activity, a steroid hormone receptor activity, an herbicide resistance activity, a monooxygenase activity, a nuclease activity, a lipase activity, an antibody V gene activity, a TGF.beta. activity, an NGF activity, a PDGF.beta. activity, a TNK.sub.or activity, a CNTF activity, a 4F activity, an RNase activity, an antibody activity, a tumor necrosis factor activity, a GM-CSF activity, a plant hormone activity, a disease resistance protein activity, a bacterial protein activity, a protease activity, a peptide ligand activity, a angiogenisis inhibitor activity, a C-X-C chemokine activity, a C--C chemokine activity, a cystein knot protein activity, and an EPO activity, wherein the recombinant nucleic acid does not hybridize under stringent conditions to a cDNA which encodes said activity, which cDNA is a copy of a naturally occurring mRNA.

25. The method of claim 1, wherein the first target DNA comprises two non-homologous subsequences and a plurality of insertion subsequences.

26. The method of claim 1, wherein the first or second target DNA or recombinant DNA is present in an expression vector.

27. A method of recombining a first and a second target nucleic acid, the method comprising:

providing a first and a second target nucleic acid, wherein at least one of the first and second target nucleic acids comprises a plurality of homologous or non-homologous insertion nucleic acid sequences, wherein the insertion nucleic acid sequences comprise at least one intein; and,

recombining the first and second target nucleic acids, thereby providing a recombinant nucleic acid,

wherein the recombinant nucleic acid encodes a protein subsequence, and, wherein the protein subsequence and the second protein subsequence are spliced in vitro to produce an active protein.

28. A method of recombining a first and a second target nucleic acid, the method comprising:

providing one or more parental nucleic acids, wherein at least one of the parental nucleic acids is recombined with a plurality of insertion nucleic acid sequences, thereby providing a first or a second target nucleic acid, wherein at least one of the first and second target nucleic acids comprises the plurality of homologous or non-homologous insertion nucleic acid sequences; and,

recombining the first and second target nucleic acids, thereby providing a recombinant nucleic acid,

wherein the parental nucleic acid corresponds to one or more of: a gene or cDNA encoding EPO, a gene or cDNA encoding an insulin protein, a gene or cDNA encoding a peptide hormone, a gene or cDNA encoding a cytokine, a gene or cDNA encoding an epidermal growth factor, a gene or cDNA encoding a fibroblast growth factor, a gene or cDNA encoding a hepatocyte growth factor, a gene or cDNA encoding insulin-like growth factor, a gene or cDNA encoding an interferon, a gene or cDNA encoding an interleukin, a gene or cDNA encoding a keratinocyte growth factor, a gene or cDNA encoding a leukemia inhibitory factor, a gene or cDNA encoding oncostatin M, a gene or cDNA encoding PD-ECSF, a gene or cDNA encoding PDGF, a gene or cDNA encoding pleiotropin, a gene or cDNA encoding SCF, a gene or cDNA encoding c-kit ligand, a gene or cDNA encoding VEGF, a gene or cDNA encoding G-CSF, a gene or cDNA encoding an oncogene, a gene or cDNA encoding a tumor suppressor, a gene or cDNA encoding a steroid hormone receptor, a gene or cDNA encoding a plant hormone, a gene or cDNA encoding a disease resistance gene, a gene or cDNA encoding an herbicide resistance gene, a gene or cDNA encoding a bacterial gene, a gene or cDNA encoding a monooxygenase, a gene or cDNA encoding a protease, a gene or cDNA encoding a nuclease, a gene or cDNA encoding a lipase, a gene or cDNA encoding a C-X-C chemokine, a gene or cDNA encoding a C--C chemokine, a gene or cDNA encoding an antibody V gene, a gene or cDNA encoding a cystein knot protein, a gene or cDNA encoding TGF.beta., a gene or cDNA encoding NGF, a gene or cDNA encoding PDGF.beta. a gene or cDNA encoding a TNKor family member, a gene or cDNA encoding CNTF, a gene or cDNA encoding 4F, a gene or cDNA encoding an RNase, a gene or cDNA encoding an antibody, a gene or cDNA encoding peptide ligand, a gene or cDNA encoding a tumor necrosis factor and a gene or cDNA encoding an angiogenisis inhibitor.

29. The method of recombining a first and a second target nucleic acid, the method comprising:

providing a first parental nucleic acid sequence and a second parental nucleic acid sequence, wherein the first and second parental nucleic acid sequences are homologous or non-homologous;

inserting a plurality of insertion nucleic acid sequences into one or more of the first and second parental nucleic acid sequences, thereby providing first and second target nucleic acids, wherein the first and second parental nucleic acid sequences hybridize under stringent conditions, and the first and second target nucleic acids do not hybridize under stringent conditions; and,

recombining the first and second target nucleic acids, wherein at least one of the first and second target nucleic acids comprises the plurality of homologous or non-homologous insertion nucleic acid sequences, thereby providing a recombinant nucleic acid.

30. The method of recombining a first and a second target nucleic acid, the method comprising:

providing a first parental nucleic acid sequence and a second parental nucleic acid sequence, wherein the first and second parental nucleic acid sequences are homologous or non-homologous;

inserting a plurality of insertion nucleic acid sequences into one or more of the first and second parental nucleic acid sequences, thereby providing the first and the second target nucleic acids, wherein the first and second parental nucleic acid sequences do not hybridize under stringent conditions, and wherein the first and second target nucleic acids hybridize under stringent conditions; and,

recombining the first and second target nucleic acids, wherein at least one of the first and second target nucleic acids comprises the plurality of homologous or non-homologous insertion nucleic acid sequences, thereby providing a recombinant nucleic acid.

31. The method of recombining a first and a second target nucleic acid, the method comprising:

providing a first parental nucleic acid sequence and a second parental nucleic acid sequence, wherein the first and second parental nucleic acid sequences are homologous or non-homologous;

inserting a plurality of insertion nucleic acid sequences into one or more of the first and second parental nucleic acid sequences, thereby providing the first and the second target nucleic acids, wherein the first and second target nucleic acids hybridize under stringent conditions, and wherein the first target nucleic acid does not hybridize under stringent conditions to the second parental nucleic acid, or wherein the second target nucleic acid does not hybridize under stringent conditions to the first parental nucleic acid sequence; and,

recombining the first and second target nucleic acids, wherein at least one of the first and second target nucleic acids comprises the plurality of homologous or non-homologous insertion nucleic acid sequences, thereby providing a recombinant nucleic acid.

32. The method of recombining a first and a second target nucleic acid, the method comprising:

providing a first parental nucleic acid sequence and a second parental nucleic acid sequence, wherein the first and second parental nucleic acid sequences are homologous or non-homologous;

inserting a plurality of insertion nucleic acid sequences into one or more of the first and second parental nucleic acid sequences, thereby providing the first and the second target nucleic acids, wherein the first or second parental nucleic acid sequence hybridizes to a third nucleic acid under stringent conditions, wherein the first and second target nucleic acids do not hybridize under stringent conditions to the third nucleic acid; and,

recombining the first and second target nucleic acids, wherein at least one of the first and second target nucleic acids comprises the plurality of homologous or non-homologous insertion nucleic acid sequences, thereby providing a recombinant nucleic acid.

33. The method of recombining a first and a second target nucleic acid, the method comprising:

providing a first and a second target nucleic acid, wherein at least one of the first and second target nucleic acids comprises a plurality of homologous or non-homologous insertion nucleic acid sequences;

recombining the first and second target nucleic acids, thereby providing a recombinant nucleic acid; and,

recombining the recombinant nucleic acid with a third nucleic acid, and, optionally, selecting the resulting secondary recombinant nucleic acid for a desired trait or property.

34. The method of recombining a first and a second target nucleic acid, the method comprising: providing a first and a second target nucleic acid, wherein at least one of the first and second target nucleic acid comprises a plurality of homologous or non-homologous insertion nucleic acid sequences, wherein the first and second target nucleic acids are derived from first and second parental nucleic acids by integration of a plurality of insertion sequences into the first and second parental nucleic acids, wherein the first and second parental nucleic acid are less than 50% identical over the full length of the first and second parental nucleic acids, when the first and second nucleic acids are aligned for maximum identity; and, recombining the first and second target nucleic acids, thereby providing a recombinant nucleic acid.

35. The method of claim 34, wherein the first and second parental nucleic acids are less than 25% identical over the full length of the first and second parental nucleic acids, when the first and second target nucleic are aligned for maximum identity
PATENT DESCRIPTION COPYRIGHT NOTIFICATION

Pursuant to 37 C.F.R. 1.71(e), Applicants note that a portion of this disclosure contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.

FIELD OF THE INVENTION

The present invention relates to molecular shuffling, and to splicing of nucleic acids and proteins.

BACKGROUND OF THE INVENTION

Nucleic acid shuffling provides for the rapid evolution of nucleic acids, in vitro and in vivo. Rapid evolution provides for the commercial production of encoded molecules (e.g., nucleic acids and proteins) with new and/or improved properties. Proteins and nucleic acids of industrial, agricultural and therapeutic value can be created or improved through shuffling procedures. A number of publications by the inventors and their co-workers describe nucleic acid shuffling and applications of this technology. For example, Stemmer et al. (1994) "Rapid Evolution of a Protein" Nature 370:389-391; Stemmer (1994) "DNA Shuffling by Random Fragmentation and Reassembly: in vitro Recombination for Molecular Evolution," Proc. Natl. Acad. USA 91:10747-10751; Stemmer U.S. Pat. No. 5,603,793 METHODS FOR IN VITRO RECOMBINATION; Stemmer et al. U.S. Pat. No. 5,830,721 DNA MUTAGENESIS BY RANDOM FRAGMENTATION AND REASSEMBLY; Stemmer et al., U.S. Pat. No. 5,811,238 METHODS FOR GENERATING POLYNUCLEOTIDES HAVING DESIRED CHARACTERISTICS BY ITERATIVE SELECTION AND RECOMBINATION describe, e.g., in vivo and in vitro nucleic acid, DNA and protein shuffling in a variety of formats, e.g., by repeated cycles of mutagenesis, shuffling and selection, as well as methods of generating libraries of displayed peptides and antibodies.

Applications of DNA shuffling technology have also been developed by the inventors and their co-workers. In addition to the publications noted above, Minshull et al., U.S. Pat. No. 5,837,458 METHODS AND COMPOSITIONS FOR CELLULAR AND METABOLIC ENGINEERING provides for the evolution of metabolic pathways and the enhancement of bioprocessing through recursive shuffling techniques. Crameri et al. (1996), "Construction And Evolution Of Antibody-Phage Libraries By DNA Shuffling" Nature Medicine 2(1):100-103 describe, e.g., antibody shuffling for antibody phage libraries. Additional details regarding DNA Shuffling can also be found in WO95/22625, WO97/20078, WO96/33207, WO97/33957, WO98/27230, WO97/35966, W098/ 31837, WO98/13487, WO98/13485 and WO989/42832.

Physical nucleic acid shuffling techniques (as opposed, e.g., to "in silico" methods which are performed, at least in part, by manipulation of character strings in a computer) rely upon actual recombination between physical nucleic acids, whether the format is an in vitro or an in vivo format. Recombination occurs at a relatively high frequency, e.g., where there are complementary nucleic acids between strands to be recombined. Thus. nucleic acids to be recombined are typically e.g., about 70% identical/complementary in sequence over regions of, e.g., about 30-40 nucleotides. It would be desirable to be able to recombine low homology, or even non-homologous sequences, thereby increasing access to the potential sequence space encoded by recombinant nucleic acids resulting from shuffling methods. For example, for proteins which are commercially valuable, it would be desirable to be able to gain access to a recombination/mutation spectrum which is different than that of the native protein to provide for greater diversity in products produced by the various available shuffling strategies.

Similarly, nucleic acid recombination generally can be difficult to modulate, resulting in regions of high or low crossover frequency between two different targets for recombination. The crossover frequency for a particular pairing of sequences on two different targets is one feature that mediates the recombinant nucleic acids that result from recombination methods. Improved methods of modulating the recombination frequency at potential recombination sites would be desirable to weight/bias recombination product outcomes.

In general, new techniques which facilitate, improve or add levels of control to recombination methods are highly desirable. In particular, techniques which permit shuffling of divergent nucleic acids, or which provide for modulation and tuning of shuffling rates are desirable. The present invention provides such significant new recombination protocols, as well as other features which will be apparent upon complete review of this disclosure.

SUMMARY OF THE INVENTION

The present invention provides a number of new nucleic acid recombination formats for nucleic acid shuffling. In the methods, a number of insertion sequences are inserted into one or more parental nucleic acid to provide a modified target nucleic acid substrate for recombination and subsequent mutation. The number, type and placement of such insertion sequences provides for the ability to shuffle nucleic acids with little or no homology other than the insertion sequences. In addition, these insertion sequences provide for the ability to modulate or "tune" recombination frequencies between target nucleic acids. The methods typically take advantage of self-splicing, trans-splicing or use cellular machinery to remove the insertion sequences from final coded nucleic acids or proteins, e.g., where the insertion sequences are introns, inteins, proteolyzed polypeptide sequences or the like. The insertion sequences can also comprise markers, molecular tags, or the like, e.g., for purification of encoded molecules or can serve to allow for expression of otherwise toxic proteins (e.g., RNases, Dnases, restriction enzymes, proteases, lipases, recombinases, ligases, polymerases, etc.) e.g., in a form where an intein is excised in vivo. Similarly, in vitro expression of insertion modified sequences can result in the production of these and other proteins in vitro, e.g., using in vitro expression systems.

Methods of shuffling two target nucleic acids (i.e., a first and a second target nucleic acid) are provided. In the methods, a first and a second target nucleic acid are provided, e.g., by cloning, PCR amplification, synthesis, isolation from an environmental source (soil, air, water, etc.), or other methods. At least one of the first and second target nucleic acids (and typically both) have a plurality of homologous or non-homologous insertion nucleic acid sequences, such as one or more intron (e.g., self-splicing bacterial, eukaryotic or trans-splicing intron), intein, subsequence removed by site specific recombination (e.g., similar to V-D-J recombination for antibody production), or the like, optionally including intron splicing enhancers or the like. The target nucleic acids are recombined, producing a shuffled recombinant nucleic acid.

In addition to providing for new recombination methods per se, the invention also provides methods of producing selected proteins and RNAs, for any of the purposes that such proteins and RNAs are ordinarily produced. For example, in one aspect, a first shuffled nucleic acid subsequence encoding a first portion of the selected protein and a second nucleic acid subsequence encoding a second portion of the selected protein is provided. The nucleic adds can be on the same strand (as in cis-mediated reactions) or on different strands (as in trans mediated reactions). The first and second subsequences are expressed to produce a first protein subsequence and a second protein subsequence, which are spliced to produce the selected protein. Commonly, more than two subsequences are spliced, e.g., 3, 4, 5, 6, 7, 8, 9, 10 or more sequences, as set forth herein. The splicing reaction can be in cis or in trans (or both) and can be in viro or in vivo (or both). Splicing can occur by spontaneous or controlled mechanisms.

Similarly, in RNA production methods, a first shuffled nucleic acid subsequence encoding a first portion of the selected RNA is provided and a second nucleic acid subsequence encoding a second portion of the selected RNA is also provided. Again, these subsequences can be on the same or on different molecules (depending on whether cis or trans splicing is employed). The first and second nucleic acid subsequences, or RNA copies thereof, are spliced to produce the selected RNA, which can encode a useful RNA (e.g., an antisense, or sense molecule, or ribozyme) or the RNA can encode a protein. The intein and RNA shuffling/production methods are combinable, i.e., the spliced RNA molecules can encode intein-extein sequences which are spliced at the protein level to produce a useful protein.

In general, a parental nucleic acid can be broken into several exons or exteins by incorporation of a number of introns or inteins into the sequence of the parental nucleic acid. For example, the target nucleic acid resulting from incorporation of insertion sequences into the parental nucleic acid can have, e.g., about 5, 10, 15, 20, 30, 50, 100 or more "mini exons," or "mini exteins" separated by a corresponding number of insertion sequences.

In shuffling reactions, first and second target nucleic acids are optionally derived from a first and second parental nucleic acid which are sufficiently different in sequence that they do not substantially hybridize in solution. For example, the first and second target nucleic acids can be derived by integration of a plurality of insertion sequences into the first and second parental nucleic acid. The first and second parental nucleic acid can be, e.g., less than 50%, or less than e.g., 40%, or less than e.g., 30%, or less than e.g., 25% or less than e.g., 15% identical over the full length of the first and second parental nucleic acid, when the first and second nucleic acids are aligned for maximum identity.

The insertion nucleic acid sequences can modulate a recombination frequency between the first and second target nucleic acid. For example, by placing an intron into a parental sequence, the recombination efficiency of nucleic acid subsequences to either side of the intron can be decreased. Similarly, placing homologous mini introns within the parental sequences provides sites for recombination within the resulting targets, e.g., where the targets display regions of low similarity in non-intronic sequences.

Insertion sequences can also modulate expression in one or more cell type, e.g., where the insertion sequences comprise one or more enhancer or other regulatory sequence. Similarly, insertion sequences optionally comprise splicing enhancer sequences (e.g., ISEs, such as the chicken cardiac troponin T (cTNT) ISE) to facilitate splicing.

Essentially any nucleic acid can be a parental nucleic acid with which insertion sequences can be combined to produce a target nucleic acid for splicing. Example sequences include parental nucleic acids corresponding a gene or cDNA encoding EPO, a gene or cDNA encoding an insulin protein, a gene or cDNA encoding a peptide hormone, a gene or cDNA encoding a cytokine, a gene or cDNA encoding an epidermal growth factor, a gene or cDNA encoding a fibroblast growth factor, a gene or cDNA encoding a hepatocyte growth factor, a gene or cDNA encoding insulin-like growth factor, a gene or cDNA encoding an interferon, a gene or cDNA encoding an interleukin, a gene or cDNA encoding a keratinocyte growth factor, a gene or cDNA encoding a leukemia inhibitory factor, a gene or cDNA encoding oncostatin M, a gene or cDNA encoding PD-ECSF, a gene or cDNA encoding PDGF, a gene or cDNA encoding pleiotropin, a gene or cDNA encoding SCF, a gene or cDNA encoding c-kit ligand, a gene or cDNA encoding VEGF, a gene or cDNA encoding G-CSF, a gene or cDNA encoding an oncogene, a gene or cDNA encoding a tumor suppressor, a gene or cDNA encoding a steroid hormone receptor, a gene or cDNA encoding a plant hormone, a gene or cDNA encoding a disease resistance gene, a gene or cDNA encoding an herbicide resistance gene, a gene or cDNA encoding a bacterial gene, a gene or cDNA encoding a monooxygenase, a gene or cDNA encoding a protease, a gene or cDNA encoding a nuclease, an antibody, a peptide ligand, an angiogenisis inhibitor, a gene, or cDNA encoding a lipase, a gene or cDNA encoding a C-X-C chemokine, a gene or cDNA encoding a C--C chemokine, a gene or cDNA encoding an antibody V gene, a gene or cDNA encoding a cystein knot protein such as TGF.beta., NGF, PDGF.beta. or the like, a gene or cDNA encoding a TNK.sub.or family member, a gene or cDNA encoding CNTF, a gene or cDNA encoding 4F, and/or gene or cDNA encoding an RNase.

The methods herein are amenable to both physical recombination of nucleic acids and to virtual or "in silico" recombination of character strings representing nucleic acids, e.g., in a computer. Following complete or partial sequence recombination in silico, target nucleic acids, or nucleic acids derived from the target nucleic acids can be synthesized. Such synthetic nucleic acids can be recombined, cloned, selected or otherwise manipulated in the same manner as any other nucleic acid.

A variety of techniques can be used to produce target nucleic acids comprising insertion sequences. Such methods include chemical synthesis, PCR concatemerization, in silico character string formation or generation, and the like. For example, in one embodiment, insertion of the plurality of insertion nucleic acid sequences into one or more of the first and second parental nucleic acid sequences is performed by physically joining a plurality of subsequences of the first or second parental nucleic acid sequences to the plurality of insertion nucleic acid sequences.

As noted, the addition of insertion sequences to parental nucleic acids can modify or modulate the recombination of resulting target nucleic acids. Similarly, the addition of insertion sequences can alter the hybridization properties of resulting target sequences. For example, even non-homologous parental nucleic acids can be made to hybridize by the addition of a sufficient number and appropriate arrangement of insertion sequences. Similarly, a target nucleic acid derived from a parental sequence can be made which does not hybridize under a selected set of conditions (e.g., stringent hybridization conditions) to the parental nucleic acid. As noted above, such insertion sequences can be used to tune recombination rates between selected regions of a target nucleic acid, e.g., where a particular region is targeted for an increased or decreased recombination rate.

The target and parental nucleic acids can have dramatically different hybridization properties as a result of the insertion sequences being present in the target nucleic acids. The target nucleic acids can be prevented from hybridizing to the parents by inclusion of the target sequences, or, conversely, one or more target sequence can even be made to hybridize to one or more parent, thereby controlling the recombination properties of resulting nucleic acid shuffling reactions. Thus, in one embodiment, the first and second parental nucleic acid sequences hybridize under stringent conditions, and the first and second target nucleic acids do not hybridize under stringent conditions. Similarly, in another embodiment, the first and second parental nucleic acid sequences do not hybridize under stringent conditions, while the first and second target nucleic acids hybridize under stringent conditions. In yet another embodiment, the first and second nucleic target nucleic acid hybridize under stringent conditions, while the first target nucleic acid does not hybridize under stringent conditions to the second parental nucleic acid, or wherein the second target nucleic acid does not hybridize under stringent conditions to the first parental nucleic acid. Similarly, in one embodiment, the first or second parental nucleic acid hybridizes to a third nucleic acid under stringent conditions, where the first and second target nucleic acids do not hybridize under stringent conditions to the third nucleic acid. A variety of other modifications in hybridization due to the number and arrangement of insertion sequences will be apparent upon complete review.

Recombinant nucleic acids generated by recombining nucleic acid sequences comprising insertion subsequences can, of course, be recombined or shuffled, cloned, amplified, expressed in vivo or in vitro, synthesized, or otherwise modified using any available naturally mediated or laboratory-mediated technique. For example, in one embodiment, a shuffled recombinant nucleic acid made by recombining one or more target nucleic acid comprising a plurality of insertion sequences with one or more additional nucleic acid(s) is recombined with a third nucleic acid. The resulting secondary shuffled recombinant nucleic acid can be selected for a desired trait or property using any available selection method. In general, any recombinant nucleic acid can be selected for a desired trait or property.

Recombinant nucleic acids are also optionally expressed in a cell or in vitro, thereby producing a nucleic acid or protein. In one embodiment, the expressed protein can comprise intein and extein sequences. Typically, the intein (some times referred to as an "intervening protein sequence") is excised from an expressed protein sequences. Concomitantly, the ligation of the flanking sequences (exteins) form a mature "extein protein" which is, optionally, active in one or more cell or in one or more in vitro reaction or system. Thus, expressed proteins can be proteolytically cleaved and ligated to produce an active protein, and/or to remove an intein from an expressed protein. This ligation reaction can occur both cis- and trans-splicing reaction formats. Reactions occur in vitro or in vivo for cis or trans splicing inteins. For additional details regarding trans splicing of introns and inteins. see, Patten et al. "ENCRYPTION OF TRAITS USING SPLIT GENE SEQUENCES AND ENGINEERED GENETIC ELEMENTS" U.S. Ser. No. 60/164,618 Filed Nov. 10, 1999.

The presence of insertion sequences can be used to modulate recombination rates between regions of nucleic acids. For example, the cross over frequency between two points on a first and second target nucleic acids can typically be increased by placing insertion sequences between the two points. This is desirable, e.g., where low linkage rates between regions of nucleic acids to be recombined are desired, e.g., where one wishes to separately evolve different functional domains or elements of the nucleic acid.

Recombinant nucleic acids can be modified by removal of insertion sequencres to improve expression or facilitate cloning of any final product. For example, where a nucleic acid encodes a plurality of intronic insertion sequences, the encoded mRNA can be reverse transcribed and the resulting cDNA cloned or otherwise manipulated. It should be noted that this process can result in a cDNA which does not hybridize to the recombinant nucleic acid comprising the introns. Indeed, the cDNA can be the result of several rounds of selection and recombination, resulting in a cDNA with a highly unique sequence which does not hybridize under e.g., stringent conditions, to any previously known sequence. Thus, sequence space which is inaccesible between two known nucleic acids is accessible by this procedure, resulting in recombinant products that could not otherwise be obtained.

The final product produced by any of the procedures herein can be a DNA (e.g., a genomic DNA, an artificial DNA, a cDNA, or the like), an RNA, an mRNA, a viral RNA, a sn RNA, a tRNA, an rRNA, a gRNA, a protein, a proteolytically cleaved protein, a protein fragment, a spliced protein or any other molcule that can be encoded by a nucleic acid, including e.g., metabolic products and the like. As noted, target sequences can comprise homologous or non homologous nucleic acid subsequences which can be separated by homologous or non homologous insertion sequences. The target nucleic acids to be recombined can be homologous relative to each other, or comprise homologous and non-homologous sequences relative to each other. The nucleic acids can be present in vectors such as expression vectors, or can be free in solution.

The nucleic acids to be recombined can be present in recombination mixtures. For example one recombination mixture of the invention includes a first target nucleic acid comprising a plurality of insertion subsequences. Typically, the mixture also includes a second target nucleic acid having at least one region of sequence similarity to the first nucleic acid. The second target nucleic acid typically also includes a plurality of insertion subsequences.

In one format, a recombination mixture resulting from fragmenting a first target nucleic acid comprising a plurality of insertion subsequences, and a second target nucleic acid comprising at least one region of sequence similarity to the first target nucleic acid is provided. For example, the first and second target nucleic acids can be fragmented with a DNase or, e.g., cleaved chemically to produce nucleic acid fragments. Similarly, the first and second target nucleic acids can be "fragmented" by chemically synthesizing fragments of the first and second target nucleic acid.

Recombinant nucleic acids produced by recombining the recombination mixtures of invention are also provided. For example, the first or second nucleic acid can include one or more subsequence corresponding to one or more subsequence from one or more gene or cDNA such as a gene or cDNA encoding EPO, a gene or cDNA encoding an insulin protein, a gene or cDNA encoding a peptide hormone, a gene or cDNA encoding a cytokine, a gene or cDNA encoding an epidermal growth factor, a gene or cDNA encoding a fibroblast growth factor, a gene or cDNA encoding a hepatocyte growth factor, a gene or cDNA encoding insulin-like growth factor, a gene or cDNA encoding an interferon, a gene or cDNA encoding an interleukin, a gene or cDNA encoding a keratinocyte growth factor, a gene or cDNA encoding a leukemia inhibitory factor, a gene or cDNA encoding oncostatin M, a gene or cDNA encoding PD-ECSF, a gene or cDNA encoding PDGF, a gene or cDNA encoding pleiotropin, a gene or cDNA encoding SCF, a gene or cDNA encoding c-kit ligand, a gene or cDNA encoding VEGF, a gene or cDNA encoding G-CSF, a gene or cDNA encoding an oncogene, a gene or cDNA encoding a tumor suppressor, a gene or cDNA encoding a steroid hormone receptor, a gene or cDNA encoding a plant hormone, a gene or cDNA encoding a disease resistance gene, a gene or cDNA encoding an herbicide resistance gene, a gene or cDNA encoding a bacterial gene, a gene or cDNA encoding a monooxygenase, a gene or cDNA encoding a protease, a gene or cDNA encoding a nuclease, a gene or cDNA encoding an RNase, and/or a gene or cDNA encoding a lipase. Of course, many other nucleic acids/proteins can be made or modified by the methods herein. The resulting recombinant nucleic acid can also comprise activities and subsequences which correspond to these nucleic acids.

In one aspect, the invention provides methods of recombining a plurality of sequence domains from a plurality of homologous or non-homologous nucleic acid sequences. In the methods, a pre-mRNA comprising a plurality of sequence domains is provided which correspond to a plurality of different parental nucleic acid sequences. The pre-mRNA is alternatively spliced to produce a plurality of different mRNAs comprising a plurality of different sets of sequence domains. Typically, the pre-mRNA has between about 6 and about 20 exons or exteins, e.g., where the pre-mRNA has a plurality of mini exons or exteins. Most typically, the plurality of different mRNAs are selected for a desired trait or property. Optionally, the methods include cloning one or more of the plurality of different mRNAs.

In this alternative splicing/recombination strategy, the methods typically include recombining one or more of: the plurality of different mRNAs, the pre-mRNA, a DNA encoding the mRNA, and a DNA encoding the pre-mRNA, with one or more additional nucleic acid.

In one embodiment, the pre-mRNA is provided to a cell by transducing or transfecting the cell with a vector comprising a DNA encoding the pre-mRNA. As discussed throughout, in vitro formats are also available.

The present invention also provides methods of making a nucleic acid with a desired splicing phenotype. In the methods, a plurality of homologous nucleic acids are provided, each comprising a plurality of insertion nucleic acid sequences. The plurality of homologous nucleic acids are recombined to produce a library of recombinant nucleic acids, which are selected for production of a desired or selected mRNA or protein (or product thereof) when the selected recombinant nucleic acid is expressed in vitro or in a cell. As with any nucleic acid noted above, this selected nucleic acid is optionally recombined with an additional nucleic acid and the resulting secondary recombinant nucleic acid selected for production of a desired mRNA or protein (or product thereof).

The nucleic acids noted above which include insertion sequences will typically comprise as many as 10 insertion sequences and as many as 10 flanking sequences (e.g., exons or exteins) or more. Insertion nucleic acid sequences include those derived from bacterial introns, eukaryotic introns and archaebacterial introns, as well as bacterial inteins, eukaryotic inteins and archaebacterial inteins. The nucleic acids are recombined in vitro or vivo.

The present invention also provides apparatus, integrated systems and kits for practicing the methods herein, e.g., comprising use of the recombination mixtures herein, containers, instruction sets for practicing the methods herein, and the like.

PATENT EXAMPLES This data is not available for free
PATENT PHOTOCOPY Available on request

Want more information ?
Interested in the hidden information ?
Click here and do your request.


back