Abstract
We characterize the Shine and Dalgarno sequences of 124 known gene beginnings. This information is used to make "rules" that help distinguish gene beginnings from other sites in a library of over 78,000 bases of mRNA. Gene beginnings are found to contain information beyond the initiation codon and the Shine and Dalgarno sequence, and this information can be used to make better "rules".
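As a rough illustration of what such a "rule" might look like, the sketch below flags an initiation codon as a candidate gene beginning when a Shine and Dalgarno-like stretch lies shortly upstream. It is not the rule set derived in the paper: the motif `AGGAGG`, the 20-base upstream window, the minimum match length of 4, and the function names are all assumptions chosen for illustration.

```python
# Illustrative sketch only; the motif, window size, and threshold are assumptions.
SD_CORE = "AGGAGG"             # assumed Shine and Dalgarno-like core motif
START_CODONS = ("AUG", "GUG")  # initiation codons considered in this sketch
UPSTREAM_WINDOW = 20           # assumed number of bases searched upstream


def best_sd_match(window, min_len=4):
    """Return the length of the longest piece of SD_CORE (at least min_len
    bases) that occurs in `window`, or 0 if there is none."""
    for length in range(len(SD_CORE), min_len - 1, -1):
        for i in range(len(SD_CORE) - length + 1):
            if SD_CORE[i:i + length] in window:
                return length
    return 0


def candidate_starts(mrna, min_len=4):
    """List (position, codon, sd_match_length) for each initiation codon
    that has a Shine and Dalgarno-like match in its upstream window."""
    mrna = mrna.upper().replace("T", "U")  # accept DNA or RNA alphabets
    hits = []
    for pos in range(len(mrna) - 2):
        codon = mrna[pos:pos + 3]
        if codon not in START_CODONS:
            continue
        upstream = mrna[max(0, pos - UPSTREAM_WINDOW):pos]
        sd_len = best_sd_match(upstream, min_len)
        if sd_len:
            hits.append((pos, codon, sd_len))
    return hits


if __name__ == "__main__":
    # Made-up test sequence, not taken from the 124 gene beginnings.
    print(candidate_starts("UUUAAGGAGGUUUAAAUAUGAAACGC"))
```

A rule this simple would accept many sites that are not gene beginnings and miss some real ones, which is consistent with the abstract's point that information beyond these two elements is needed to make better rules.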