The restriction/modification system in bacteria is asmall-scale immune systemfor protection from infection by foreign DNA.
W. Arber and S. Linn (1969)
Plating efficiencies of bacteriophage lambda (l phage) grown onE. colistrainsC,K-12andB, when plated on these bacteria:
E. colistrain on which parental phage had been grown
E. colistrain for plating phage
C
K-12
B
C
1
<10-4
<10-4
K-12
1
1
<10-4
B
1
<10-4
1
The DNA of phage which had been grown on strains K-12 and B were found to have chemically modified bases which weremethylated.
Additional studies with other strains indicate that different strains hadspecificmethylated bases.
Typical sites of methylation include theN6position of adenine, theN4position of cytosine, or theC5position of cytosine.
In addition, only a fractional percentage of bases were methylated (i.e. not every adenine was methylated, for example) and these occurred atvery specific sitesin the DNA.
A characteristic feature of the sites of methylation, was that they involvedpalindromicDNA sequences.
(EcoR1 methylase specificity. Rubin and Modrich, 1977)
In addition to possessing a particular methylase, individual bacterial strains also contained accompanying specificendonucleaseactivities.
The endonucleasescleaved at or near the methylation recognition site.
These specific nucleases, however, wouldnotcleave at these specific palindromic sequences if the DNA wasmethylated.
Thus, this combination of a specific methylase and endonuclease functioned as a type ofimmune systemfor individual bacterial strains, protecting them from infection by foreign DNA (e.g. viruses).
In the bacterial strainEcoR1, the sequenceGAATTCwill be methylated at the internal adenine base (by the EcoR1 methylase).
The EcoR1 endonuclease within the same bacteria willnotcleave the methylated DNA.
Foreign viral DNA, which is not methylated at the sequence "GAATTC" will therefore be recognized as "foreign" DNA andwillbe cleaved by the EcoR1 endonuclease.
Cleavage of the viral DNA renders it non-functional.
Such endonucleases are referred to as"restriction endonucleases"because they restrict the DNA within the cell to being "self".
The combination of restriction endonuclease and methylase is termed the"restriction-modification"system.
Of course, this type of protective system is beaten if the attacking phage was previously grown on the same strain as that which it is infecting. In this case the phage will have its DNA already methylated at the appropriate sequence, and will be recognized as "self" (see the table above).E. colistrain 'C' (above) is strain which has no known restriction-modification system.
We will discuss DNA replication later, but it should be mentioned that:
replicating host DNA will initially have one strand (parental) methylated and the other (nascent strand) non-methylated.
This is recognized as "self" and is not cleaved by the restriction endonuclease.
It is subsequently methylated by the host methylase.
Structural and biochemical studies have indicated that for the common R/M systems (so calledtype II), the methylase recognizes and methylatesone strandof the DNA duplex, whereas the restriction endonuclease recognizesboth strandsof the DNA (i.e. both strands must be non-methylated for recognition). It is able to do this because it is a homo-dimer protein.
Restriction endonucleases
Since different bacterial strains and species have potentially different R/M systems, their characterization has made available over 200 endonucleases with different sequence specific cleavage sites.
They are one of the primary tools in modern molecular biology for the manipulation and identification of DNA sequences.
Restriction endonucleases are commonly named after the bacterium from which it was isolated.
Examples of different restriction enzymes
Name
Source
Recognition Sequence
Comments
Alu I
Arthrobacter luteus
|
5'… A G C T … 3'
3'… T C G A … 5'
|
"Four cutter". Leaves blunt ends to the DNA.
Bfa I
Bacteroides fragilis
|
5'… C T A G … 3'
3'… G A T C … 5'
|
"Four cutter". Leaves 5' overhang.
Nci I
Neisseria cinerea
|
C
5'… C C G G G … 3'
3'… G G C C C … 5'
G
|
"Five cutter". Middle base can be either cytosine or guanine. Leaves 5' overhang.Different recognition sites may have non-complementary sequences.
Eco R1
Escherichia coli
|
5'… G A A T T C … 3'
3'… C T T A A G … 5'
|
"Six cutter". Leaves 5' overhang. Behaves like a "four cutter" ('star' activity) in high salt buffer. $44 for 10,000 units.
Hae II
Haemophilus aegyptius
|
5'… Pu G C G C Py … 3'
3'… Py C G C G Pu … 5'
|
"Six cutter". Pu is any purine, Py is any pyrimidine. Leaves 3' overhang.
EcoO109I
Escherichia coli
|
5'… Pu G G N C C Py … 3'
3'… Py C C N G G Pu … 5'
|
"Seven cutter". Pu is any purine, Py is any pyrimidine, N is any base. Leaves 5' overhang.Different recognition sites may have non-complementary sequences.
Bgl I
Bacillus globigii
|
5'… GCCN NNNNGGC … 3'
3'… CGGNNNN NCCG … 5'
|
"Six cutter with interrupted palindrome". Leaves 5' overhang.Different recognition sites may have non-complementary sequences.
Bsa HI
Bacillus stearothermophilus
|
5'… G Pu C G Py C … 3'
3'… C Py G C Pu G … 5'
|
"Six cutter". Different recognition siteswillbe complementary.
Aat II
Acetobacter aceti
|
5'… G A C G T C … 3'
3'… C T G C A G … 5'
|
"Six cutter" with 3' overhang. Same recognition sequence as Bsa HI, but different cleavage position.
Bpm I
Bacillus pumilus
|
5'… C T G G A G N16 … 3'
3'… G A C C T C N14 … 5'
|
Non-palindrome, distal cleavage. Leaves 3' overhang. $50 for 50 units.
Not I
Nocardia otitidiscaviarum
|
5'… G C G G C C G C … 3'
3'… C G C C G G C G … 5'
|
"Eight cutter". Leaves 5' overhang.
Bsm I
Bacillus stearothermophilus
|
5'… G A A T G C N … 3'
3'… C T T A C G N … 5'
|
"weird". Leaves 3' overhang.
The utility of restriction endonucleases lies in their specificity and the frequency with which their recognition sites occur within any given DNA sample.
If there is a 25% probability for a specific base at any given site, then the frequency with which different restriction endonuclease sites will occur can be easily calculated (0.25n):
Nucleotide Specificity
Example
Frequency of Occurrence
Four
Alu I
256 (0.25 Kb)
Five
Nci I
1024 (1.0 Kb)
Six
EcoR I
4096 (4.1 Kb)
Seven
EcoO109I
16384 (16.4 Kb)
Eight
Not I
65536 (65.5 Kb)
Thus, on average, any given DNA will contain an Alu I site every 0.25 kilobases, whereas a Not I site occurs once about every 65.5 kilobases.
Not I is therefore a very useful enzyme for isolating large regions of DNA, typically in research involving genomic DNA manipulations.
Alu I would be expected to digest a DNA sample into lots of little pieces.
The assortment of DNA fragments would represent a specific "fingerprint" of the particular DNA being digested. Different DNA would not yield the same collection of fragment sizes. Thus, DNA from different sources can be either matched or distinguished based on the assembly of fragments after restriction endonuclease treatment.These are termed "Restriction Fragment Length Polymorphisms", or RFLP's. This simple analysis is used in various aspects of molecular biology as well as a law enforcement and genealogy. For example, genetic variations which distingish individuals also may result in fewer or additional restriction endonuclease recognition sites.