💡 Quick Summary
Restriction Digest performs a virtual restriction digest of one or more DNA sequences using up to three restriction enzymes simultaneously. You can digest linear or circular molecules, including mixtures of sequences in multi-FASTA format. The resulting fragments are sorted by size and each is reported in FASTA format with its length, position in the original sequence, and the enzyme sites that produced it.
📋 How to Use
- Paste one or more DNA sequences (raw or FASTA) into the text area.
- Select the molecule topology: linear (default) or circular.
- Select up to three restriction enzymes. Enzyme 1 is required; enzymes 2 and 3 default to "nothing" (no digestion).
- Click Submit. Fragments are displayed sorted from largest to smallest, each with its source sequence, position range, and the enzyme pair that produced it.
- Click Load Example to digest a sample pBR322 sequence with AluI.
- Use Copy All to copy the FASTA fragment report to your clipboard.
🧮 Formulas & Logic
📊 Result Interpretation
Each fragment entry reports: SIZE bp linear fragment from TOPOLOGY parent TITLE, base START to base STOP (ENZYME1 - ENZYME2).
If no enzyme site is found in a circular molecule, it is reported as: SIZE bp circular molecule from circular parent TITLE.
"sequence start" and "sequence end" indicate the natural ends of an uncut linear molecule. All other labels are enzyme names.
All fragments from all input sequences are pooled and sorted together by size — simulating running a mixed digest on a gel.
🔬 Applications
- Predicting fragment sizes before running a restriction digest gel
- Verifying a cloning strategy by checking expected fragment patterns
- Identifying diagnostic restriction sites that distinguish two sequences
- Designing Southern blot probes by finding appropriate fragment sizes
- Confirming the orientation of an insert by asymmetric digest pattern
⚠️ Common Mistakes & Warnings
This tool supports digestion with one, two, or three enzymes in a single run. For additional enzymes, perform successive digests using the previous run's output.
Some enzymes (e.g. HinfI g|antc, HincII gty|rac) use IUPAC degenerate codes in their recognition sequences. The sequence you enter should contain only standard bases (A, T, G, C) for reliable results; degenerate input bases are retained by removeNonDna() but may affect site detection.
This is a purely sequence-based virtual digest. Methylation-sensitive enzymes (e.g. ClaI blocked by Dam methylation at ATCGAT) may show more sites in silico than in a real digest.
❓ Frequently Asked Questions
How is a circular molecule handled?
Why do I see more fragments than expected?
What does the cut distance number mean?
/ggatcc/ (BamHI g|gatcc)5), the trailing number is the distance in bases from the end of the match to the cut site on the top strand. For a 6-bp site with the number 5, the cut is after the 1st base. For a blunt cutter (number = half the recognition sequence length), both strands are cut at the same position.