💡 Quick Summary
ORF Finder searches a DNA sequence for open reading frames and returns the position of each ORF together with its protein translation. Use it to find potential protein-coding regions in newly sequenced DNA. Supports all six reading frames (three on each strand), all IUPAC degenerate bases, and 17 genetic code tables.
📋 How to Use
- Paste a raw DNA sequence or a single FASTA record into the textarea. Input limit: 100,000,000 characters.
- Choose which codons can start an ORF: any codon (open search), atg only, or atg/gtg/ctg/ttg.
- Select the reading frame (1, 2, 3, or all three) and strand (direct or reverse complement).
- Enter the minimum ORF length in codons (default: 30). Only ORFs at least this long, counting the stop codon, are reported.
- Select the genetic code table appropriate for your organism.
- Click Run. Each ORF is reported in FASTA format: first the DNA sequence with its coordinates, then the protein translation.
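If you prepare input in a script before pasting it, the first step above can be approximated as follows. This is a rough sketch, not the tool's own parser: the function name `clean_input` is hypothetical, and the assumption that everything other than letters (whitespace, digits) is discarded is ours rather than documented behaviour.

```python
def clean_input(text):
    """Reduce raw DNA or a single FASTA record to an uppercase sequence string.

    Assumption: the optional FASTA header is the first line and starts with '>';
    every non-letter character (whitespace, digits) is dropped.
    """
    lines = text.strip().splitlines()
    if lines and lines[0].startswith(">"):
        lines = lines[1:]  # skip the FASTA header line
    joined = "".join(lines).upper()
    return "".join(ch for ch in joined if ch.isalpha())
```

IUPAC degenerate letters such as N, R, and Y pass through unchanged, since only non-letters are removed.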
🧮 Formulas & Logic
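The search configured under How to Use can be sketched in Python roughly as below. This is a minimal illustration, not the tool's actual code. It assumes the standard genetic code (table 1), scans one frame of the direct strand, does not model the "any codon" start option, drops ORFs that reach the end of the sequence without a stop codon, and translates any codon containing a non-ACGT base to X. The names `find_orfs` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
STOP_CODONS = frozenset(c for c, aa in CODON_TABLE.items() if aa == "*")


def find_orfs(dna, frame=1, start_codons=frozenset({"ATG"}), min_codons=30):
    """Return 1-based inclusive (start, end) pairs for ORFs in one frame.

    The length check counts the stop codon. Assumption: an ORF must end
    with a stop codon to be reported.
    """
    seq = dna.upper()
    orfs = []
    orf_start = None  # 0-based index of the current start codon, if any
    for i in range(frame - 1, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if orf_start is None:
            if codon in start_codons:
                orf_start = i
        elif codon in STOP_CODONS:
            n_codons = (i + 3 - orf_start) // 3
            if n_codons >= min_codons:
                orfs.append((orf_start + 1, i + 3))
            orf_start = None
    return orfs


def translate(orf_dna):
    """Translate codon by codon; stops become '*', unknown codons become 'X'."""
    seq = orf_dna.upper()
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3))


if __name__ == "__main__":
    demo = "CCATGAAATTTTAGGG"
    for start, end in find_orfs(demo, frame=3, min_codons=4):
        print(start, end, translate(demo[start - 1:end]))  # prints: 3 14 MKF*
```

To use the four-codon start option, pass `start_codons=frozenset({"ATG", "GTG", "CTG", "TTG"})`. Other genetic code tables would need a different `AMINO_ACIDS` string.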
📊 Result Interpretation
"ORF number N in reading frame RF on the STRAND strand extends from base START to base END." Coordinates are 1-based.
The nucleotide sequence of the ORF, 60 bases per line.
The amino acid sequence of the ORF. Stop codon is shown as *. Unknown codons become X.
On the reverse strand, coordinates refer to positions in the reverse-complement sequence, not in the original input.
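To map reverse-strand coordinates back onto the sequence you pasted, note that position p on the reverse complement of a sequence of length L corresponds to position L - p + 1 on the original. The sketch below illustrates this. The function names are hypothetical, and the complement map assumes the usual IUPAC nucleotide codes.

```python
# IUPAC complement map, including degenerate bases.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (IUPAC-aware)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def to_original_coords(start, end, seq_len):
    """Convert 1-based inclusive reverse-complement coordinates to original ones.

    The returned pair is ordered low to high on the original sequence.
    """
    return seq_len - end + 1, seq_len - start + 1
```

For example, with the 10-base input `AAACCCGGGT`, bases 2 to 4 of the reverse complement (`CCC`) correspond to bases 7 to 9 of the original (`GGG`).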
🔬 Applications
- Finding protein-coding regions in newly sequenced genomic or cDNA sequences
- Checking all six reading frames of a PCR product for unexpected open reading frames
- Identifying the longest ORF in a sequence for expression vector cloning
- Translating an annotated CDS to verify its protein sequence
⚠️ Common Mistakes & Warnings
The original Sequence Manipulation Suite (SMS) ORF Finder accepts a single sequence, so paste one FASTA record at a time.
With start = "any codon", every codon can begin an ORF, so overlapping ORFs that end at the same stop codon will each be reported. Use "atg only" for conventional gene-finding.
A minimum of 30 codons means the ORF must span at least 30 codons counting the stop. A 30-codon ORF encodes a 29-residue protein plus stop.
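The length arithmetic can be checked with a few lines of Python. This is only an illustration of the counting rule stated above; the helper name `orf_size` is hypothetical.

```python
def orf_size(n_codons):
    """Return (nucleotides, residues) for an ORF of n_codons codons, stop included."""
    nucleotides = 3 * n_codons
    residues = n_codons - 1  # the stop codon encodes no amino acid
    return nucleotides, residues


print(orf_size(30))  # prints: (90, 29)
```

So the default threshold of 30 codons corresponds to an ORF of at least 90 bases.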