💡 Quick Summary
Translate accepts a DNA sequence and converts it into a protein in the reading frame you specify. Supports the full IUPAC alphabet, the uppercase reading frame for masked sequences, direct and reverse-complement strands, and 17 NCBI genetic code tables.
📋 How to Use
- Paste a raw DNA sequence or one or more FASTA sequences into the textarea. IUPAC degenerate bases (R, Y, S, W, K, M, B, D, H, V, N) are supported. Input limit: 200,000,000 characters.
- Select the reading frame: frame 1, 2, or 3 starts translation at offset 0, 1, or 2 respectively. The uppercase frame strips all lowercase bases first and then translates the uppercase portion.
- Choose the strand: direct translates the sequence as given; reverse takes the reverse complement before translating.
- Select the genetic code appropriate for your organism (default: standard code, table 1).
- Click Run. Each sequence is output in FASTA format with a header showing the reading frame and original sequence title.
- Use Copy to copy the full output to your clipboard.
🧮 Formulas & Logic
📊 Result Interpretation
">rf {frame} {title}" — e.g., ">rf 1 sample sequence" for reading frame 1. The uppercase frame is shown as ">rf \"uppercase\" …".
Stop codons are shown as * in the protein sequence.
An X in the protein indicates a codon that could not be unambiguously assigned to an amino acid under the selected genetic code.
Each FASTA record is translated independently with the same frame, strand, and genetic code settings.
🔬 Applications
- Translating a newly sequenced coding region to check for frameshifts or premature stops
- Checking all three reading frames of a DNA fragment to find the correct ORF
- Translating a reverse-complement strand to find genes encoded on the antisense strand
- Translating repeat-masked (lowercase) sequences using the uppercase frame to isolate exonic protein
- Verifying the protein product of a synthetic gene design before ordering
⚠️ Common Mistakes & Warnings
Frame 2 skips the first base, frame 3 skips the first two bases. If your sequence has a FASTA header, it is stripped before the offset is applied.
Characters that are not in the IUPAC DNA/RNA alphabet (A, C, G, T, U, R, Y, S, W, K, M, B, D, H, V, N, and their lowercase equivalents) are stripped before translation. Numbers, spaces, and other symbols are ignored.
The uppercase frame only makes sense when the input sequence intentionally uses lowercase to mark regions you want to skip (e.g., repeat-masked sequences). If the entire sequence is lowercase, nothing will be translated.