💡 Quick Summary
Codon Usage accepts one or more DNA sequences and returns the number and frequency of each codon type. Since it also compares the frequencies of codons that encode the same amino acid (synonymous codons), you can use it to assess whether a sequence shows a preference for particular synonymous codons — a property known as codon usage bias.
📋 How to Use
- Paste a raw DNA sequence or one or more FASTA sequences into the input area. Input limit is 500,000,000 characters.
- Choose a Genetic code from the dropdown. The standard code is selected by default.
- Click Run. Each input sequence produces its own codon usage table in GCG format.
- The table columns are: AmAcid (amino acid name), Codon (triplet), Number (raw count), /1000 (frequency per 1,000 codons), Fraction (fraction of synonymous codons).
- Use the Copy button to copy the result to your clipboard. The output format is compatible with the Codon Usage Database at kazusa.or.jp.
- Click Load Example to try with two sample coding sequences using the standard genetic code.
🧮 Formulas & Logic
📊 Result Interpretation
Number of FASTA records successfully processed.
Sum of all codons counted across all sequences.
This codon dominates usage for its amino acid — strong codon preference.
This codon is rarely or never used for its amino acid in the input.
The codon appears frequently relative to total codon count.
🔬 Applications
- Assessing codon usage bias in a gene before expressing it in a heterologous host
- Comparing the codon usage of a synthetic gene against a host organism's preference
- Generating a GCG-format codon usage table to upload to the Codon Usage Database
- Identifying over- or under-represented codons in a set of genes
- Checking whether stop codon usage is consistent with the expected genetic code
⚠️ Common Mistakes & Warnings
If the cleaned sequence length is not divisible by 3, the last 1 or 2 bases are ignored. A warning is shown for each affected sequence.
Any character that is not a valid IUPAC DNA/RNA letter is removed before counting. RNA sequences (with U) are treated as DNA (U → T).
Counts are not pooled across sequences. Each FASTA record produces its own separate codon usage table.