💡 Quick Summary
Codon Plot accepts a DNA sequence and generates a text-based plot with a horizontal bar for each codon. The bar length is proportional to the codon's usage fraction in the codon frequency table you supply. Use Codon Plot to identify portions of a sequence that may be poorly expressed in your organism of interest, or to visualise a codon usage table directly.
📋 How to Use
- Paste a raw DNA sequence or one or more FASTA sequences into the Input Sequence area. Input limit is 50,000,000 characters.
- Review or replace the Codon Usage Table. The tool ships with a default E. coli table from the Codon Usage Database. You can paste any GCG-format table in its place.
- Click Run. For each codon in the sequence the tool prints the codon, its position range, its amino acid, and a bar of X characters scaled to the usage fraction.
- A bar spanning the full width represents fraction 1.0 (the only codon used for that amino acid). A short bar indicates a rarely used codon that may limit expression.
- Use the Copy button to copy the plot to your clipboard.
- Click Load Example to try a sequence containing one of each codon type with the E. coli default table.
🧮 Formulas & Logic
📊 Result Interpretation
Number of FASTA records successfully processed.
Total number of codons analysed across all sequences.
Longer bars = higher usage fraction. A full-width bar means this codon accounts for all usage of that amino acid in the table organism.
🔬 Applications
- Identifying codon-poor regions in a heterologous gene that may limit expression in the host organism
- Comparing codon usage of a synthetic gene against the host organism's preferences before synthesis
- Visualising the codon composition of a coding sequence to guide codon optimisation efforts
- Using a sequence of one of each codon type to display a codon usage table as a bar chart
⚠️ Common Mistakes & Warnings
If the cleaned sequence length is not divisible by 3, the last 1 or 2 bases are ignored and a warning is shown.
If a codon contains non-ATGC characters (after stripping and U→T substitution) it cannot be looked up. Its bar is rendered at fraction 0 and the amino acid shown as "???". Check that your sequence does not contain long runs of ambiguous IUPAC codes.
The parser strips everything before the first ".." in the table. Standard GCG-format tables from the Codon Usage Database include this marker on the header line.