💡 Quick Summary
Color Align Conservation accepts a group of aligned sequences in FASTA or GDE format and colours the alignment by conservation level. Identical residues receive a black background, similar residues a grey background, and all others remain uncoloured. Use it to visually highlight conserved regions in a sequence alignment.
📋 How to Use
- Paste pre-aligned sequences in FASTA or GDE format into the textarea. All sequences must be the same length (gaps included). Input limit: 200,000,000 characters.
- Set residues per line — controls how many alignment columns are shown per block row (default: 80).
- Set the consensus threshold — the minimum percentage of sequences that must share a residue (or belong to the same similarity group) for conservation colouring to be applied (default: 100%).
- Choose the colour mode: Background fills identical residues black and similar residues grey; Text colours the characters instead (useful for printing on white paper).
- Enter similarity groups as comma-separated letter strings (e.g.
GAVLI, FYW, CM, ST, KRH, DENQ, P). Leave empty when colouring a DNA alignment. - Optionally enter starting positions as comma-separated integers to offset the per-sequence position counter (e.g.
0, 200, 0, -1). - Click Run. Use Copy to copy the plain-text alignment.
🧮 Formulas & Logic
📊 Result Interpretation
Identical residue: all (or ≥ threshold %) sequences share the exact same character at this column.
Similar residue: all (or ≥ threshold %) sequences have a character belonging to the same physicochemical group.
Residue is neither identical nor similar across the alignment at the selected threshold.
Printed at the end of each row — the cumulative count of non-gap residues up to that point in each sequence.
🔬 Applications
- Visualising conserved residues in a multiple protein or DNA sequence alignment
- Identifying structurally or functionally important positions shared across a protein family
- Enhancing figures produced by sequence alignment programs for publication
- Comparing conservation patterns between different alignment regions
- Designing primers or probes that target highly conserved regions
⚠️ Common Mistakes & Warnings
This tool does not perform sequence alignment. All input sequences must already be aligned and must have identical lengths (including gap characters).
The default similarity groups are designed for protein alignments. When colouring a DNA alignment, clear the similarity groups field so only identical nucleotides are highlighted.
Only the first 20 characters of each FASTA title are used for display. Ensure the first 20 characters of each title are distinct enough to identify the sequence.
❓ Frequently Asked Questions
What do the similarity groups represent?
GAVLI groups small non-polar aliphatic residues. A column is coloured "similar" if all (or ≥ threshold %) sequences have residues from the same group — even if the exact residues differ.