💡 Quick Summary
Three to One converts three-letter amino acid codes (e.g. AlaCysAspGlu) to their single-letter equivalents (e.g. ACDE). Digits and whitespace in the sequence body are removed automatically before conversion. Non-standard triplets are passed through without conversion. Stop codons written as "***" are converted to "*".
📋 How to Use
- Paste a raw three-letter sequence or one or more FASTA sequences into the input area. Input limit is 100,000,000 characters.
- Click Convert. Each sequence is output as a FASTA entry with a header line that identifies the source sequence name and its first few characters.
- The Summary panel shows how many sequences were processed, how many residues were successfully converted, and how many unrecognised characters were passed through.
- Use the Copy button to copy the single-letter output to your clipboard.
- Click Load Example to try with sample sequences containing all 23 supported three-letter codes including ambiguous residues and a stop codon.
- Click Clear to reset.
🧮 Formulas & Logic
📊 Result Interpretation
Number of FASTA entries (or bare sequences) successfully converted.
Total number of recognised three-letter codes that were successfully replaced with a single-letter equivalent.
Characters remaining after conversion that did not match any known three-letter code. These pass through to the output unchanged.
🔬 Applications
- Converting three-letter protein sequences from databases or publications into FASTA single-letter format for downstream tools
- Reformatting manually annotated protein sequences for alignment or BLAST searches
- Batch-converting multiple sequences copy-pasted from papers or lab notebooks
- Verifying that a three-letter sequence encodes the expected single-letter sequence
⚠️ Common Mistakes & Warnings
Any group of three characters that does not match a recognised amino acid code (after case normalisation) is left in the output as-is. This may cause unexpected characters in the single-letter output if the input contains non-standard codes.
All digits and whitespace (spaces, tabs, newlines) in the sequence body are removed before conversion. This allows sequences with embedded numbering or line-wrapped text to be handled automatically.
The input is processed in groups of three characters, and each group is normalised to Title Case (first letter upper, next two lower) before the lookup table is consulted. This means "ALA", "ala", and "AlA" are all recognised as Alanine.