💡 Quick Summary
Mutate Protein introduces random amino acid substitutions into a protein sequence. You can set the number of mutations per sequence and choose whether the first residue (typically Met) is protected. Mutated sequences are useful as negative controls or for evaluating the significance of sequence analysis results.
📋 How to Use
- Paste a raw protein sequence or one or more FASTA sequences into the textarea. Input limit: 1,000,000,000 characters.
- Set the number of mutations to introduce into each sequence (default: 10). Multiple mutations may land on the same position.
- Choose whether the first residue can or cannot be mutated (default: cannot — preserves the start methionine).
- Click Run. Each input sequence is mutated independently and output as a FASTA record. Use Copy to copy the plain-text result.
🔬 Applications
- Generating negative-control sequences for protein sequence analysis benchmarks
- Evaluating the sensitivity of motif finders, domain scanners, or alignment tools against mutant sequences
- Producing randomised protein variants for in silico docking or binding studies
- Testing how many mutations are required to disrupt a known functional motif or binding site
⚠️ Common Mistakes & Warnings
Each mutation picks a position at random. Multiple mutations can affect the same position, so the actual number of changed residues may be less than the requested count.
Replacement residues are chosen from the 20 standard amino acids (A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y). Degenerate codes in the input (B, X, Z, etc.) are retained but are never used as replacement residues.
Characters that are not valid amino acid codes are removed from the input before processing. The mutation count applies to the cleaned sequence.