💡 Quick Summary
DNA Stats returns the number and percentage of occurrences of each nucleotide residue in the sequences you enter. It covers all 16 IUPAC single bases, all 25 G/A/T/C/N dinucleotide combinations, and key base groups including GC content and degenerate code totals. Use it to quickly compare sequence composition.
📋 How to Use
- Paste a raw DNA sequence or one or more FASTA sequences into the input area. Input limit is 500,000,000 characters.
- Click Run. Each sequence produces three sections: single residue composition, dinucleotide composition, and group composition.
- Use the Copy button to copy all results to your clipboard.
- Click Load Example to try with a balanced 50-nt sample sequence.
🧮 Formulas & Logic
📊 Result Interpretation
Number of FASTA records successfully analysed.
Sum of all cleaned sequence lengths across all records.
Count and % for each of the 16 IUPAC DNA bases (G, A, T, C, N, U, R, Y, S, W, K, M, B, D, H, V).
Count and % for all 25 G/A/T/C/N × G/A/T/C/N pairs. Overlapping dinucleotides are counted (each position counted once).
Aggregated counts for GC, AT, 2-fold degenerate bases (R,Y,S,W,K), 3/4-fold degenerate (B,H,D,V,N), and all degenerate bases combined.
🔬 Applications
- Measuring GC content to assess primer melting temperature or genomic bias
- Checking CpG (cg) dinucleotide frequency relative to other dinucleotides
- Comparing base composition between two sequences or organisms
- Assessing the frequency of degenerate IUPAC codes in a mixed population or consensus sequence
- Quality-checking raw sequencing output for unexpected base distributions
⚠️ Common Mistakes & Warnings
Any character that is not a valid IUPAC DNA/RNA letter is removed before counting. Position numbers in the output reflect the cleaned sequence.
If your sequence contains U residues they are counted under U in the single residue table. They are not converted to T. Dinucleotides involving U are not tabulated.
For a sequence of N bases there are N−1 consecutive dinucleotide positions. This is the denominator for all dinucleotide percentages.