💡 Quick Summary
Combine FASTA merges multiple FASTA sequence records into a single concatenated sequence, and reports key statistics: record count, total and average length, shortest and longest sequence, overall GC content, and detected sequence type (DNA / Protein).
📋 How to Use
- Paste one or more FASTA-formatted sequences into the input area. Each record starts with a >title line followed by sequence data.
- Click Analyze. All title lines are stripped, non-letter characters removed, and sequences concatenated in order.
- The Analysis Summary panel shows sequence statistics immediately above the combined output.
- Use the Copy button to copy the full FASTA output to your clipboard.
- Click Load Example to try the tool with two sample records.
- Click Clear to reset and start again.
🧮 Formulas & Logic
📊 Result Interpretation
The number of individual FASTA records found. A record begins with a line starting ">". Input with no ">" lines is treated as a single bare-sequence record.
Percentage of G and C residues in the combined sequence. Values in the 40–60% range are typical for most organisms.
🔬 Applications
- Calculating codon usage across a set of coding sequences when a single-sequence input is required
- Concatenating orthologous loci from multiple organisms for multi-locus phylogenetic analysis
- Merging exon sequences before ORF finding or translation
- Combining replicate reads or assembled contigs into a single entry for length statistics
- Pre-processing multi-FASTA files for tools that accept only one sequence at a time
⚠️ Common Mistakes & Warnings
Sequences are concatenated in the exact order they appear in the input. Rearrange records before analyzing if order matters for your downstream application.
Title lines (starting with ">") are stripped entirely. The output carries only a generated summary header. Note down original identifiers beforehand if they are needed later.
Digits, spaces, hyphens, asterisks, and all non-alphabetic characters are removed before concatenation. This includes numbering added by sequence editors. Check your input format if the result length is unexpectedly short.