💡 Quick Summary
Split FASTA divides FASTA sequence records into smaller fragments of a size you specify. An optional overlap value can be used to create consecutive fragments that share bases with their neighbours — useful for sliding-window analyses, read simulation, or tiling PCR design.
📋 How to Use
- Paste one or more FASTA sequences into the input area. Input limit is 500,000,000 characters.
- Set the Fragment length — the number of bases in each output fragment (default 100).
- Optionally set an Overlap — the number of bases shared between consecutive fragments (default 0). Must be less than the fragment length.
- Click Run. Each input sequence is tiled into fragments; each fragment is output as a separate FASTA entry with its position and length recorded in the title.
- The Summary panel shows the number of source sequences processed and the total fragments produced.
- Use the Copy button to copy all fragments to your clipboard.
- Click Load Example to try with a 1,000-base random sequence split into 100-base fragments.
- Click Clear to reset.
🧮 Formulas & Logic
📊 Result Interpretation
Number of FASTA source records successfully tiled.
Total number of output fragments across all source sequences.
The configured fragment size and overlap used for this run, shown as "N bp" or "N bp / M bp overlap".
🔬 Applications
- Generating tiled overlapping fragments for primer design or cloning strategies
- Simulating short sequencing reads from a reference sequence
- Creating sliding-window sub-sequences for compositional or structural analyses
- Preparing BLAST queries by breaking a long sequence into manageable chunks
- Splitting a chromosome-scale sequence into segments for upload to tools with size limits
⚠️ Common Mistakes & Warnings
Digits, whitespace, gap characters (- .), and any other non-alphabetic characters are removed from each sequence before tiling. This strips GenBank/EMBL line numbering automatically but also removes alignment gaps — use Split FASTA on ungapped sequences.
When the source sequence length is not an exact multiple of the step size (fragment length minus overlap), the final fragment will contain fewer bases than requested. Its actual length is recorded in the FASTA title.
Setting overlap equal to or greater than the fragment length would cause fragments to never advance along the sequence. The tool validates this and returns an error if the values are incompatible.