💡 Quick Summary
Range Extractor DNA accepts a DNA sequence along with a set of positions or ranges, and returns the matching bases in your choice of four formats: merged into a new sequence, as separate FASTA records, uppercased within the full sequence, or lowercased within the full sequence. Ranges support numeric positions, spans (10..20), the keywords start/end/center/length, and arithmetic expressions such as (end-10)..end.
📋 How to Use
- Paste a raw DNA sequence or one or more FASTA sequences into the top input area.
- Enter positions or ranges in the Ranges field, separated by commas. Use
x..yfor a span. You can use the keywordsstart,end,center, andlengthin place of numbers, and arithmetic expressions such as(end-10)..end. - Choose a Strand: Direct returns the bases as-is; Complement returns the reverse complement of the extracted sequence.
- Choose a Return as mode: New sequence joins all ranges into one FASTA entry. Separate FASTA records gives each range its own header. Uppercased in context or Lowercased in context returns the full sequence with the range regions changed in case.
- Click Extract. Multiple FASTA input sequences are each processed independently.
- Use Copy to copy the result to your clipboard.
- Click Load Example to try the tool with a 190-base sample sequence using the default ranges
1, 5, 10..20, end. - Click Clear to reset.
🧮 Formulas & Logic
📊 Result Interpretation
Number of FASTA records (or bare sequences) found in the input.
Number of valid position or range entries extracted from the Ranges field.
🔬 Applications
- Extracting a specific exon, promoter, or restriction site by coordinate from a genomic sequence
- Obtaining the last N bases of a sequence using the expression (end-N+1)..end
- Pulling multiple non-contiguous regions and joining them as a synthetic sequence
- Highlighting the position of a feature within its genomic context using Uppercased in context mode
- Extracting the complement strand sequence of a specified region for primer design
⚠️ Common Mistakes & Warnings
All coordinates follow the biological convention: position 1 is the first base. The sequence is converted to 0-based indices internally.
If a position or range end is less than 1 or greater than the sequence length, or if the start is greater than the stop, that range entry is skipped and a warning is shown.
In Uppercased/Lowercased context modes, complement strand applies to the entire sequence after case-marking. The highlighted regions still show their original positions within the reverse complement.
Before extraction, the input sequence is cleaned to retain only valid IUPAC DNA characters. The range positions apply to the cleaned sequence.
❓ Frequently Asked Questions
Can I extract multiple ranges at once?
1..50, 100..150, end. In "New sequence" mode all ranges are concatenated in order. In "Separate FASTA records" mode each range becomes its own FASTA entry.What do the keywords start, end, center, and length mean?
start and begin equal 1. end and stop equal the sequence length. center equals Math.round(length / 2). length also equals the sequence length. These can be used in arithmetic expressions, e.g. (center-10)..(center+10).