Summary Statistics Calculator
Mean, median, mode, standard deviation, variance, range, quartiles, IQR, skewness, and kurtosis — 14 descriptive statistics from one dataset, instantly.
💡 Quick Summary
This free online summary statistics calculator computes 14 descriptive statistics for any numeric dataset in one click: mean (arithmetic average), median (middle value), mode (most frequent value), sample standard deviation, sample variance, minimum, maximum, range, Q1 (first quartile / 25th percentile), Q3 (third quartile / 75th percentile), IQR (interquartile range), skewness, and excess kurtosis — plus a count of valid values (N) and missing values. Enter your data by typing, or paste directly from Excel, Google Sheets, or any spreadsheet. Used in biostatistics, data analysis, research, and statistics coursework.
📋 How to Use
- Enter your numbers in the grid — one value per row. You can also paste from Excel or Google Sheets: just copy a column and paste into any cell. The calculator fills all values automatically.
- Click + 10 Rows to add more rows if your dataset is larger than the default grid.
- Choose Decimal Places (2–4) to control how results are rounded.
- Click Calculate. All 14 descriptive statistics appear instantly, grouped by category: central tendency (mean, median, mode), spread (SD, variance, range, min, max), quartiles (Q1, Q3, IQR), and shape (skewness, kurtosis).
- Non-numeric entries (text, empty cells) are automatically skipped and counted as Missing — no need to clean your data first.
- Click Load Example to see results with a built-in sample dataset, or Reset to start fresh.
🔢 Formulas
📖 Interpreting Results
Add up all values and divide by n. The mean is sensitive to outliers: a single extreme value can shift it significantly. Compare it to the median to judge whether your data is skewed.
The median splits sorted data into two equal halves. It is resistant to outliers, making it the preferred measure of central tendency for skewed distributions such as income, survival times, or reaction times.
The mode is the value that occurs most often. Data can be unimodal (one mode), bimodal (two modes), or multimodal. If all values are unique, there is no mode. The mode is the only measure of central tendency applicable to categorical data.
Range = Max − Min. It captures the total spread of your data but is heavily influenced by outliers. For a more robust measure of spread, use the IQR instead.
SD measures the average distance of each data point from the mean. In a normal distribution: ~68% of data falls within ±1 SD, ~95% within ±2 SD, and ~99.7% within ±3 SD (the empirical rule). A larger SD means more variability; SD = 0 means all values are identical.
Variance = SD². It is the average squared deviation from the mean. Variance is harder to interpret directly (units are squared), but it is the foundation of many statistical tests such as ANOVA.
Q1 (25th percentile) and Q3 (75th percentile) mark the boundaries of the middle 50% of your data. The interquartile range (IQR = Q3 − Q1) is a robust measure of spread, unaffected by outliers. A common rule flags values below Q1 − 1.5×IQR or above Q3 + 1.5×IQR as potential outliers (Tukey's fence).
The five-number summary — Minimum, Q1, Median, Q3, Maximum — gives a compact picture of your data's distribution and is the basis of box plots (box-and-whisker plots).
When mean ≈ median, the distribution is roughly symmetric. Mean > median signals right (positive) skew — a few high outliers pulling the mean up. Mean < median signals left (negative) skew — a few low outliers pulling the mean down.
The distribution is approximately symmetric. Values between −0.5 and +0.5 are generally considered roughly symmetric; |skewness| > 1 is considered substantially skewed.
The right tail is longer — a few unusually large values stretch the distribution rightward. Common in income data, reaction times, and biological concentration data.
The left tail is longer — a few unusually small values stretch the distribution leftward. Common in exam scores near the maximum or age at death in a healthy population.
Tail weight is similar to a normal distribution.
Heavier tails than normal — more extreme outliers than expected. Common in financial returns and biological measurements with rare extreme events.
Lighter tails than normal — fewer extreme values. The distribution is flatter and more uniform.
⚠️ Common Mistakes
Variance and SD use Bessel's correction (dividing by n−1 instead of n). This gives unbiased estimates when your data is a sample from a larger population — the standard in biostatistics, biology, and most research. This matches SPSS, R (var(), sd()), and Excel (VAR.S, STDEV.S). If your dataset is the entire population, use population formulas (divide by n) instead.
The adjusted Fisher-Pearson skewness formula is mathematically undefined for fewer than 3 values. Excess kurtosis is undefined for fewer than 4 values. These show as "n < 3" or "n < 4" when the sample is too small.
Mode is only reported when at least one value appears more than once. If every value in your dataset is unique, there is no mode. When multiple values share the highest frequency, all are shown as modes (multimodal data).
For skewed distributions (e.g. reaction times, income, cytokine concentrations), the mean and standard deviation can be misleading. In these cases, prefer the median as the measure of center and the IQR as the measure of spread.
❓ FAQ
What is the mean in statistics and how is it calculated?
How do you calculate the median step by step?
What is mode in statistics?
What is standard deviation and how do you interpret it?
What is variance and how is it different from standard deviation?
What is the interquartile range (IQR) and how is it calculated?
How do you find Q1 and Q3 (first and third quartiles)?
quantile(type=7)). Different software may give slightly different quartile values depending on the interpolation method used.What is the five-number summary in statistics?
What is the range in statistics?
What is skewness and how do you interpret it?
What is excess kurtosis and what does it measure?
What is the difference between sample and population standard deviation?
sd(), Excel STDEV.S, and SPSS.What are measures of central tendency?
What are measures of spread (dispersion) in statistics?
What is the difference between mean and median?
Can I paste data from Excel or Google Sheets?
Why does my dataset have no mode?
Why does this calculator divide variance by n−1 instead of n?
var()), Excel (VAR.S), and Python (statistics.variance()) all use.