Average Calculator — Mean Median Mode Calculator
Calculate the mean, median, mode, range, and weighted average of any data set instantly. Enter your numbers to get comprehensive statistical analysis including variance, standard deviation, quartiles, and interquartile range — all with clear step-by-step explanations.
Separate values with commas, spaces, or new lines
What Is an Average and How Is It Calculated?
An average is a single value that represents the central or typical value of a data set. In everyday language, the word "average" usually refers to the arithmetic mean — the sum of all values divided by the number of values. However, in statistics, there are several different types of averages, each serving a specific purpose and offering unique insights into the data. Understanding these different measures of central tendency is essential for accurate data analysis and informed decision-making.
The concept of averaging dates back thousands of years. Ancient astronomers averaged multiple observations to reduce measurement errors. The arithmetic mean as a formal mathematical concept was developed in the 16th and 17th centuries. Today, averages are used in virtually every field — from calculating student grades and batting averages to analyzing economic indicators, scientific measurements, and medical research data. The mean, median, and mode together form the three pillars of descriptive statistics that summarize where the center of a data set lies.
While the arithmetic mean is the most familiar type of average, it is not always the best choice. The median may be more appropriate for skewed data, the mode for categorical data, and the geometric mean for growth rates and multiplicative processes. Choosing the right type of average depends on the nature of your data and the question you are trying to answer. Our average calculator computes all of these measures simultaneously, giving you a comprehensive view of your data's central tendency.
Beyond central tendency, understanding the spread or variability of data is equally important. Two data sets can have the same mean but very different distributions. That is why our calculator also provides the range, variance, standard deviation, and quartile values. These measures of dispersion tell you how spread out the data points are around the center, giving you a complete picture of your data's characteristics.
How to Calculate Mean, Median, Mode, and Range
The four fundamental measures of central tendency and spread can be calculated using these formulas and methods:
Mean = Σxᵢ / n (Sum of all values divided by count)To calculate the arithmetic mean, add all the values in your data set and divide by the number of values. For example, for the data set {4, 8, 6, 5, 3, 2, 8, 9, 5, 6}, the sum is 56 and there are 10 values, so the mean is 56 / 10 = 5.6.
Median = Middle value of sorted dataTo find the median, first sort all values from smallest to largest. If there is an odd number of values, the median is the middle value. If there is an even number, the median is the average of the two middle values. For {2, 3, 4, 5, 5, 6, 6, 8, 8, 9}, the two middle values are 5 and 6, so the median is (5 + 6) / 2 = 5.5.
Mode = Most frequently occurring value(s)The mode is the value that appears most often. In {4, 8, 6, 5, 3, 2, 8, 9, 5, 6}, both 5, 6, and 8 appear twice each, making this a multimodal data set. If all values appear the same number of times, there is no mode.
Range = Maximum value − Minimum valueThe range measures the spread between the largest and smallest values. For {2, 3, 4, 5, 5, 6, 6, 8, 8, 9}, the range is 9 − 2 = 7. While simple to compute, the range is sensitive to outliers since it only uses the two extreme values.
Weighted Mean = Σ(wᵢ × xᵢ) / ΣwᵢFor a weighted average, multiply each value by its corresponding weight, sum all the products, and divide by the sum of the weights. For example, if a student scores 90 (weight 3), 80 (weight 2), and 70 (weight 1), the weighted average is (90×3 + 80×2 + 70×1) / (3+2+1) = (270 + 160 + 70) / 6 = 83.3.
Types of Statistical Averages
The table below describes the different types of averages and measures of central tendency, along with their primary use cases and characteristics.
| Type | Measure |
|---|---|
| Σxᵢ / n | Mean (Arithmetic Average) |
| Middle value | Median (Middle Value) |
| Most frequent | Mode (Most Frequent Value) |
| Max − Min | Range (Spread) |
| Σ(wᵢ×xᵢ) / Σwᵢ | Weighted Average |
| (∏xᵢ)^(1/n) | Geometric Mean |
Limitations of Average Calculations
While averages are among the most widely used statistical measures, they have several important limitations that you should understand to avoid misinterpreting your data:
Mean Is Heavily Affected by Outliers
The arithmetic mean is pulled toward extreme values. In the data set {10, 12, 11, 13, 100}, the mean is 29.2, which does not represent any typical value in the set. The median of 12 provides a much better summary of the central tendency. This is why income statistics typically report the median rather than the mean — a few extremely high earners can dramatically inflate the mean income while most people earn much less. When analyzing data with potential outliers, always calculate both the mean and median to check whether they diverge significantly.
Mode May Not Exist or May Be Misleading
If every value in a data set appears the same number of times, no mode exists. Conversely, in continuous data (like precise measurements), every value may be unique, making the mode meaningless without grouping the data into intervals. Even when a mode exists, it may not be near the center of the data. In a bimodal distribution like test scores with peaks at 65 and 90, the modes tell you about clustering but not about central tendency. The mode is most meaningful with discrete or categorical data that naturally repeats.
Median Ignores the Magnitude of Values
While the median's resistance to outliers is often an advantage, it also means it ignores valuable information about extreme values. In {1, 2, 3, 4, 1000}, the median is 3, the same as in {1, 2, 3, 4, 5}. If those extreme values represent real and important data points (not errors), the median alone fails to capture them. For example, in risk analysis, ignoring extreme values can lead to underestimating potential losses. Always consider the full distribution, not just a single summary statistic.
Range Only Considers Two Extreme Values
The range uses only the maximum and minimum values, making it extremely sensitive to a single outlier while ignoring how the remaining data is distributed. Two data sets can have identical ranges but very different distributions — {1, 5, 5, 5, 10} and {1, 1, 1, 9, 10} both have a range of 9 but are distributed very differently. The interquartile range (IQR) and standard deviation provide more robust and informative measures of spread that consider all data points.
Weighted Average Requires Accurate Weights
The reliability of a weighted average depends entirely on the accuracy and appropriateness of the assigned weights. Incorrect weights produce misleading results. In GPA calculations, using wrong credit hours will yield an incorrect GPA. In survey analysis, inappropriate weighting can bias results toward certain groups. Weights should reflect genuine importance or frequency, and their rationale should be clearly documented and justified.
Central Tendency Alone Is Insufficient
No single measure of central tendency — mean, median, or mode — tells the complete story about a data set. The famous Anscombe's quartet demonstrates four data sets with nearly identical means and variances but radically different distributions when graphed. Standard deviation, quartiles, skewness, and visualization are essential complements to any average. A comprehensive data analysis should always include both measures of center and measures of spread.
Small Samples May Not Be Representative
Averages calculated from small samples can be unreliable and may not represent the larger population. A mean calculated from 5 data points has much more uncertainty than one from 500 data points. Statistical theory tells us that larger samples produce more stable and reliable estimates of population parameters. When working with small samples, be cautious about drawing broad conclusions, and consider reporting confidence intervals alongside point estimates to communicate the uncertainty in your results.
When to Use More Advanced Statistical Tools
For situations beyond simple average calculations, consider these more specialized approaches:
- •Trimmed Mean — Removes a percentage of extreme values from both ends before calculating the mean, reducing outlier influence while using more data than the median.
- •Standard Deviation & Variance — Quantify how spread out data points are from the mean, essential for understanding data variability and constructing confidence intervals.
- •Percentiles & Box Plots — Show the distribution of data at multiple points (25th, 50th, 75th percentile), providing a richer picture of data distribution than any single average.
Understanding Different Types of Averages
The term "average" encompasses several distinct mathematical concepts, each with unique properties and ideal applications. Choosing the right type of average for your data is crucial for accurate analysis and meaningful conclusions.
Arithmetic Mean
The arithmetic mean is what most people think of when they hear "average." It is calculated by summing all values and dividing by the count: Mean = (x₁ + x₂ + ... + xₙ) / n. The arithmetic mean is the balance point of a data set — if you placed equal weights at each data point on a number line, the mean is where the line would balance. This property makes it mathematically elegant and widely used in statistical formulas.
The arithmetic mean has important mathematical properties: it minimizes the sum of squared deviations from any point (making it central to least-squares regression), and the sum of deviations from the mean always equals zero. However, these properties also make it sensitive to outliers. In practice, the arithmetic mean works best for symmetric distributions like test scores, temperature readings, and measurements without extreme values.
Geometric Mean
The geometric mean is calculated by multiplying all n positive values together and taking the nth root: GM = (x₁ × x₂ × ... × xₙ)^(1/n). Equivalently, it is the exponential of the arithmetic mean of the logarithms of the values. The geometric mean is always less than or equal to the arithmetic mean (a consequence of the AM–GM inequality), with equality only when all values are identical.
The geometric mean is the correct average for multiplicative processes and rates of change. If an investment returns +10%, −5%, and +20% over three years, the average annual return is the geometric mean of 1.10, 0.95, and 1.20, which is (1.10 × 0.95 × 1.20)^(1/3) ≈ 1.0789, or about 7.89% per year. Using the arithmetic mean would overestimate the actual return. The geometric mean is also used in the Human Development Index, normalized scoring systems, and comparing ratios across different scales.
Harmonic Mean
The harmonic mean is the reciprocal of the arithmetic mean of the reciprocals: HM = n / (1/x₁ + 1/x₂ + ... + 1/xₙ). It is always the smallest of the three Pythagorean means (arithmetic ≥ geometric ≥ harmonic). The harmonic mean gives greater weight to smaller values and is appropriate when averaging rates.
The classic application of the harmonic mean is averaging speeds for equal distances. If you drive 60 km/h for 100 km and 40 km/h for 100 km, your average speed is not 50 km/h (arithmetic mean) but 48 km/h (harmonic mean), because you spend more time at the slower speed. The harmonic mean is also used in physics (combining resistors in parallel, lens equations), finance (price-earnings ratios of portfolios), and ecology (population density averaging).
Trimmed Mean (Truncated Mean)
A trimmed mean removes a fixed percentage of the smallest and largest values before computing the arithmetic mean of the remaining data. A 5% trimmed mean removes the top and bottom 5% of values. This approach combines the robustness of the median with the mathematical properties of the mean, making it less sensitive to outliers while still using most of the data.
Trimmed means are widely used in competitive scoring — Olympic diving and figure skating judges drop the highest and lowest scores before averaging. In economics, the Federal Reserve Bank of Cleveland publishes a 16% trimmed-mean CPI as a measure of core inflation. The London Interbank Offered Rate (LIBOR) historically used a 25% trimmed mean of rate submissions. When you suspect outliers but want a more nuanced measure than the median, the trimmed mean is an excellent choice.
Why You Should Calculate Averages and Understand Central Tendency
Calculating averages is fundamental to making sense of numerical data. Whether you are a student computing your GPA, a business owner analyzing monthly sales figures, a scientist summarizing experimental results, or a sports fan examining player statistics, averages transform raw numbers into meaningful summaries. The arithmetic mean provides a quick snapshot of the overall level of a data set, while the median reveals the typical value without being distorted by extremes.
In finance, averages are indispensable. Investors calculate average returns to evaluate portfolio performance. Companies track average revenue per customer to measure business health. Economists monitor average wages, consumer spending, and inflation rates to assess economic conditions. The weighted average is particularly important in finance, where portfolio returns must account for different investment sizes and bond yields must consider varying maturities and coupon rates.
In education, the mean and weighted average determine student grades, class rankings, and school performance metrics. Understanding the difference between mean and median GPA can reveal important information about grade distribution within a class. If the mean GPA is significantly higher than the median, it suggests that a few high-performing students are pulling the average up, while most students cluster at a lower level.
In scientific research, averages reduce the impact of random measurement errors and reveal underlying patterns. Researchers calculate mean values from multiple trials and use standard deviation to quantify the reliability of their measurements. The geometric mean is essential in fields like microbiology (bacterial growth rates), epidemiology (relative risk ratios), and environmental science (pollutant concentration indices). Each type of average serves a specific analytical purpose, and choosing the right one is a critical skill in data literacy.
Who Should Use an Average Calculator
Students and educators use average calculations daily. Students need to calculate their GPA, test averages, and assignment scores. Many grading systems use weighted averages where exams count more than homework or participation. Our calculator makes it easy to determine current grades and calculate what scores are needed on remaining assignments to achieve a target GPA or final grade.
Business professionals and data analysts rely on averages to summarize key performance indicators. Average transaction value, average customer lifetime, average response time, and average daily active users are metrics that drive business decisions. Understanding when to use the mean versus the median is crucial — for example, average salary at a company can be misleading if a few executives earn vastly more than most employees; the median salary provides a more representative picture.
Researchers and scientists calculate averages to summarize experimental data, compare groups, and identify trends. In clinical trials, the mean response to a treatment is compared between groups. In quality control, the average measurement and its standard deviation determine whether a manufacturing process is within specification. Environmental scientists track average temperatures, rainfall, and pollution levels over time to identify climate trends and seasonal patterns.
Athletes, coaches, and sports analysts use averages extensively to evaluate performance. Batting averages, points per game, average speed, average heart rate during training, and average split times are all critical metrics. The moving average — an average calculated over a rolling window of recent data — helps identify performance trends while smoothing out day-to-day variability. Our calculator handles all these use cases, from simple classroom averages to weighted professional analytics.
Average Calculation Methods Compared
Multiple types of averages exist, each with distinct strengths and weaknesses. The table below compares the most common methods to help you choose the right one for your data.
| Average Type | How It Is Calculated | Best For | Limitations |
|---|---|---|---|
| Arithmetic Mean (This Calculator) | Sum of all values divided by count | General-purpose averaging; symmetric data; grades and scores | Heavily influenced by outliers; misleading for skewed data |
| Median | Middle value of sorted data | Skewed data; income and prices; resistant to outliers | Ignores magnitude of values; less mathematically tractable |
| Mode | Most frequently occurring value | Categorical data; identifying popular items; quality control | May not exist; not unique; not useful for continuous data |
| Weighted Mean | Sum of (value × weight) divided by sum of weights | GPA calculations; portfolio returns; survey data with sampling weights | Requires reliable weights; still sensitive to outliers |
| Geometric Mean | Nth root of the product of n values | Growth rates; investment returns; ratios and percentages | Cannot handle zero or negative values; harder to interpret intuitively |
| Harmonic Mean | Reciprocal of arithmetic mean of reciprocals | Averaging rates; speed over equal distances; P/E ratios | Cannot handle zero values; gives heavy weight to smallest values |
Arithmetic Mean (This Calculator)
- How It Is Calculated
- Sum of all values divided by count
- Best For
- General-purpose averaging; symmetric data; grades and scores
- Limitations
- Heavily influenced by outliers; misleading for skewed data
Median
- How It Is Calculated
- Middle value of sorted data
- Best For
- Skewed data; income and prices; resistant to outliers
- Limitations
- Ignores magnitude of values; less mathematically tractable
Mode
- How It Is Calculated
- Most frequently occurring value
- Best For
- Categorical data; identifying popular items; quality control
- Limitations
- May not exist; not unique; not useful for continuous data
Weighted Mean
- How It Is Calculated
- Sum of (value × weight) divided by sum of weights
- Best For
- GPA calculations; portfolio returns; survey data with sampling weights
- Limitations
- Requires reliable weights; still sensitive to outliers
Geometric Mean
- How It Is Calculated
- Nth root of the product of n values
- Best For
- Growth rates; investment returns; ratios and percentages
- Limitations
- Cannot handle zero or negative values; harder to interpret intuitively
Harmonic Mean
- How It Is Calculated
- Reciprocal of arithmetic mean of reciprocals
- Best For
- Averaging rates; speed over equal distances; P/E ratios
- Limitations
- Cannot handle zero values; gives heavy weight to smallest values
Practical Guide to Calculating and Using Averages
Whether you are calculating grades, analyzing business data, or solving math problems, here are practical tips for choosing and computing the right average for your needs.
How to Calculate Your Grade Average (GPA)
- •List each course with its grade points and credit hours. Multiply each grade point by its credit hours to get quality points. Sum all quality points and divide by total credit hours for your weighted GPA.
- •Example: A (4.0) in a 4-credit course and B (3.0) in a 3-credit course yields (4.0×4 + 3.0×3) / (4+3) = (16+9) / 7 = 3.57 GPA. Use our weighted average mode to calculate this automatically.
- •To find the minimum grade needed on a final exam to achieve a target course grade, set up the weighted average equation with the target grade and solve for the unknown. Many courses weight the final exam at 20–40% of the total grade.
- •Remember that cumulative GPA includes all courses across all semesters. To calculate what semester GPA you need to raise your cumulative GPA to a target, use the formula: Required Semester GPA = (Target × Total Credits − Current Quality Points) / Semester Credits.
Using Averages in Business Analysis
- •Track average transaction value (ATV) to measure customer spending patterns. A rising ATV indicates customers are spending more per visit, while a declining ATV may signal pricing issues or changing product mix.
- •Use median rather than mean for employee salary analysis. The mean salary is typically inflated by executive compensation. The median gives a more accurate picture of what the typical employee earns.
- •Calculate moving averages to identify trends in time-series data. A 7-day moving average of daily sales smooths out day-of-week effects, while a 30-day moving average reveals monthly trends. The choice of window size depends on the periodicity of your data.
Choosing the Right Average for Your Data
- •Use the arithmetic mean for symmetric data without outliers — test scores, height measurements, daily temperatures. If your data histogram looks roughly bell-shaped, the mean is the best summary.
- •Use the median for skewed data with outliers — income levels, home prices, response times, hospital lengths of stay. If the mean and median differ substantially, your data is skewed and the median is more representative.
- •Use the geometric mean for multiplicative data — compound growth rates, investment returns, bacterial population growth, index numbers. If your data involves percentages or ratios being multiplied together, the geometric mean is the correct average.
- •Use the mode for categorical data — survey responses, product preferences, diagnostic codes. The mode is the only average that works with non-numeric data and is useful for identifying the most common category.
Quick Reference: When to Use Which Average
- •Symmetric data, no outliers → Arithmetic Mean. This is the default choice and works well for most everyday calculations.
- •Skewed data, outliers present → Median. When you see a large gap between the mean and median, report the median.
- •Growth rates, returns, ratios → Geometric Mean. Always use this for calculating average percentage changes over time.
- •Rates with fixed distance/quantity → Harmonic Mean. Speed over equal distances, prices for equal quantities.
Common Mistakes to Avoid
Never average percentages or rates directly — use the weighted average or geometric mean depending on the context. For example, averaging 10% and 20% growth rates as 15% is wrong; the correct approach uses the geometric mean. Similarly, never average averages of groups with different sizes without weighting by group size. Always verify that your data is clean (no entry errors) and that your chosen average type matches the nature of your data and the question you are trying to answer.
Important Notes on Average Calculations
While averages are powerful summary statistics, they can also be misleading if used incorrectly. The arithmetic mean is particularly sensitive to outliers — extreme values that differ significantly from the rest of the data. A single outlier can dramatically shift the mean, making it unrepresentative of the typical value. Always examine your data for outliers and consider using the median when they are present.
Keep in mind when interpreting averages:
- The mean can be misleading for skewed distributions — consider using the median for income, home prices, and other right-skewed data
- A data set can have no mode, one mode, or multiple modes — the mode is most useful for categorical or discrete data
- The range only considers the two extreme values and tells you nothing about how data is distributed between them
- Weighted averages require accurate and meaningful weights — incorrect weights produce unreliable results
- Small sample sizes can produce averages that are not representative of the larger population
For a complete understanding of your data, always look beyond the average. Examine the standard deviation to understand variability, check the quartiles to see how data is spread, and visualize the distribution when possible. An average alone never tells the full story — as the classic joke illustrates, a statistician with one foot in boiling water and one foot in ice water is, on average, comfortable.
Frequently Asked Questions About Averages and Central Tendency
In everyday usage, "average" and "mean" are often used interchangeably, and both typically refer to the arithmetic mean — the sum of all values divided by the number of values. However, in statistics, "average" is a broader term that can refer to any measure of central tendency, including the mean, median, and mode. The arithmetic mean is just one type of average. When a teacher says "class average," they almost always mean the arithmetic mean. When a news report says "average income," they should specify whether they mean the mean or median, as the two can differ significantly for income data. In mathematical and statistical contexts, it is best to use the precise term (mean, median, or mode) rather than the ambiguous word "average" to avoid confusion.
Here is a step-by-step guide using the data set {12, 7, 3, 14, 6, 8, 7, 12, 7, 15}. For the mean: add all values (12+7+3+14+6+8+7+12+7+15 = 91) and divide by the count (10), giving a mean of 9.1. For the median: sort the values {3, 6, 7, 7, 7, 8, 12, 12, 14, 15}. With 10 values (even count), the median is the average of the 5th and 6th values: (7+8)/2 = 7.5. For the mode: count how often each value appears — 7 appears three times, 12 appears twice, and all others appear once. The mode is 7. For the range: subtract the minimum from the maximum: 15 − 3 = 12. These four measures together give you a comprehensive summary of the data's center and spread.
Use the median instead of the mean whenever your data is skewed or contains outliers. The most common example is income data: in a company where 9 employees earn $50,000 and the CEO earns $500,000, the mean salary is $95,000 — which overstates what most employees earn. The median salary of $50,000 is far more representative. Other cases where the median is preferred include: home prices in a market with a few luxury properties, response times for a service that occasionally has extreme delays, hospital length-of-stay data, wealth distribution, and any data set where a few extreme values pull the mean away from the center. As a general rule, if the mean and median differ by more than 10–15%, the data is likely skewed and the median may be the better summary measure.
A data set has no mode when every value appears the same number of times. In {2, 4, 6, 8, 10}, each value appears exactly once, so there is no mode. This is common in data sets with continuous measurements or small samples. When there is no mode, it simply means that no single value is more typical or frequent than any other. This does not indicate a problem with the data — it just means the mode is not a useful measure for that particular data set. In such cases, the mean and median are more informative. Some textbooks or tools may report "no mode" or list all values as modes, but the standard statistical convention is to state that the mode does not exist. For continuous data, modes are typically found by grouping data into intervals (bins) and identifying the interval with the highest frequency.
To calculate a weighted average for grades, multiply each grade by its weight (or credit hours), sum all the products, and divide by the total weight. For example, consider a student with these grades: Math (A = 4.0, 4 credits), English (B+ = 3.3, 3 credits), Science (A− = 3.7, 4 credits), and Art (B = 3.0, 2 credits). The weighted average is: (4.0×4 + 3.3×3 + 3.7×4 + 3.0×2) / (4+3+4+2) = (16.0 + 9.9 + 14.8 + 6.0) / 13 = 46.7 / 13 = 3.59 GPA. The weights ensure that courses with more credit hours have proportionally more impact on the GPA. Use our weighted average calculator mode to enter each grade and its credit hours for instant results.
In a perfectly symmetric distribution (like a normal bell curve), the mean, median, and mode are all equal and located at the center. In a right-skewed (positively skewed) distribution, the tail extends to the right, and the relationship is typically: mode < median < mean. The mean is pulled toward the longer tail because it is influenced by the high extreme values. Income distribution is a classic right-skewed example — most people earn moderate incomes while a few earn very high incomes. In a left-skewed (negatively skewed) distribution, the tail extends to the left, and the relationship reverses: mean < median < mode. An example is exam scores when most students score high but a few score very low. This empirical relationship (Karl Pearson's rule) is approximate and may not hold for all distributions, but it is a useful guideline for understanding data shape from summary statistics.
Outliers have a dramatic effect on the mean but virtually no effect on the median. Consider the data set {20, 22, 23, 24, 25, 26, 28}: the mean is 24.0 and the median is 24. Now add an outlier to get {20, 22, 23, 24, 25, 26, 28, 200}: the mean jumps to 46.0 (nearly doubling), while the median only changes to 24.5. This happens because the mean uses every value in its calculation, so one extreme value directly shifts the result. The median only cares about the middle position, so an outlier merely adds one more value above or below the center without affecting which values are in the middle. This property is called resistance or robustness. Because of this, the median is the preferred measure for data sets with potential outliers, including income, home prices, and wait times.
The arithmetic mean adds all values and divides by the count: AM = (x₁ + x₂ + ... + xₙ) / n. The geometric mean multiplies all values and takes the nth root: GM = (x₁ × x₂ × ... × xₙ)^(1/n). The key difference is that the arithmetic mean is appropriate for additive data (values that combine by addition), while the geometric mean is appropriate for multiplicative data (values that combine by multiplication). For investment returns, this distinction is critical: if a stock returns +50% one year and −50% the next, the arithmetic mean return is 0%, suggesting you broke even. But $100 × 1.50 × 0.50 = $75 — you actually lost 25%. The geometric mean correctly gives (1.50 × 0.50)^(1/2) ≈ 0.866, indicating an average annual loss of about 13.4%. The geometric mean is always less than or equal to the arithmetic mean (AM–GM inequality), with equality only when all values are identical.
Yes, a data set can have more than one mode. When a data set has exactly one mode, it is called unimodal. When it has two modes, it is called bimodal. When it has three or more modes, it is called multimodal. For example, {1, 2, 2, 3, 4, 4, 5} is bimodal with modes 2 and 4 (both appear twice). A bimodal distribution often indicates that the data contains two distinct groups or subpopulations — for instance, adult heights might show modes near 165 cm and 178 cm, reflecting the difference between female and male averages. Multimodal distributions can arise from survey data with multiple popular responses, manufacturing defects at specific values, or combined data from several distinct sources. Identifying multiple modes helps reveal hidden structure in your data that a single mean or median would obscure.
When a data set has an even number of values, there is no single middle value. Instead, the median is calculated as the average of the two middle values. Here is the step-by-step process: First, sort all values from smallest to largest. Then identify the two middle positions — for n values, these are positions n/2 and (n/2 + 1). Finally, average those two values. For example, in {3, 7, 9, 12, 15, 18} (6 values), the two middle values are at positions 3 and 4: values 9 and 12. The median is (9 + 12) / 2 = 10.5. Note that the median does not have to be a value that actually appears in the data set — in this example, 10.5 is not one of the original values. For an odd number of values, the median is simply the value at position (n+1)/2. For example, in {3, 7, 9, 12, 15} (5 values), the median is the 3rd value: 9.