Descriptive Statistics

Descriptive Statistics

Descriptive statistics are brief descriptive coefficients that summarize a given data set, which can be either a representation of the entire population or a sample of a population. The range of that data set is 95, which is calculated by subtracting the lowest number (5) in the data set from the highest (100). Descriptive statistics are used to describe or summarize the characteristics of a sample or data set, such as a variable's mean, standard deviation, or frequency. For example, while the measures of central tendency may give a person the average of a data set, it does not describe how the data is distributed within the set. 1:36 Descriptive statistics, in short, help describe and understand the features of a specific data set by giving short summaries about the sample and measures of the data. A person analyzes the frequency of each data point in the distribution and describes it using the mean, median, or mode, which measures the most common patterns of the analyzed data set.

Descriptive statistics summarizes or describes the characteristics of a data set.

What Are Descriptive Statistics?

Descriptive statistics are brief descriptive coefficients that summarize a given data set, which can be either a representation of the entire population or a sample of a population. Descriptive statistics are broken down into measures of central tendency and measures of variability (spread). Measures of central tendency include the mean, median, and mode, while measures of variability include standard deviation, variance, minimum and maximum variables, kurtosis, and skewness.

Descriptive statistics summarizes or describes the characteristics of a data set.
Descriptive statistics consists of two basic categories of measures: measures of central tendency and measures of variability (or spread).
Measures of central tendency describe the center of a data set.
Measures of variability or spread describe the dispersion of data within the set.

Understanding Descriptive Statistics

Descriptive statistics, in short, help describe and understand the features of a specific data set by giving short summaries about the sample and measures of the data. The most recognized types of descriptive statistics are measures of center: the mean, median, and mode, which are used at almost all levels of math and statistics. The mean, or the average, is calculated by adding all the figures within the data set and then dividing by the number of figures within the set.

For example, the sum of the following data set is 20: (2, 3, 4, 5, 6). The mean is 4 (20/5). The mode of a data set is the value appearing most often, and the median is the figure situated in the middle of the data set. It is the figure separating the higher figures from the lower figures within a data set. However, there are less common types of descriptive statistics that are still very important.

People use descriptive statistics to repurpose hard-to-understand quantitative insights across a large data set into bite-sized descriptions. A student's grade point average (GPA), for example, provides a good understanding of descriptive statistics. The idea of a GPA is that it takes data points from a wide range of exams, classes, and grades, and averages them together to provide a general understanding of a student's overall academic performance. A student's personal GPA reflects their mean academic performance.

Types of Descriptive Statistics

All descriptive statistics are either measures of central tendency or measures of variability, also known as measures of dispersion.

Central Tendency

Measures of central tendency focus on the average or middle values of data sets, whereas measures of variability focus on the dispersion of data. These two measures use graphs, tables and general discussions to help people understand the meaning of the analyzed data.

Measures of central tendency describe the center position of a distribution for a data set. A person analyzes the frequency of each data point in the distribution and describes it using the mean, median, or mode, which measures the most common patterns of the analyzed data set.

Measures of Variability

Measures of variability (or the measures of spread) aid in analyzing how dispersed the distribution is for a set of data. For example, while the measures of central tendency may give a person the average of a data set, it does not describe how the data is distributed within the set.

So while the average of the data maybe 65 out of 100, there can still be data points at both 1 and 100. Measures of variability help communicate this by describing the shape and spread of the data set. Range, quartiles, absolute deviation, and variance are all examples of measures of variability.

Consider the following data set: 5, 19, 24, 62, 91, 100. The range of that data set is 95, which is calculated by subtracting the lowest number (5) in the data set from the highest (100).

Why do we need statistics that simply describe data?

Descriptive statistics are used to describe or summarize the characteristics of a sample or data set, such as a variable's mean, standard deviation, or frequency. Inferential statistics can help us understand the collective properties of the elements of a data sample. Knowing the sample mean, variance, and distribution of a variable can help us understand the world around us.

What are mean and standard deviation?

These are two commonly employed descriptive statistics. Mean is the average level observed in some piece of data, while standard deviation describes the variance, or how dispersed the data observed in that variable is distributed around its mean.

Can descriptive statistics be used to make inference or prediction?

No. While these descriptives help understand data attributes, inferential statistical techniques — a separate branch of statistics — are required to understand how variables interact with one another in a data set.

Related terms:

Bell Curve

A bell curve describes the shape of data conforming to a normal distribution. read more

Business Valuation , Methods, & Examples

Business valuation is the process of estimating the value of a business or company. read more

Kurtosis

Kurtosis is a statistical measure used to describe the distribution of observed data around the mean. It is sometimes referred to as the "volatility of volatility."  read more

Mean

The mean is the mathematical average of a set of two or more numbers that can be computed with the arithmetic mean method or the geometric mean method. read more

Median

The median is the middle number in a sorted, ascending or descending, list of numbers and can be more descriptive of that data set than the average. read more

Mode

The mode is a statistical term that refers to the most frequently occurring number found in a set of numbers. read more

Quartile

A quartile is a statistical term describing a division of a data set into four defined intervals. read more

Skewness , Formula, & Calculation

Skewness refers to distortion or asymmetry in a symmetrical bell curve, or normal distribution, in a set of data. read more

Standard Error

The standard error is the standard deviation of a sample population. It measures the accuracy with which a sample represents a population. read more

Statistics

Statistics is the collection, description, analysis, and inference of conclusions from quantitative data. read more