What type of statistics is used in presenting organizing and summarizing data?

Population, Sample and Data
Section 4.1

Statistics is the science of collecting, organizing and summarizing data such that valid conclusions can be made from them.  The collecting, organizing and summarizing part is called �descriptive statistics�, while making valid conclusions is inferential statistics.

Population Data vs. Sample Data

Population: the universal set of all objects under study.

Sample: Any subset of the population.

A large population may be impractical and costly to study, collecting data from every member of the population.  A sample is more manageable and easier to study.

After collecting and organizing the data, a summary is made such as average values.  Hopefully valid conclusions can be made on the whole population based on the sample data.  Therefore it is important that the sample data collected be representative of the population.  Otherwise conclusions may be invalid.  Conclusions are only as reliable as the sampling process, and information can change from sample to sample.

Collecting: data points; each element in a set of data.

Organizing: frequency distribution; a chart that lists each data point with the number of times it occurs.

Relative Frequency: expressed as a percent of the total number of data points.

Example 1: 

What is your major? 

Major Tally Frequency Relative Frequency
Liberal Arts       
Physical Ed.      
Education      
Undeclared      
Other      

Only a few distinct data are found which are repeated.  Charting the data is easier to compute frequency.

 

Grouped Data:

 When data points consist of many different values, group them by taking the

largest value � smallest value = range.  When the range is established for the data points, decide how many groups to form, usually 4-8 groups.   Divide range by the number of groups wanted to get endpoints of intervals.

Example 2: Age of class. (Hypothetical)

16        21        18        22        19        20        19        21        26        30        27        25        20

24        21        20        29        19        32        35        20        19        18        21        23        25

Range = 35 � 16 = 19          n = 26 total                 5 groups         19/5 ~ 4

X = age tally frequency  relative frequency
16
What type of statistics is used in presenting organizing and summarizing data?
 x < 20
\\\\\\\ 7 7/26    ~          .269
20
What type of statistics is used in presenting organizing and summarizing data?
 x  < 24
\\\\\\\\\\ 10 10/26  ~          .385
24
What type of statistics is used in presenting organizing and summarizing data?
 x < 28
\\\\\ 5 5/26    ~          .192
28
What type of statistics is used in presenting organizing and summarizing data?
 x < 32
\\ 2 2/26    ~          .077
32
What type of statistics is used in presenting organizing and summarizing data?
 x < 36
\\ 2 2/26   ~           .077

When raw data consists of many different values, create intervals and work with grouped data.  Not all charts must begin with the smallest data point value; a smaller value can be used.

Histograms: Bar chart of grouped data.

If groups are not of equal intervals (width) then relative frequency is not accurate (visually). 

Relative frequency density:  rfd =    f/n   Dx         (delta x) is width of the interval
            x

RFD only used when widths are unequal.  It gives a more truthful representation of distribution with respect to the vertical axis.

Pie Charts Used for categorical data: grouped by a common feature or quality, usually financial expenditures.  Easier to visualize the whole and its parts.

Each slice is representative in size.  360* represents the whole pie, then a slice is  a central angle portion.  Take rf % and multiply by 360* to get angle measurement.

Using EX 1 data, create pie chart. 


Back to Statistics Main Page

Back to the Survey of Math Ideas Home Page

Back to the Math Department Home Page

e-mail Questions and Suggestions

What type of statistics is used to summarize and organize data?

Descriptive statistics summarize and organize characteristics of a data set. A data set is a collection of responses or observations from a sample or entire population.

What statistics are used to summarize data?

Variance and Standard Deviation If there are no extreme or outlying values of a variable, the mean is the most appropriate summary of a typical value, and to summarize variability in the data we specifically estimate the variability in the sample around the sample mean.

What type of statistics involves methods for organizing summarizing analyzing?

Descriptive statistics involves methods of organizing, picturing and summarizing information from data. Inferential statistics involves methods of using information from a sample to draw conclusions about the population.