Purpose
NHANES data are often used to provide national estimates on important public health issues. This module shows how to generate the descriptive statistics most often used to obtain these estimates. It covers checking frequency distributions and normality, and generating percentiles, means, and proportions.
Task 1: Check Frequency Distribution and Normality
Before starting any analysis, examine the frequency distribution and normality of the data. These descriptive statistics help you determine whether parametric or non-parametric methods are appropriate, and whether you need to recode or transform the data to account for extreme values and outliers.
- Key Concepts about Checking Frequency Distribution and Normality
- How to Check Frequency Distribution and Normality in SAS
- How to Check Frequency Distribution and Normality in Stata
- Download Sample Code and Datasets
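As a rough illustration of what a distribution check looks for, the following Python sketch tabulates frequencies and computes a simple skewness statistic. It is unweighted and uses made-up values, not NHANES data; the function names `frequency_table` and `skewness` are hypothetical and are not part of the SAS or Stata programs this task links to.

```python
from collections import Counter
from statistics import mean, pstdev


def frequency_table(values):
    """Return (value, count, percent) rows sorted by value."""
    counts = Counter(values)
    n = len(values)
    return [(v, c, 100.0 * c / n) for v, c in sorted(counts.items())]


def skewness(values):
    """Population skewness: the mean of cubed z-scores.

    Values near 0 suggest a symmetric distribution; large positive
    values suggest a long right tail.
    """
    m = mean(values)
    s = pstdev(values)
    return sum(((x - m) / s) ** 3 for x in values) / len(values)


# Hypothetical measurements (assumption: illustrative only).
symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 1, 2, 2, 3, 20]

for row in frequency_table(right_skewed):
    print(row)
print(skewness(symmetric), skewness(right_skewed))
```

A clearly positive skewness, as for the second list, is the kind of result that might prompt a transformation or a non-parametric method. An analysis of real NHANES data would also need to apply the sample weights.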
Task 2: Generate Percentiles
Percentiles indicate the relative position of an individual within a given dataset. Frequency distributions and percentiles can also be used to describe the characteristics of a distribution and to check for outliers.
Although SAS 9.1 and Stata have commands for calculating weighted percentile estimates, they do not have commands that directly produce standard errors for those percentiles. For this reason, this tutorial does not provide sample SAS 9.1 or Stata programs for percentiles and their standard errors. The SAS 9.2 survey procedures support variance estimation for percentiles using the Woodruff method; see the SAS 9.2 documentation for details.
- Key Concepts about Generating Percentiles
- How to Generate Percentiles Using SUDAAN
- Download Sample Code and Datasets
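To make the idea of a weighted percentile concrete, here is a minimal Python sketch. It assumes one common definition: the smallest value at which the cumulative share of the weights reaches p percent. SUDAAN and other packages may use a different definition or interpolate between values, so results can differ. The function name `weighted_percentile` and the data are hypothetical, and the sketch does not estimate standard errors.

```python
def weighted_percentile(values, weights, p):
    """Smallest value whose cumulative weight share is at least p percent."""
    if not 0 < p <= 100:
        raise ValueError("p must be in the interval (0, 100]")
    pairs = sorted(zip(values, weights))
    total = sum(w for _, w in pairs)
    target = total * p / 100.0
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight
        if cumulative >= target:
            return value
    return pairs[-1][0]  # guard against floating-point shortfall


# Hypothetical values and sample weights (assumption: illustrative only).
values = [10, 20, 30, 40]
equal_weights = [1, 1, 1, 1]
unequal_weights = [1, 1, 1, 5]

print(weighted_percentile(values, equal_weights, 50))
print(weighted_percentile(values, unequal_weights, 50))
```

With equal weights the median falls at the second value; when the last observation represents far more of the population, the weighted median shifts toward it.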
Task 3: Generate Means
Means are used to estimate the average of a particular variable of interest, e.g., the average total cholesterol level or the average systolic blood pressure level in a given population.
- Key Concepts about Means
- How to Generate Means Using SUDAAN
- How to Generate Means Using SAS Survey Procedures
- How to Generate Means Using Stata
- Download Sample Code and Datasets
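The point estimate behind a survey-weighted mean can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `weighted_mean` and the data are hypothetical, and the sketch produces no standard errors, which require the survey design information handled by SUDAAN, the SAS survey procedures, or Stata.

```python
def weighted_mean(values, weights):
    """Weighted mean: sum(w * x) / sum(w)."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(v * w for v, w in zip(values, weights)) / total


# Hypothetical cholesterol values and sample weights
# (assumption: illustrative only, not NHANES data).
cholesterol = [180, 200, 220]
sample_weights = [1000, 3000, 1000]

print(weighted_mean(cholesterol, sample_weights))
```

Each weight can be read as the number of people in the population that a sampled person represents, so observations with larger weights pull the estimate toward their values.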
Task 4: Generate Proportions
Proportions are used for prevalence estimates of an event or trait, e.g., the prevalence of persons with high blood pressure (HBP) in the U.S.
- Key Concepts about Proportions
- How to Generate Proportions Using SUDAAN
- How to Generate Proportions Using SAS Survey Procedures
- How to Generate Proportions Using Stata
- Download Sample Code and Datasets
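A weighted prevalence estimate is the share of the total weight carried by the people who have the trait. The Python sketch below shows that calculation only; the function name `weighted_proportion` and the data are hypothetical, and no standard errors are computed.

```python
def weighted_proportion(has_trait, weights):
    """Share of total weight belonging to observations with the trait."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    with_trait = sum(w for flag, w in zip(has_trait, weights) if flag)
    return with_trait / total


# Hypothetical high blood pressure indicators and sample weights
# (assumption: illustrative only, not NHANES data).
hbp = [True, False, False, True]
sample_weights = [500, 1500, 1000, 1000]

print(weighted_proportion(hbp, sample_weights))
```

Here half of the sampled persons have the trait, but they carry less than half of the total weight, so the weighted prevalence is lower than the unweighted one.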
Contact Us:
- National Center for Health Statistics
3311 Toledo Rd
Hyattsville, MD 20782
- 1 (800) 232-4636
- cdcinfo@cdc.gov