How to Identify and Describe the Impact of Influential Outliers

Before you analyze your data, it is very important that you examine the data for the presence of outlying values.

Check for Outliers by Running a Univariate Analysis

Use the PROC UNIVARIATE procedure to get all default descriptive statistics such as mean, minimum and maximum values, standard deviation, and skewness. Use the VAR statement to identify the variables of interest (ALLMEAN_CNT and ALLMEAN_MV). Use the ID statement to list the sequence numbers associated with extreme values in the output.

Sample Code

proc univariate data =paxmstr normal plot ;
 var allmean_cnt;
 where ridageyr > 20

and ridageyr