SAS Studio: Finding prevalence with pointing and clicking

ByAnnMaria De Mars February 24, 2016

Policy makers have very good reason for wanting to know how common a condition or disease is. It allows them to plan and budget for treatment facilities, supplies of medication, rehabilitation personnel. There are two broad answers to the question, “How common is condition X?” and, interestingly, both of these use the exact same SAS procedures. Prevalence is the number of persons with a condition divided by the number in the population. It’s often given as per thousand, or per 100,000, depending on how common the condition is. Prevalence is often referred to as a snapshot. It’s how many people have a condition at any given time.

Just for fun, let’s take a look at how to compute prevalence with SAS Studio.

Step 1: Access your data set

First, assign a libname so that you can access your data. To do that, you create a new SAS program by clicking on the first tab in the top menu and selecting SAS Program.

libname mydata "/courses/number/number/" access=readonly;

(Students only have readonly access to data sets in the course directory. This prevents them from accidentally deleting files shared by the whole class. As a professor with many years of experience, let me just tell you that this is a GREAT idea.)

Click on the little running guy at the top of your screen and, voila, your LIBNAME is assigned and the directory is now available for access.

(Didn’t believe me there is a little running guy that means “run”? Ha!)

Next, in the left window pane, click on Tasks and in the window to the right, click on the icon next to the data field.

From the drop down menu of directories, select the one with your data and then click on the file you need to analyze.

Step 2: Select the statistic that you want and then select the variable. In this case, I selected one-way frequencies, and one cool thing is that SAS will automatically show you ONLY the roles you need for a specific test. If you were doing a two-sample t-test, for example, it would ask for you groups variable and your analysis variable. Since I am doing a one-way frequency, there is only an analysis variable.

When you click on the plus next to Analysis Variables, all of the variables in your data set pop up and you can select which you want to use. Then, click on your little running guy again, and voila again, results.

So … the prevalence of diabetes is about 11% of the ADULT population in California, or about 110 per 1,000.

You can also code it very simply if you would like:
libname mydata “/courses/number/number/” access=readonly;

PROC FREQ DATA = mydata.datasetname ;

TABLE variable ;

Of course, all of this assumes that your data is cleaned and you have a binary variable with has disease/ doesn’t have disease, which is a pretty large assumption.

Now, curiously, the code above is the exact SAME code we used to compute incidence of Down syndrome a few weeks ago. What’s up with that and how can you use the exact same code to compute two different statistics?

Patience, my dear. That is a post for another day.

Software | statistics | Technology

Logistic regression using SAS On-Demand with SAS Enterprise Guide – a movie and a rant
ByAnnMaria De Mars December 6, 2012

If you have a mad desire to do logistic regression with SAS On-Demand with SAS Enterprise Guide, here is a movie that shows how to do it. It is a .avi file so you may want to just download it and run it on your PC. Here is why the movie is not all that…

Read More Logistic regression using SAS On-Demand with SAS Enterprise Guide – a movie and a rant
Software | Technology

Watch me work: Compress Function for Test Scoring
ByAnnMaria De Mars February 5, 2016February 9, 2016

Did you ever fill out one of those online forms where you kept trying to submit it and got messages like, “You need to enter your phone number in the format 311-234-12234” or You cannot have any special characters in this field. That one really irritates me because, in fact, my last name has a…

Read More Watch me work: Compress Function for Test Scoring
Grantwriting | Software | statistics | The Julia Group

Discovering if your data blow with help from SAS Enterprise Guide
ByAnnMaria De Mars August 4, 2009August 4, 2009

“Is there anything you can do to help? I’d kill you but there is a law against it. You’d better leave before I figure out a way around that.” This comment was made by a co-worker of mine who had saved all of the data for his thesis for a masters in computer science on…

Read More Discovering if your data blow with help from SAS Enterprise Guide
statistics

What’s epidemiology? A definition with a side of SAS
ByAnnMaria De Mars January 5, 2016

I’ll be teaching a graduate course in epidemiology in the spring and giving a talk on biostatistics at SAS Global Forum in April, so I thought I’d jump ahead and start rambling on about it now. When I tell people that I teach epidemiology, the first question I usually get is, What’s epidemiology? In short,…

Read More What’s epidemiology? A definition with a side of SAS
Software | statistics | Technology

Quantifying Disease with SAS – who knew?
ByAnnMaria De Mars December 1, 2013

This month, I’m teaching biostatistics for National University, and so far I am really enjoying it. There is just a really minor problem, though. While I received a copy of the textbook, I did not receive a copy of the instructor’s manual with answers to the homework problems. Since I am going to grade 20…

Read More Quantifying Disease with SAS – who knew?
statistics

What I Do When One Person Might Change My Results
ByAnnMaria De Mars January 13, 2018

In a previous post, I asked what you would do if one person’s score changed your results? Would you throw them out? Leave them in? Does it depend on whether they support your hypothesis or not? A few people suggested collecting more data and I completely agree with their very valid points that if one…

Read More What I Do When One Person Might Change My Results

3 Comments

E says:

February 24, 2016 at 1:34 pm

Curious- how current are the data sets that you are able to access?
Annmaria says:

February 25, 2016 at 5:02 am

The data used here was the 2011 California Health Interview Survey.
Pingback: SAS Studio – Import Excel with Tasks & Utilities : AnnMaria's Blog

Similar Posts

3 Comments

Leave a Reply