statistics

What’s the first thing you tell students about statistics?

ByAnnMaria De Mars November 22, 2013November 22, 2013

I’m looking forward to teaching my first masters level course in a lo-o-ng time next week. Since this may be the first course students take in their masters program, the question I’m faced with is,

“What would you tell someone at the very beginning of learning about statistics?”

I’m starting with this:

Bias = bad

Bias is to statisticians as sin is to preachers. We’re against it.

Bias is SYSTEMATIC error. While it is generally impossible to avoid error, in an unbiased study, error will be random.

Random = good

If error is random, we would be equally likely to err in one direction as the other, and so, on the average, would get the correct result. For example, if I was evaluating fighters to decide if they really did have brain damage as a result of being hit in the head too many times, in some borderline cases I might incorrectly decide the fighter was fine when, in fact, there was some minimal brain damage. In other cases, I might decide the person had damage, when he or she was just somewhat on the low side of the bell curve in terms of functioning brain cells. On the average, though, those errors should balance out and I should get the correct conclusion.

Random assignment is good because it means that people are equally likely to be assigned to one group versus another, so it is likely to control for confounding variables. What are confounding variables? Those are factors that may have complex relationships that distort the relationships found between your predictors/ risk factors and outcome variables. For example, people residing in nursing homes (my predictor) may be more likely to die (my outcome) but that might be because they are older or in poorer health (confounding variables).

Random selection is good because it means that everyone in the population has an equal chance to be selected, which means that, if you have a large enough sample, your sample is likely to be representative.

What’s a sample? What’s a population? What’s representative?

Well, we’ll get into that shortly.

But, speaking of random, I thought the most important thing to begin with was not how to find a mean or standard deviation but that bias is bad, because if you have bias, you are worse off after you found the mean than before you knew how to compute it. Before you didn’t have any information, you didn’t know the mean and you knew you didn’t know it.

With bias, you still don’t know the mean, but you think you do. You’ve actually gone backwards.

Think about it.

statistics

Satterthwaite, variances, walruses and uteruses
ByAnnMaria De Mars October 17, 2008

Statistics applies to everything. Today I was looking up examples of the Satterthwaite alternative to the pooled variance t-test. In short, a t-test is used when one wants to answer the question, “Is the difference between these two groups greater than one would expect to find by chance?” Any time you measure two groups, whether…

Read More Satterthwaite, variances, walruses and uteruses
Dr. De Mars General Life Ramblings | statistics

I’m Claiming my Love Stats Award
ByAnnMaria De Mars November 25, 2011

It’s about time I got some recognition ! You can claim your own Love Stats award here. Careful, of undeserved awards, though. The last person who falsely claimed a Love Stats award had multicollinearity in his measures, a high VIF and died of complications of homoscedasticity. You have been warned.

Read More I’m Claiming my Love Stats Award
Software | statistics

How Do I Write a Statistical Analysis Paper? Advice to Students
ByAnnMaria De Mars May 15, 2015May 24, 2015

I get asked this question fairly often so I thought I would do a few posts on it. The most common problem is that a student who is new to statistics has no idea where to even start. These examples use SAS but you could use any package you like. My recommendation to students beginning…

Read More How Do I Write a Statistical Analysis Paper? Advice to Students
statistics

Box and whisker plots – they’re not just fun to say!
ByAnnMaria De Mars September 17, 2013

Box and whisker plots can give you an understanding of your data at a glance – IF you know what you’re looking at. The BOX extends from the 25th percentile to the 75th percentile. That line in the middle is the median, also known as the 50th percentile. The diamond inside the box is the…

Read More Box and whisker plots – they’re not just fun to say!
Software | statistics | Technology

Plots of Relative Risk: A picture says 1,000 words
ByAnnMaria De Mars March 20, 2016

I can’t believe I haven’t written about this before – I’m going to tell you an easy (yes, easy) way to find and communicate to a non-technical audience standardized mortality rates and relative risk by strata. It all starts with PROC STDRATE . No, I take that back. It starts with this post I wrote…

Read More Plots of Relative Risk: A picture says 1,000 words
statistics

Native Americans: Why Heidi Heitkamp won & Nate Silver was wrong?
ByAnnMaria De Mars November 19, 2012

The past couple of weeks, I’ve been hearing my friends from Turtle Mountain and Spirit Lake talk about the election in North Dakota. I was particularly interested because this was the one election that Nate Silver predicted incorrectly. He had Heitkamp down by 3.9 percent, and yet she won. I have no idea how Silver’s…

Read More Native Americans: Why Heidi Heitkamp won & Nate Silver was wrong?

6 Comments

Michelle Homes says:

November 22, 2013 at 2:23 am

I think to also question the quality of the data and how it was obtained. Missing values, negative ages, numerous categories when there shouldn’t be etc. this affects the statistics and should also be investigated/explored whilst considering bias and randomness.
Tricia Aanderud says:

November 22, 2013 at 5:51 am

“on the low side of the bell curve in terms of functioning brain cells.”

lol
Quentin McMullen says:

November 22, 2013 at 9:31 am

Well, I’m not a prof, but I’ve taken intro stats courses several times. : ) I like the bias/precision discussion. I always found images like this useful for that: http://www.yorku.ca/psycho/en/postscript.asp

But for intro stats, might start with explaining *why* people use statistics. Simplisticly, in many settings we can’t count/measuere everything. For those populations that are too big to measure everything, we will never know the truth exactly. When that happens, we can either know nothing, or use statistics to develop an *estimate* based on a sample….

Which leads nicely into your discussion of whether you would rather have an estimate with bias or imprecision.

Love your blog. It’s inspiring.
Alex Reutter says:

November 23, 2013 at 2:51 pm

Regarding bias, and whether it is universally bad, I have always liked Maurice Kendall’s “Hiawatha designs an experiment” http://www.columbia.edu/~to166/hiawatha.html
AnnMaria says:

November 23, 2013 at 5:18 pm

Ha ha, Alex – The Hiawatha designs an experiment is funny!
Pingback: Throwback Thursday: What’s The First Thing You Tell Students About Statistics? | 7 Generation Games

Similar Posts

6 Comments

Leave a Reply