statistics

What’s the first thing you tell students about statistics?

By AnnMaria De Mars, November 22, 2013

I’m looking forward to teaching my first master’s-level course in a lo-o-ng time next week. Since this may be the first course students take in their master’s program, the question I’m faced with is,

“What would you tell someone at the very beginning of learning about statistics?”

I’m starting with this:

Bias = bad

Bias is to statisticians as sin is to preachers. We’re against it.

Bias is SYSTEMATIC error. While it is generally impossible to avoid error, in an unbiased study, error will be random.

Random = good

If error is random, we are as likely to err in one direction as in the other, and so, on average, we would get the correct result. For example, if I were evaluating fighters to decide whether they really did have brain damage as a result of being hit in the head too many times, in some borderline cases I might incorrectly decide the fighter was fine when, in fact, there was some minimal brain damage. In other cases, I might decide the person had damage when he or she was just somewhat on the low side of the bell curve in terms of functioning brain cells. On average, though, those errors should balance out and I should get the correct conclusion.
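If you would rather see that than take my word for it, here is a minimal simulation sketch. Everything in it is made up for illustration: the true score of 100, the noise level of 15, the bias of 5 points, and the function name `simulate_measurements` are all assumptions, not data from any real study.

```python
import random
import statistics


def simulate_measurements(true_value, n, noise_sd, bias=0.0, seed=0):
    """Return n noisy measurements of true_value.

    noise_sd is the spread of the random error; bias is a constant
    systematic shift added to every measurement (0.0 means unbiased).
    """
    rng = random.Random(seed)
    return [true_value + bias + rng.gauss(0.0, noise_sd) for _ in range(n)]


TRUE_SCORE = 100.0  # hypothetical "true" value we are trying to estimate

# Same amount of random error in both cases; only the second one is biased.
unbiased = simulate_measurements(TRUE_SCORE, n=10_000, noise_sd=15.0)
biased = simulate_measurements(TRUE_SCORE, n=10_000, noise_sd=15.0, bias=5.0)

unbiased_error = statistics.mean(unbiased) - TRUE_SCORE
biased_error = statistics.mean(biased) - TRUE_SCORE

print(f"unbiased: mean error = {unbiased_error:+.2f}")
print(f"biased:   mean error = {biased_error:+.2f}")
```

With only random error, the individual mistakes mostly cancel and the average lands close to the true score. With the systematic shift, the average stays off by roughly the size of the bias no matter how many measurements you take.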

Random assignment is good because it means that people are equally likely to be assigned to one group versus another, so it is likely to control for confounding variables. What are confounding variables? Those are factors related to both your predictors (or risk factors) and your outcome variables, which can distort the relationships you find between them. For example, people residing in nursing homes (my predictor) may be more likely to die (my outcome), but that might be because they are older or in poorer health (confounding variables).
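Here is a rough sketch of why random assignment helps with a confounder like age. The ages are invented (drawn uniformly between 60 and 95), and the "non-random" split is a deliberately extreme assumption where the older half of the people all land in one group.

```python
import random
import statistics

rng = random.Random(42)

# Hypothetical ages for 2,000 people; the distribution is made up.
ages = [rng.uniform(60, 95) for _ in range(2000)]

# Random assignment: shuffle a copy, then split it down the middle.
shuffled = ages[:]
rng.shuffle(shuffled)
group_a, group_b = shuffled[:1000], shuffled[1000:]
random_gap = statistics.mean(group_a) - statistics.mean(group_b)

# Non-random assignment: the older half ends up in one group, as might
# happen if age influences who moves into a nursing home.
by_age = sorted(ages)
younger, older = by_age[:1000], by_age[1000:]
sorted_gap = statistics.mean(older) - statistics.mean(younger)

print(f"age gap with random assignment:     {random_gap:+.2f} years")
print(f"age gap with non-random assignment: {sorted_gap:+.2f} years")
```

Under random assignment the two groups end up with nearly the same average age, so age cannot explain a difference in outcomes between them. In the non-random split, the groups differ in age by many years before you have measured anything else.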

Random selection is good because it means that everyone in the population has an equal chance to be selected, which means that, if you have a large enough sample, your sample is likely to be representative.
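The "large enough sample" part can be sketched the same way. The population below is hypothetical (100,000 made-up scores with a mean near 50), and `typical_sampling_error` is a helper name I invented for this illustration.

```python
import random
import statistics

rng = random.Random(7)

# Hypothetical population of 100,000 scores; the values are made up.
population = [rng.gauss(50.0, 10.0) for _ in range(100_000)]
population_mean = statistics.mean(population)


def typical_sampling_error(sample_size, reps=100):
    """Average absolute gap between a random sample's mean and the
    population mean, over `reps` independent simple random samples."""
    gaps = []
    for _ in range(reps):
        sample = rng.sample(population, sample_size)  # without replacement
        gaps.append(abs(statistics.mean(sample) - population_mean))
    return statistics.mean(gaps)


small_sample_error = typical_sampling_error(10)
large_sample_error = typical_sampling_error(10_000)

print(f"typical error, n = 10:     {small_sample_error:.3f}")
print(f"typical error, n = 10,000: {large_sample_error:.3f}")
```

Because every member of the population has the same chance of being picked, neither sample size is biased. The small samples just bounce around more, while the large ones sit very close to the population mean.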

What’s a sample? What’s a population? What’s representative?

Well, we’ll get into that shortly.

But, speaking of random, I thought the most important thing to begin with was not how to find a mean or standard deviation but that bias is bad, because if you have bias, you are worse off after you have found the mean than you were before you knew how to compute it. Before, you didn’t have any information: you didn’t know the mean, and you knew you didn’t know it.

With bias, you still don’t know the mean, but you think you do. You’ve actually gone backwards.

Think about it.

6 Comments

Michelle Homes says:

November 22, 2013 at 2:23 am

I think you should also question the quality of the data and how it was obtained. Missing values, negative ages, numerous categories when there shouldn’t be, etc. all affect the statistics and should also be investigated/explored whilst considering bias and randomness.

Tricia Aanderud says:

November 22, 2013 at 5:51 am

“on the low side of the bell curve in terms of functioning brain cells.”

lol

Quentin McMullen says:

November 22, 2013 at 9:31 am

Well, I’m not a prof, but I’ve taken intro stats courses several times. : ) I like the bias/precision discussion. I always found images like this useful for that: http://www.yorku.ca/psycho/en/postscript.asp

But for intro stats, I might start with explaining *why* people use statistics. Simplistically, in many settings we can’t count/measure everything. For those populations that are too big to measure everything, we will never know the truth exactly. When that happens, we can either know nothing, or use statistics to develop an *estimate* based on a sample….

Which leads nicely into your discussion of whether you would rather have an estimate with bias or imprecision.

Love your blog. It’s inspiring.

Alex Reutter says:

November 23, 2013 at 2:51 pm

Regarding bias, and whether it is universally bad, I have always liked Maurice Kendall’s “Hiawatha designs an experiment” http://www.columbia.edu/~to166/hiawatha.html

AnnMaria says:

November 23, 2013 at 5:18 pm

Ha ha, Alex – “Hiawatha designs an experiment” is funny!

Pingback: Throwback Thursday: What’s The First Thing You Tell Students About Statistics? | 7 Generation Games
