statistics

What you need to know before multivariate statistics

By AnnMaria De Mars, October 8, 2014

You might have gotten the misimpression from my previous post, where I said I don’t think students need to learn all that much matrix algebra, that I am a slacker as far as expecting students to come to courses with some prior knowledge. That’s not exactly the case. In fact, here are some things I just assume students coming into a multivariate statistics course should know. Even though some textbooks begin with these, well, all I can say is if you have had three statistics courses and you still don’t know what a covariance is, I think something has gone awry in your education.

Know the equation to compute variance (it’s pretty darn basic) and have a really good understanding of interpreting variance, like what 0 variance means and the statistical and practical interpretation of explained variance. I personally view science as the search for explained variance.
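REALLY understand covariance. That is, know how it is calculated, that it is a measure of linear relationship, and that a covariance of 0 means no linear relationship, which is not the same thing as independence: independent variables have a covariance of 0, but variables with a covariance of 0 are not always independent.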
Be able to interpret a correlation.
Have a basic grasp of the Central Limit Theorem and the difference between population values and sample statistics.
Understand what a chi-square is, how you get it and how you interpret it.
Remember the definition and interpretation of an F-test.
Understand the difference between statistical significance and effect size.
Know what a null hypothesis test is.
Realize that before you do ANYTHING with data, if you don’t check the data coding and quality you are an idiot. You should have some understanding of how to read a codebook and be able to compute a frequency distribution, descriptive statistics and data description (like a PROC CONTENTS with SAS). When I look at the scant attention many so-called researchers pay to issues like missing data, miscoded data and non-random sampling, I am surprised we’re ever able to replicate anything.
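
For anyone who wants to check the first few items by hand, here is a minimal sketch in Python. The choice of Python is an assumption (the post only mentions SAS), and the function names and the tiny made-up data set are hypothetical, chosen only for illustration. It computes the sample variance, covariance and correlation from their definitions, and shows a case where the covariance is 0 even though one variable is completely determined by the other.

```python
from statistics import mean


def sample_variance(xs):
    """Sum of squared deviations from the mean, divided by n - 1."""
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)


def sample_covariance(xs, ys):
    """Sum of cross-products of deviations from the means, divided by n - 1."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)


def correlation(xs, ys):
    """Covariance rescaled by both standard deviations (Pearson's r)."""
    return sample_covariance(xs, ys) / (
        sample_variance(xs) * sample_variance(ys)
    ) ** 0.5


# Made-up illustrative data, not from any real study.
x = [-2, -1, 0, 1, 2]
constant = [7, 7, 7, 7, 7]        # every value identical
y_linear = [2 * v + 1 for v in x]  # perfectly linear in x
y_square = [v ** 2 for v in x]     # determined by x, but not linearly

print(sample_variance(constant))       # 0.0: no spread, nothing to explain
print(sample_covariance(x, y_linear))  # 5.0: positive linear relationship
print(correlation(x, y_linear))        # 1.0: perfect linear relationship
print(sample_covariance(x, y_square))  # 0.0: yet y_square depends on x
```

The last line is the point of the covariance item above: `y_square` is anything but independent of `x`, but because the relationship is not linear, the covariance comes out to 0.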

Diving into MANOVA was really what I wanted to blog about next, so maybe I will actually get to that in the context of analyzing missing data, but having failed already at my attempt to leave my desk before midnight, that will have to wait until next time.

Having found no significant differences between the missing and non-missing data, as I’d expected, I went on to do a couple more analyses where I was quite surprised not to find differences, but that will also have to wait for next time. I’m really only mentioning it here so I don’t forget. Wouldn’t you think that there would be differences in hospital length of stay and age by race and region? Well, I would, but I was wrong.

On a random note, I have to say, I really do love this remote desktop setup for teaching. It solves the problem of whether students have Windows or Mac, and of having to get the needed software installed. All the way around, I love it.
