statistics

Let’s Talk about Multivariate Research Designs: Part 1

ByAnnMaria De Mars December 18, 2014December 18, 2014

(There may even be a part two, if I get around to it.)

Let me ask you a couple of questions:

1. Do you have more than just one dependent variable and one independent variable?

2. If you said, yes, do you have a CATEGORICAL or ORDINAL dependent variable? If so, use logistic regression. I have written several posts on it. You can find a list of them here. Some involve Euclid, marriage, SAS and SPSS. Alas, none involve a naked mole rat. I shall have to remedy that.

3. You said yes to #1, multiple variables, but no to number 2, so I am assuming you have multiple variables in your design and your dependent variable is interval or continuous, something like sales for the month of December, average annual temperature or IQ. The next question is do you have only ONE dependent variable and is it measured only ONCE per observation? For example, you have measured average annual temperature of each city in 2013 or sales in December , 2012. In this case, you would do either Analysis of Variance or multiple regression. It doesn’t matter much which you do if you code it correctly. Both are specific cases of the general linear model and will give you the same result. You may also want to do a general linear MIXED model, where you have city as a random effect and something else, say, whether the administration was Democratic or Republican as a fixed effect. In this case I assume that you have sales as your dependent variable because contrary to the beliefs of some extremists, political parties do not determine the weather. Generally, whether you use a mixed model or an Ordinary Least Squares (OLS) plain vanilla ANOVA or regression will not have a dramatic impact on your results unless the result is a grade in a course where the professor REALLY wants you to show that you know that school is a random effect when comparing curricula.

4. Still here? I’m guessing you have one of two other common designs. That is, you have measured the same subjects, stores, cities, whatever, more than once. Most commonly, it is the good old pretest posttest design and you have an experimental and control group. You want to know if it works. If you have only tested your people twice, you are perfectly fine with a repeated measures ANOVA. If you have tested them more than twice, you are very likely to have grossly violated the assumption of compound symmetry and I would recommend a mixed model.

5. All righty then, you DO have multiple variables, they are NOT categorical or ordinal, your dependent variable is NOT repeated, so you must have multiple dependent variables. In that case, you would do a multivariate Analysis of Variance.

Some might argue that logistic regression is not a multivariate design. Other people would argue with them that, assuming your data are multinomial, you need multiple logit functions so that really is a type of multivariate design. A third group of people would say it is multivariate in the ordinal or multinomial case because there are multiple possible outcomes.

Personally, I wonder about all of those types of people. I wonder about the amount of time in higher education spent in forcing students to learn answers to questions that have no real use or purpose as far as I can see.

On the other hand, while knowing whether something falls in the multivariate category or not probably won’t impact your life or analyses, if you treat time as an independent variable and analyze your repeated measures ANOVA with experiment and condition as a 2 x 2 ANOVA, you’re screwed.

Know your research designs.

SPSS Propensity Scores – Part 2

ByAnnMaria De Mars February 13, 2012February 13, 2012

I wrote Part 1 a couple of years ago, so I guess I’m due for a part 2. In this case, I started with a data set in SAS but because it was going to be used by a group who had some SAS users and some SPSS users, they wanted to have the code…

statistics

Sports equality, t-tests and standard error

ByAnnMaria De Mars August 25, 2011August 25, 2011

Today, taking a break from writing the grant proposal that has no end, I found myself thinking about easy ways to explain and understand standard error. To understand standard error, you have to have some statistic that you’re discussing the standard error of. As a random example, let’s just take the mean. T-TEST PROCEDURE FOR…

Software | statistics | Technology

MANOVA from beginning to end: Reliability

ByAnnMaria De Mars June 15, 2017

Where is the Multivariate Analysis of Variance ? You promised there would be MANOVA ! Now we’re in the third post! First there was recoding of variables. Then, there was creating scales. Now, we’re looking at reliability. Patience is a virtue. Before we get to doing a MANOVA we want to be sure that our…

20 Day Blogging | statistics

Amos, covariances and variances: Twenty Day Blogging Challenge

ByAnnMaria De Mars January 6, 2014January 14, 2014

I came across this really interesting post on the 20-Day Blogging Challenge for teachers. I’m not sure how likely I am to be able to finish it in January since it is already the sixth and January is a really busy month for me, but we will see. The first prompt is “Tell about a…

Software | statistics | Technology

Super-Easy Outlier Check with Proc Freq

ByAnnMaria De Mars July 31, 2015

Sometimes, you can just eyeball it. Really, if something truly is an outlier, you ought to be able to spot it. Take this plot, for example. It should be pretty obvious that the vast majority of our sample for the Fish Lake game were students in grades, 4, 5 and 6. Those in the lower…

statistics

The F-statistic in ANOVA explained

ByAnnMaria De Mars November 29, 2012September 5, 2014

I tried to find an easily comprehended explanation of the F-statistic for my students but I could not, so, here as a public service is mine. If you have some other pages you can recommend, please let me know. Okay, why ANOVA? Why not just do a t-test? Well, let’s say you have five groups….

2 Comments

Jan Karel Pieterse says:

December 18, 2014 at 4:17 am

Hi,

Excellent post once again.

I’d be interested to hear what you’d have to say about the stage prior to the analysis of the results: the design of the experiments.

After all, an experiment is pointless if it isn’t designed well!
AnnMaria says:

December 18, 2014 at 1:40 pm

That’s a line I quote to my students all of the time, “Calling in a statistician after the experiment is over is like calling in a physician after the patient is dead. At best, one can tell you what threats to validity caused the experiment’s death.”

Similar Posts

2 Comments

Leave a Reply