statistics

The Multivariate Social Scientist: Book Review & Notes on Generalized Linear Models

ByAnnMaria De Mars October 11, 2014

I’ve been looking high and low for a supplemental text for a course on multivariate statistics and I found this one –

The Multivariate Social Scientist, by Graeme Hutcheson 7 Nick Sofroniou

They are big proponents of generalized linear models, in fact, the subtitle is “Introductory statistics using generalized linear models”, so if you don’t like generalized models, you won’t like this book.

I liked this book a lot. Because this is a random blog, here is day one of my random notes

A generalized linear model has three components:

The random component is the probability distribution assumed to underlie the response variable. (y)
The systematic component is the fixed structure of the explanatory variables, usually linear. (x1, x2 … xn)
The link function maps the systematic component on to the random component.

The systematic component takes the form

η = α + ß1×1 + ß2×2 + … ßnxn

They use η to designate the predicted variable instead of y-hat. I know you were dying to know that.

Obviously, since that IS a multiple regression equation (which could also be used for ANOVA), when you have linear regression, the link function is actually identity. With logistic regression, it is the logit function, which maps the log odds of the random component on to the systematic one.

The reason I think this is such a good book for students taking a multivariate statistics course is that it relates to what they should know. They certainly should be familiar with multiple regression and logistic regression, and understand that the log of the odds is used in the latter.

The book also discusses the log link used in loglinear analyses, which I don’t necessarily assume every student will have used. I don’t say that as a criticism, merely an observation.

statistics

What I’m Learning at NCES

ByAnnMaria De Mars June 28, 2011June 28, 2011

I’m currently at a seminar in Washington, D.C., sponsored by the National Center for Education Statistics. I’ve seen the notices for these a lot of times over the years, and always thought, “Well, that looks interesting” but never applied to go, primarily because although your expenses are paid, there is no stipend, so that is…

Dr. De Mars General Life Ramblings | statistics

The Myth of Equivalent Groups

ByAnnMaria De Mars December 1, 2012December 1, 2012

In fantasy land and fairy tales, there is this thing called equivalent groups. People are randomly assigned to a control group and a treatment group. Everyone in the treatment group receives the same treatment, for example, being sprinkled with exactly three teaspoons of fairy dust, and everyone in the control group does not….

statistics

Cluster Analysis: Finding Groups in Data

ByAnnMaria De Mars March 16, 2010March 16, 2010

Cluster analysis is one of those techniques I don’t get to use very often. About once every couple of years someone will be doing a study of types of companies, patients or clients and have a need for a cluster analysis. The best description I read of cluster analysis came from a book many years…

statistics

Simple graphs, not so simple answers

ByAnnMaria De Mars February 12, 2013February 12, 2013

The truth is, what I wanted to be talking about today was either data mining, text mining or mixed models. Those are three things I want to be doing more and would be doing more except that we have a Kickstarter campaign going on to fund the next six levels of our game that teaches…

statistics

The Facts of Factor Patterns

ByAnnMaria De Mars July 15, 2013

About a week ago, I went through pointing and clicking your way to a factor analysis. At the time, I suggested rotating the factors. Now we’re going to interpret the rotated factor pattern. Let me recap, briefly. Agresti and Finlay (p.532) put it way better than me when they said: Factor analysis is a multivariate…

Software | statistics | Technology

SAS ENTERPRISE MINER NOT WORKING? HERE’S WHY (maybe)

ByAnnMaria De Mars June 2, 2010June 2, 2010

If I had time, which I don’t, I would start a series of how-to articles for statistical software and copy the Car Talk scale they use as a guide for whether or not you should attempt a job yourself, from a. There are two kinds of screwdrivers ? to e. I have built a working…

2 Comments

yop says:

October 11, 2014 at 5:22 am

eta is the linear predictor but not on the scale of the outcome variable. So y_hat is inv_link(eta), not eta.
AnnMaria says:

October 11, 2014 at 2:31 pm

It depends. If you are thinking in terms of multiple regression, which is where most students begin the course, then they are used to seeing that equation = y_hat because the link function is identity, and that equation + error = the actual y.

You’re right, though, that the whole point of generalized models is to generalize beyond that.

As you’ve probably guessed, sometimes I write this blog as sort of thinking out loud while working on lecture notes for an upcoming class. I’m assuming that many students will be used to seeing the same equation in a different context.

Your point helps clarify it, though. Thank you.

Similar Posts

2 Comments

Leave a Reply