| |

Fixing character data without beatings: SAS Enterprise Guide

At the JMP seminar on Monday, when Dick De Veaux said that 65-70% of time in all research projects is spent on data cleaning, everyone in the audience groaned in agreement. One of the biggest problems I run into is recoding those simple textboxes. For example, we often want to look at data for one…

|

When acceptance is really rejection: Death by Green Pants

The model is non-significant, therefore my theory is supported. Huh? Just when you thought it was safe to get back into statistics… It took you two years of graduate school but now you have it down. P-value low = good, relationship detected, publication, tenure, Abercrombie & Fitch models at your feet. P-value = high, no…

|

Controlling for Damn Near Everything: Propensity Score Matching

Lately I have been on a roll looking at relatively less common statistical techniques, proportional hazards, survival analysis, etc. In keeping with that, I have been taking a look at propensity score matching, fondly known as PSM by, – well, by no one actually. The problem to be solved …. Think about some of these…

| | |

The Sea Monkey Effect prevents robot uprising

Since I have written about odds ratios and logs lately, I was going to write about the natural log of the odds ratio, however, random events have caused me to do otherwise. I read an interesting blog by Adam Jackson lately, in which he is concerned that robots will take over the world. At the…

|

Logarithms

Logistic regression is based on logarithms. Ordinary Least Squares regression and analysis of variance uses the actual values as the dependent and independent variables in an equation. Logistic regression does not. What is a log, anyway? Let’s start with the very basics. First we learned to add: 5+5+5+5 = 20 After about eight years of…

|

Baby Steps to Logistic regression

Going from the phi coefficient to odds-ratios. Remember the numerator for the phi coefficient was well, the odds ratio is the same two numbers DIVIDED rather than subtracted. You might think it is four numbers, but really it is not. The first number is the product of the diagonal cells (see below). The second number…

| | | | | |

Phi coefficients, odds ratios and the F-word

Yes, I am the F-word – a feminist. I was at a faculty meeting this weekend and one of the presenters began by saying, pointing to a colleague in the audience, “I am sure Dr. Y knows more about this than me.” Several times in her presentation on analysis of assessment data she would pause…