Are we living longer – or not?

ByAnnMaria De Mars December 13, 2011December 13, 2011

It is not every day that my refrigerator provides insight into a statistical problem. My daughter gave me this magnet.

BIRTHDAYS ARE GOOD FOR YOU. STATISTICS SHOW THAT PEOPLE WHO HAVE THE MOST BIRTHDAYS LIVE THE LONGEST.

which led to my thoughts on life expectancy using open data. Kaiser Permanente collected data on two cohorts of patients, those who were 65 years old or older in 1971 and in 1980. After having published some research supported by the National Institute on Aging, on topics of interest to themselves and their funding agency, like cardiovascular disease, the investigators made their data available through ICPSR where I downloaded it.

I had read elsewhere that the life expectancy had increased even within that decade. If that is true, I reasoned, looking at the survival curves for people from the 1971 cohort (1) and the 1980 cohort (2) would show some differences.

When I look at the survival curves by strata

(

PROC LIFETEST DATA = saslib.death ;

STRATA cohort ;

TIME yrslived *dthflag(0) ;

)

I get the survival curves above and it is pretty clear they are the same. If anything, it looks like Cohort 2, those born later, actually had a slightly higher mortality near the end of the study. For those of you who feel uncomfortable just eyeballing the curves, even when they are as close to identical as this, the Log-Rank test for equality of strata = 1.27 (p > .25).

And yet, on the other hand, when I did a t-test by age, I found that the 1980 cohort did live significantly longer, with those who turned 65 in 1971 having a mean age at death of 84.7 while those who turned 65 in 1980 had a mean age at death of 85.3 (p = .01 ) which leads to the conclusion that people from the 1980 cohort did live longer.

What’s going on here? This is the point where the getting to know your data part that I am always harping on comes into play. Note that Kaiser-Permanente said that they collected data on people who were 65 or older in 1971 or 1980, not who turned 65.

In fact, the two samples were not the exact same age. The mean age of the 1971 sample was 75.7 and of the 1980 sample 76.1 . So, of that .6 year difference in lifespan, .4 of it existed before the study even started.

What difference would that make? Well, let’s go back up to my refrigerator magnet. What does the fact that someone has lived to 65 tell you? Most unequivocally that they did not die of anything at an age earlier than 65. They weren’t killed in the Vietnam War when they were 19 years old, in a car crash with a drunk driver when there were 33, from colon cancer at 56. Because 100% of the population of people who live to 65 have escaped these hazards for the first 65 years of life, they are NOT representative of the population in terms of life expectancy. This is why when you read articles they have statements like, “For an American male who has lived to age 65, life expectancy is ….”

The qualifying phrase is necessary because those who have had more birthdays already are expected to live longer than the general population.

So, I pulled out those who were 65 when the study started and looked at the survival curve

SURVIVAL CURVE FOR PATIENTS AGED 65

COHORT 1 (1971) AND COHORT 2 (1980)

A t-test of years lived for those in Cohort 1 versus Cohort 2 using only the 290 subjects who were 65 years old at the start of the study produced a very non-significant t-value of .56 (p > .50) .

T-tests for subjects at age 75 and age 85 produced similar results. So, based on these data, the answer to the question of at least whether patients of Kaiser Permanente have increased in life expectancy over the 1970s is, “No”. This isn’t a comment on Kaiser Permanente one way or the other, merely an observation that it is unlikely that their patients are completely representative of the population.
Just an aside, a million points to people who put their data on the web and open to all comers. This shows two traits I admire. The first is generosity, allowing someone else to benefit from your efforts in collecting the data, with no expectation of return. The second is courage. It takes a good amount of courage to publish your results and then make your data available so anyone who wants can re-analyze the data and perhaps come up with a competing conclusion. So, props to you.

P.S. You can buy the hand soaps on etsy. I have no affiliation with them, I just thought they were funny.

statistics

Statistics save the world

ByAnnMaria De Mars October 14, 2016

I will be the first to admit that I’m not the warm fuzzy type. Maybe you’re like me, you’d like to do good for your community but you just can’t see yourself as a physician. Maybe your bedside manner is to snap at someone to quit being a whiner. Or maybe you really are a…

Software | statistics | Technology

Bonus SAS Tips Before SAS Global Forum

ByAnnMaria De Mars March 26, 2015March 27, 2015

I’m giving a talk on Preparing Students for the Real World of Data at SAS Global Forum next month. You’d think 50 minutes would be long enough for me to talk, but that just goes to show you don’t know me as well as you think you do. One point made in the template for…

statistics

Why Effective Treatment Can Make a Disease MORE Common

ByAnnMaria De Mars January 30, 2015

Listen my dears and you will learn the difference between incidence and prevalence and why effective treatment for a disease may mean that more people have it. Prevalence is the number of people in the population who have a disease. It is computed as Number of people with disease X 1,000 Number in population…

statistics

Significance & Mauchly’s W: I don’t think that word means what you think it means

ByAnnMaria De Mars April 13, 2011April 13, 2011

One of my favorite movie lines ever comes from The Princess Bride, “You keep using that word. I do not think it means what you think it means.” Sometimes I want to say that to people who want me to give an explanation for every result that is “significant”. Perhaps you would like to test…

Software | statistics

SPSS – Four Cool Things

ByAnnMaria De Mars August 6, 2008

If you haven’t checked out the site 43thing.coms website, you really should. You can post up to 43 things that you want to do and then add updates as you make progress on your things. As no good idea should ever go uncopied, I decided to write a couple of posts on cool things about…

Software | statistics | Technology

Text Miner = Coolness

ByAnnMaria De Mars May 2, 2012May 2, 2012

In the past, when I had to do any type of parsing of text, I wrote my own code with a zillion SUBSTR functions and IF statements and it did the job but it was *so-o-o ugly and painful that I never even considered including text mining in any courses I taught. I looked into…

3 Comments

Gabe says:

December 14, 2011 at 10:51 am

Why not just treat it as left truncated data? The subjects are truncated such that we are observing them only if their age X is greater than the truncation time Y, with Y being the age upon entry into the study.
AnnMaria says:

December 15, 2011 at 1:02 pm

They DID include people with age >65. The problem is that in the second cohort the patients were significantly MORE >65 than in the first cohort.

Do you mean use age as the TIME variable instead of years survived after the study began? That would not work because of the correlation between age at first measurement and life expectancy I mentioned above.
Gabe says:

December 15, 2011 at 9:30 pm

I think there might’ve been a misunderstanding with what I wrote. I wasn’t claiming that they didn’t include people older than 65.

I was proposing an alternate way to handle the fact that the mean age between the 2 cohorts differed.

Specifically, treating the data as left truncated, IN ADDITION to already treating it as right censored. With left truncated data, subjects with a lifetime less than some threshold (in this case, age 65) are not observed. This seems like a fairly typical “delayed entry” study, where subjects are not observed at all until they have reached a certain age. The left truncation time varies by subject in this dataset, and can be 65 or can be older.

FWIW, the Klein and Moeschberger text on Survival (Chapter 3) covers this in pretty good detail.

Similar Posts

3 Comments

Leave a Reply