Visual Analytics are EVERYWHERE: SAS Global Forum Continued

By AnnMaria De Mars, April 22, 2016

The nice thing about going to SAS Global Forum is that it’s the gift that keeps on giving. Long after I have gone home, there are still points to ponder.

Visual analytics is big, and not just in the sense that there is a product by that name (which I have never used). Every presentation, no matter how ‘tech-y’, now makes very effective use of graphics. If I were the type of person to say I told you so, I would mention that I predicted this six years ago, after I went to SAS Global Forum in 2010.

In my last post, I mentioned the propensity score graphic with mustaches.

Richard Cutler’s presentation on PROC HPSPLIT, which was really excellent, made extensive use of graphics to illustrate fairly complex models.

You can create classification and regression trees (the model you can’t see in this tiny graphic on the left) and you can drill down into sub-trees for further analysis.

Sometimes your classification tree is very easily interpretable. For example, in this case from the same presentation, each split represents a different type of vegetation or land surface – water, two different species of tree, etc.

Speaking of classification, regression and PROC HPSPLIT ….

If you didn’t know, now you know

PROC HPSPLIT is a high-performance procedure for model fitting and classification, now available in SAS/STAT, that is useful for data sets where relationships are non-linear. It produces classification and regression trees, includes options for pruning trees, and does a whole lot more. It now runs on a single computer and is not limited to high-performance computing clusters. So, yay!

A regression tree is what you get when your dependent variable is continuous, and a classification tree when it is categorical, as in the vegetation example above.
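To make that distinction concrete, here is a minimal sketch in Python of what a single leaf of a tree typically predicts in the textbook version of these models: the mean of the target for a regression tree, and the most common category for a classification tree. The function name `leaf_prediction` and the sample values are made up for illustration; this is not how PROC HPSPLIT is implemented internally.

```python
from collections import Counter
from statistics import mean


def leaf_prediction(targets, kind):
    """Return the value a tree leaf would predict for the observations in it.

    `kind` is a hypothetical switch for this sketch: "regression" for a
    continuous dependent variable, "classification" for a categorical one.
    """
    if kind == "regression":
        # Continuous target: predict the average of the values in the leaf.
        return mean(targets)
    if kind == "classification":
        # Categorical target: predict the most frequent category in the leaf.
        return Counter(targets).most_common(1)[0][0]
    raise ValueError("kind must be 'regression' or 'classification'")


# Illustrative values only, not data from the presentation.
continuous_leaf = [2.0, 4.0, 6.0]
categorical_leaf = ["water", "water", "aspen"]

print(leaf_prediction(continuous_leaf, "regression"))
print(leaf_prediction(categorical_leaf, "classification"))
```

Everything else about the tree (where to split, how far to grow, how to prune) works on the same principle for both types; only the summary stored in each leaf changes.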

On a semi-related note, graphics can even be used to show when a data set is not suited to a linear model as in the example below, also from Cutler’s presentation. You can see that all of the 1’s are in two quadrants and all of the 0’s in two other quadrants. Yes, you COULD use a regression line to fit this but that is not the best fit of the data.
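The quadrant pattern is easy to reproduce. The sketch below is in Python rather than SAS and uses made-up data: a grid of points where, by assumption, the 1’s sit in the two diagonally opposite quadrants (upper right and lower left) and the 0’s in the other two. It fits an ordinary least squares line and compares its R-squared with that of a hand-written two-split tree. The tree is written by hand to show the idea, not grown by any algorithm, and none of this is taken from Cutler’s presentation.

```python
# Deterministic 10 x 10 grid on [-0.9, 0.9]; no point sits exactly on an axis.
grid = [(i - 4.5) / 5 for i in range(10)]
points = [(x, y) for x in grid for y in grid]
# Assumption: 1 in the upper-right and lower-left quadrants, 0 in the other two.
labels = [1 if (x > 0) == (y > 0) else 0 for x, y in points]


def solve(a, b):
    """Solve the small linear system a * coef = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Partial pivoting: move the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_linear(points, labels):
    """Ordinary least squares for label ~ b0 + b1*x + b2*y (normal equations)."""
    rows = [(1.0, x, y) for x, y in points]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * t for r, t in zip(rows, labels)) for i in range(3)]
    return solve(xtx, xty)


def tree_predict(x, y):
    """A hand-written depth-two tree: split on x first, then on y."""
    if x > 0:
        return 1 if y > 0 else 0
    return 0 if y > 0 else 1


def r_squared(actual, predicted):
    """Share of the variance in `actual` explained by `predicted`."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


b0, b1, b2 = fit_linear(points, labels)
r2_linear = r_squared(labels, [b0 + b1 * x + b2 * y for x, y in points])
r2_tree = r_squared(labels, [tree_predict(x, y) for x, y in points])
print(f"linear R^2 = {r2_linear:.3f}, tree R^2 = {r2_tree:.3f}")
```

On this symmetric grid the fitted slopes come out at essentially zero, so the line explains almost none of the variation, while two splits recover the pattern exactly. Real data will be messier, but the contrast is the same one the plot shows.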

On a related note, visualizing data, like all of statistics, really, is an iterative process. I think the pattern would be more obvious if the quadrants were color coded.


I have a lot more to say on this but I am in North Dakota speaking at the ND STEM conference this weekend and a kind soul gave me tickets to the hockey game in the president’s box, so, peace, I’m out.
