{"id":5059,"date":"2016-04-22T18:48:08","date_gmt":"2016-04-22T23:48:08","guid":{"rendered":"http:\/\/www.thejuliagroup.com\/blog\/?p=5059"},"modified":"2016-04-22T18:48:08","modified_gmt":"2016-04-22T23:48:08","slug":"visual-analytics-are-everywhere-sas-global-forum-continued","status":"publish","type":"post","link":"https:\/\/www.thejuliagroup.com\/blog\/visual-analytics-are-everywhere-sas-global-forum-continued\/","title":{"rendered":"Visual Analytics are EVERYWHERE: SAS Global Forum Continued"},"content":{"rendered":"<p>The nice thing about going to SAS Global Forum is that it&#8217;s the gift that keeps on giving. Long after I have gone home, there are still points to ponder.<\/p>\n<p class=\"p1\"><span class=\"s1\">Visual analytics is big, and not just in the sense that there is a product by that name (which I have never used): every presentation, no matter how &#8216;tech-y&#8217;, now makes very effective use of graphics. If I were the type of person to say I told you so, <a href=\"http:\/\/www.thejuliagroup.com\/blog\/?p=433\">I would mention that I predicted this six years ago after I went to SAS Global Forum in 2010<\/a>.<\/span><\/p>\n<p class=\"p1\"><span class=\"s1\"><a href=\"http:\/\/www.thejuliagroup.com\/blog\/?p=5056\">In my last post, I mentioned\u00a0the propensity score graphic with mustaches. 
<\/a><\/span><\/p>\n<p class=\"p1\"><span class=\"s1\">Richard Cutler&#8217;s presentation on PROC HPSPLIT, which was really excellent,\u00a0made extensive use of graphics to illustrate fairly complex models.<\/span><\/p>\n<p class=\"p1\"><a href=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/cutler_class.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-5061\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/cutler_class.png\" alt=\"Nodes in subtree\" width=\"450\" height=\"344\" srcset=\"https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/cutler_class.png 450w, https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/cutler_class-300x229.png 300w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p class=\"p1\">You can create classification and regression trees (the model you can&#8217;t see in this tiny graphic on the left) and you can drill down into sub-trees for further analysis.<\/p>\n<p class=\"p1\">Sometimes your classification tree is very easily interpretable. 
For example, in this case from the same presentation, each split represents a different type of vegetation or land surface &#8211; water, two different species of tree, etc.<\/p>\n<p class=\"p1\"><a href=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/split.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-5062\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/split.png\" alt=\"Classification tree\" width=\"450\" height=\"1123\" srcset=\"https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/split.png 450w, https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/split-120x300.png 120w, https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/split-410x1024.png 410w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p class=\"p1\">Speaking of classification, regression and PROC HPSPLIT &#8230;<\/p>\n<p class=\"p1\"><strong>If you didn&#8217;t know, now you know<\/strong><\/p>\n<p class=\"p1\"><span class=\"s1\">PROC HPSPLIT is a high-performance procedure for fitting and classification, now available in SAS\/STAT, that is useful for data sets where relationships are non-linear. It produces classification and regression trees, includes options for pruning trees and a whole lot more. It is now available on a single computer, not limited to high-performance computing clusters. So, yay!<\/span><\/p>\n<p class=\"p1\"><span class=\"s1\">A regression tree is what you get when your dependent variable is continuous, and a classification tree when it is categorical, as\u00a0in the vegetation example above.<\/span><\/p>\n<p class=\"p1\">On a semi-related note, graphics can even be used to show when a data set is not suited to a linear model,\u00a0as in the example below, also from Cutler&#8217;s presentation. 
You can see that all of the 1&#8217;s are in two quadrants and all of the 0&#8217;s are in the other two quadrants. Yes, you COULD use a regression line to fit this, but that is not the best fit of the data.<\/p>\n<p class=\"p1\">Also, on the related topic that visualizing data, like all of statistics, really, is an iterative process: I think this pattern would be more obvious if the quadrants were color coded.<\/p>\n<p class=\"p1\"><a href=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/classify.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-5064\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/classify.png\" alt=\"classify\" width=\"450\" height=\"585\" srcset=\"https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/classify.png 450w, https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/04\/classify-231x300.png 231w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p class=\"p1\">I have a lot more to say on this but I am in North Dakota speaking at the ND STEM conference this weekend and a kind soul gave me tickets to the hockey game in the president&#8217;s box, so, peace, I&#8217;m out.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The nice thing about going to SAS Global Forum is that it&#8217;s the gift that keeps on giving. Long after I have gone home, there are still points to ponder. 
Visual analytics is big and not just in the sense of there is a product out called that which I have never used but that&#8230;<\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"_kad_post_classname":"","_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[9,11,8],"tags":[],"class_list":["post-5059","post","type-post","status-publish","format-standard","hentry","category-software","category-statistics","category-technology"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/posts\/5059","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/comments?post=5059"}],"version-history":[{"count":1,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/posts\/5059\/revisions"}],"predecessor-version":[{"id":5065,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/posts\/5059\/revisions\/5065"}],"wp:attachment":[{"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/media?parent=5059"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/categories?post=5059"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp
-json\/wp\/v2\/tags?post=5059"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}