{"id":5135,"date":"2016-06-26T14:32:53","date_gmt":"2016-06-26T19:32:53","guid":{"rendered":"http:\/\/www.thejuliagroup.com\/blog\/?p=5135"},"modified":"2016-06-26T14:37:22","modified_gmt":"2016-06-26T19:37:22","slug":"data-analysis-by-example-thats-funny","status":"publish","type":"post","link":"https:\/\/www.thejuliagroup.com\/blog\/data-analysis-by-example-thats-funny\/","title":{"rendered":"Data Analysis by Example: That&#8217;s funny &#8230;"},"content":{"rendered":"<p><a href=\"http:\/\/www.thejuliagroup.com\/blog\/?p=5120\">In the last post, I used SAS Enterprise Guide<\/a> to filter out a couple of &#8216;bad&#8217; records that came from test data, then I created a summary table of the number of questions answered and the percentage correct. Then, I calculated the mean percentage correct for the\u00a0\u00a0around 84%. That seemed a bit high to me.<\/p>\n<p>Having (temporarily) answered the first question regarding the number of individual subjects and the average percent of correct answers from the 424 subjects, I turned to the next question:<\/p>\n<p><strong><em>Is there a correlation between percentage correct and the number of questions attempted? That is, do students who are getting the answers correct persist more often?<\/em><\/strong><\/p>\n<p>Since I had both variables, N and the mean correct (which, since this was score 0= correct, 1= incorrect gave me the percentage correct) from the summary tables I had created in the previous step, it was a simple procedure to compute the correlation.<\/p>\n<p>I just went to the TASKS menu, selected MULTIVARIATE and then CORRELATIONS<\/p>\n<p><a href=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/correlate1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-5137\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/correlate1.jpg\" alt=\"Selection menu for correlations\" width=\"450\" height=\"227\" srcset=\"https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/correlate1.jpg 450w, https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/correlate1-300x151.jpg 300w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p>Under ANALYSIS VARIABLES correct_ N for the &#8216;correct&#8217; variable, which is a variable that holds whether the\u00a0 student answered correctly, 0(= no) or 1(=yes).\u00a0 Under CORRELATE WITH I dragged correct_mean, which has the percentage each student answered correctly.<\/p>\n<p><a href=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/correlate2.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-5138\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/correlate2.jpg\" alt=\"Variables selected for correlation\" width=\"450\" height=\"422\" srcset=\"https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/correlate2.jpg 450w, https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/correlate2-300x281.jpg 300w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a>Since it is just a bivariate correlation and the correlation of X with Y = the correlation of Y with X , it would make absolutely no difference if I switched the spots where I dragged the two variables.<\/p>\n<p>I click run and <a href=\"http:\/\/www.thejuliagroup.com\/documents\/correlations1.html\">I get a somewhat unexpected result, you can see here, with a correlation of -.07<\/a>.<\/p>\n<p>I also note that the minimum number of answers attempted is 1. Now, I have done (and published) analyses of these data elsewhere, as this is an on-going project.<\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>Other analyses from this same project can be found in:<\/p>\n<p><a href=\"http:\/\/www.lexjansen.com\/wuss\/2013\/133_Paper.pdf\">Telling Stories with Your Data<\/a> and<\/p>\n<p><a href=\"http:\/\/wuss.org\/Proceedings15\/58_Final_Paper_PDF.pdf\">Yes, PROC FREQ Does That!<\/a><\/p>\n<hr \/>\n<p>Because of these analyses of &#8216;Fidelity of Implementation&#8217;, that is the degree to which a project is implemented as planned, I am pretty sure that these data include a large proportion of students who only had the opportunity to play the game once.<\/p>\n<p>So &#8230; I decided to run a scatter plot and check my suspicion. This is pretty simple. I just go to the TASKS menu and select GRAPH then SCATTER PLOT.<\/p>\n<p>I selected 2-D Scatter Plot<\/p>\n<p><a href=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/scatterplot1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-5139\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/scatterplot1.png\" alt=\"2D scatter plot selected\" width=\"279\" height=\"157\" \/><\/a><\/p>\n<p>Then, I clicked on the DATA tab, dragged correct_Mean under Horizontal and Correct_N and vertical, then clicked RUN.<\/p>\n<p><a href=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/scatter2.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-5140\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/scatter2.png\" alt=\"Data window for scatter plot\" width=\"450\" height=\"244\" srcset=\"https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/scatter2.png 601w, https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/scatter2-300x163.png 300w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p>This produced the graph below.<\/p>\n<p><a href=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/scatter3-e1466969028457.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-5141\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/06\/scatter3-e1466969028457.png\" alt=\"scatter3\" width=\"450\" height=\"357\" \/><\/a>Now, this graph isn&#8217;t fancy but it serves its purpose, which is to show me that there IS in fact a correlation of mean correct and the number of problems attempted. Look at that graph a minute and tell me that you don&#8217;t see a linear trend &#8211; but it is pulled off by the line of 1.0 at the far end.<\/p>\n<p>This did NOT fit my preconceived notion, though, that the lack of correlation was due to the players who played once, and so there would be a bunch of people who had answered 1 or 2 questions and got 100% of them correct. Actually, those 100-percenters were all over the distribution in terms of number of problems attempted.<\/p>\n<p>This reminds me of a great quote by Isaac Asimov,<\/p>\n<blockquote><p>The most exciting phrase to hear in science, the one that heralds new discoveries, is not &#8216;Eureka!&#8217; (I found it!) but &#8216;That&#8217;s funny &#8230;&#8217;<\/p><\/blockquote>\n<p>Well, we shall see, as our analysis continues &#8230;<\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p><a href=\"http:\/\/www.7generationgames.com\/buy\/\">Want to see these data at the source?<\/a><\/p>\n<p><a href=\"http:\/\/www.7generationgames.com\/buy\/\">Check out our game, playable on Mac or Windows. Download Spirit Lake or Fish Lake\u00a0 to play, or for Forgotten Trail, just click on the link provided, no download required.<\/a><\/p>\n<p><a href=\"http:\/\/www.7generationgames.com\/buy\/\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-5027\" src=\"http:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/03\/mom_background.jpg\" alt=\"Mom and kid\" width=\"450\" height=\"300\" srcset=\"https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/03\/mom_background.jpg 450w, https:\/\/www.thejuliagroup.com\/blog\/wp-content\/uploads\/2016\/03\/mom_background-300x200.jpg 300w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p>You can also follow the link above to donate a copy of the game to a school or give as a gift.<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In the last post, I used SAS Enterprise Guide to filter out a couple of &#8216;bad&#8217; records that came from test data, then I created a summary table of the number of questions answered and the percentage correct. Then, I calculated the mean percentage correct for the\u00a0\u00a0around 84%. That seemed a bit high to me&#8230;.<\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"_kad_post_classname":"","_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[9,11,8],"tags":[],"class_list":["post-5135","post","type-post","status-publish","format-standard","hentry","category-software","category-statistics","category-technology"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/posts\/5135","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/comments?post=5135"}],"version-history":[{"count":4,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/posts\/5135\/revisions"}],"predecessor-version":[{"id":5144,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/posts\/5135\/revisions\/5144"}],"wp:attachment":[{"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/media?parent=5135"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/categories?post=5135"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.thejuliagroup.com\/blog\/wp-json\/wp\/v2\/tags?post=5135"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}