{"id":4683,"date":"2019-06-03T18:39:17","date_gmt":"2019-06-03T17:39:17","guid":{"rendered":"https:\/\/www.blopig.com\/blog\/?p=4683"},"modified":"2019-06-03T18:43:54","modified_gmt":"2019-06-03T17:43:54","slug":"oxford-maths-festival-19","status":"publish","type":"post","link":"https:\/\/www.blopig.com\/blog\/2019\/06\/oxford-maths-festival-19\/","title":{"rendered":"Oxford Maths Festival \u201819"},"content":{"rendered":"\n<p>The Oxford Maths Festival returned this year and it was tons of fun, at least for this volunteer! I failed to take pictures, but a few opiglets were involved: Flo and company took their <a href=\"https:\/\/www.blopig.com\/blog\/2019\/05\/dimensions-the-mathematics-of-symmetry-and-space\/\">VR work<\/a> for the Ashmolean Dimensions exhibit and demonstrated it at Templars Square, and Conor did a spectacular job pretending to be a police constable for the maths escape room.<\/p>\n\n\n\n<p>Last year Mark <a href=\"https:\/\/www.blopig.com\/blog\/2018\/06\/opig-at-the-oxford-maths-festival\/\">blogged<\/a> about how we demonstrated the German Tank Problem at the festival. I thought this time round I\u2019d share another of the Mathematical Mayhem activities: a game illustrating biased sampling.<\/p>\n\n\n\n<!--more-->\n\n\n\n<h2 class=\"wp-block-heading\"><strong>The problem<\/strong><\/h2>\n\n\n\n<p>Complete, error-free data is the dream, but in reality we do sampling <em>all the time<\/em>. More often than not we happily convince ourselves that the samples we work with are representative of the whole unobserved population, but that&#8217;s not always the case.<\/p>\n\n\n\n<p>Suppose we want to know something, e.g. how much exercise the average opiglet does. Rather than stalking each and every group member (ill-advised, and also kind of illegal), we can just ask a bunch of them about their exercise habits and base our estimate on the responses. There is a slight issue, however. 
If you\u2019re like one of the sportier members of our group, you might be <a href=\"https:\/\/www.blopig.com\/blog\/2017\/07\/a-day-in-the-life-of-a-dphil-student-that-also-rows-for-oxford\/\">quite forthcoming<\/a> about your daily routine. And if you\u2019re a lazy so-and-so like myself, your responses might be a little more vague, or altogether absent. People who are keen to talk about exercise also tend to do more exercise, thereby making our sample biased. <\/p>\n\n\n\n<p>If we want to estimate how much exercise the average opiglet does, we should probably go a little under our observed sample average. Or <em>a lot<\/em> under if our data is collected exclusively from Blopig posts by university rowers.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>The activity<\/strong><\/h2>\n\n\n\n<p>Sample bias can arise due to a range of reasons, including self-selection (i.e. who answers our questions) and inspection (i.e. who we choose to question). It\u2019s inspection bias that we looked at during the Maths Festival. We presented each participant with a non-transparent cloth pouch with twenty-five marbles in it. There were two types of marbles in the bag: the majority were of your standard small glass variety, but a few were significantly larger and heavier. The aim was to guess their total weight.<\/p>\n\n\n\n<p>Participants were allowed to draw five marbles out of the bag and weigh them. They could take a number of such samples before making their guess. Since at any one point we weigh a fifth of the marbles, the obvious thing to do is to estimate the total weight as five times the observed sample weight (or five times the average of a few observations). If the samples were fair, that would certainly be a good tactic.<\/p>\n\n\n\n<p>However, when people tried this they often came up with numbers that were far too high to be even vaguely plausible&#8212;sometimes as much as three times the actual weight of the bag. Did they get their multiplication wrong? 
Were the scales broken? What inevitably happened&#8212;even to me, and I knew the ratio of large to small marbles in the bag&#8212;was that people took out more than their fair share of large marbles, resulting in uncharacteristically heavy samples.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>The take-home message<\/strong><\/h2>\n\n\n\n<p>So how do we make more accurate guesses? There are two pieces of information we need: first, the weight of each type of marble, and second, the ratio of small to large marbles in the bag. Resampling can help with both of these.<\/p>\n\n\n\n<p>It only takes two different samples to figure out the individual marble weights, which most of our participants immediately set out to do. Guessing the ratio is a little trickier, and relates to population size estimation.<\/p>\n\n\n\n<p>Suppose that you draw three or four samples, and you pay attention to the large marbles only. You might make a note of what their colours are, or you might cheat a little and mark them with a Sharpie the first time you draw them. If you draw the same two or three large marbles again and again, then there probably aren\u2019t all that many large marbles in the bag. However, if each of your samples contains totally different large marbles, you\u2019d guess their overall number is higher. This line of thinking is formalised by a method called \u201cmark and recapture\u201d. It&#8217;s employed by ecologists, who tag animals in order to estimate and track population sizes. If you want to read more about it, you could go to <a href=\"https:\/\/en.wikipedia.org\/wiki\/Mark_and_recapture\">Wikipedia<\/a>, or you could check out the much more amusing <a href=\"https:\/\/www.goodreads.com\/book\/show\/704170.Do_You_Feel_Lucky_\">Do you feel lucky?<\/a> explanation. Once you\u2019ve guessed how many big marbles there are in the bag, estimating the total weight is easy. 
You just need a calculator or a patient, numerically literate friend.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Final comments<\/strong><\/h2>\n\n\n\n<p>In real life, bags are often of unknown size, marbles may all have different weights, and the source of bias might not be as obvious as in the example above. This is an incredibly common, yet at the same time often unintuitive, problem we face in data analysis. I liked the marble exercise because it showcased biased sampling in a clear, tangible way, and let people come up with their own workarounds.<\/p>\n\n\n\n<p>The Maths Festival, as I mentioned at the start, involved a wide range of other activities. If you\u2019re curious, you can check out the <a href=\"https:\/\/mathsfest.web.ox.ac.uk\/home\">website<\/a>, or look them up on your favourite social media platform (provided you&#8217;re not too hipster and your favourite is either <a href=\"https:\/\/www.facebook.com\/OxfordMathsFestival\/\">Facebook<\/a> or <a href=\"https:\/\/twitter.com\/OxMathsFest\">Twitter<\/a>). I look forward to seeing what the festival looks like next year!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Oxford Maths Festival returned this year and it was tons of fun, at least for this volunteer! 
I failed to take pictures, but a few opiglets were involved: Flo and company took their VR work for the Ashmolean Dimensions exhibit and demonstrated it at Templars Square, and Conor did a spectacular job pretending to [&hellip;]<\/p>\n","protected":false},"author":40,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"nf_dc_page":"","wikipediapreview_detectlinks":true,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"ngg_post_thumbnail":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[190],"tags":[],"ppma_author":[529],"class_list":["post-4683","post","type-post","status-publish","format-standard","hentry","category-public-outreach"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"authors":[{"term_id":529,"user_id":40,"is_guest":0,"slug":"lyuba","display_name":"Lyuba","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/e10fff614041616d2611ef048154e92d6d6a208f36f8634b41a9abcec271c8c1?s=96&d=mm&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/posts\/4683","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/users\/40"}],"replies":[{"embeddable":true,"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/comments?post=4683"}],"version-history":[{"count":5,"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/posts\/4683\/revisions"}],"predecessor-version":[{"id":4701,"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/posts\/4683\/revisions\/4701"}],"wp:attachment":[{"href":"https:\/\/www.blopig.com\/blog\/wp-j
son\/wp\/v2\/media?parent=4683"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/categories?post=4683"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/tags?post=4683"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.blopig.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=4683"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}