{"id":1227,"date":"2024-11-06T09:22:00","date_gmt":"2024-11-06T09:22:00","guid":{"rendered":"https:\/\/research.reading.ac.uk\/palaeoclimate\/?p=1227"},"modified":"2024-11-06T09:27:25","modified_gmt":"2024-11-06T09:27:25","slug":"digging-into-variable-selection-methods-team-training-session","status":"publish","type":"post","link":"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/","title":{"rendered":"Digging into variable selection methods: Team training session"},"content":{"rendered":"<p>One of the benefits of belonging to a large research group such as SPECIAL, is the opportunity to draw upon, and learn from, the skills that other members of the research group possess. At the <a href=\"https:\/\/research.reading.ac.uk\/palaeoclimate\/projects\/\">LEMONTREE-Leverhulme Project\u2019s<\/a> most recent Fire-Vegetation Interactions team meeting the group heard from SPECIAL Group PhD student <a href=\"https:\/\/research.reading.ac.uk\/palaeoclimate\/meet-the-team\/\">Theo Keeping<\/a> about the adapted GLM method he built during his PhD.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1228\" src=\"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-content\/uploads\/sites\/78\/2024\/11\/Theo_Paper_ContiguousUS.png\" alt=\"\" width=\"752\" height=\"382\" srcset=\"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-content\/uploads\/sites\/78\/2024\/11\/Theo_Paper_ContiguousUS.png 752w, https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-content\/uploads\/sites\/78\/2024\/11\/Theo_Paper_ContiguousUS-300x152.png 300w\" sizes=\"auto, (max-width: 752px) 100vw, 752px\" \/><\/p>\n<p><em>Figure 1: Theo\u2019s recent publication that outlines his variable selection method.<\/em><\/p>\n<p>Variable selection is a crucial step in building robust statistical models, particularly in generalized linear models (GLMs). Theo\u2019s method takes an iterative approach by adding one variable at a time to the model then employing both forward and backward selection strategies.<\/p>\n<p>Major steps of the process:<\/p>\n<ol>\n<li><strong>Define your predictors:<\/strong> it is important that you have a hypothesis behind how these might impact your response variable!<\/li>\n<li><strong>Fit a model to one variable,<\/strong> ensuring that your chosen model distribution makes sense for your data.<\/li>\n<li><strong>Use various model assessment tools<\/strong> such as AIC values to choose the best model.<\/li>\n<li>Assess using more model assessment tools whether <strong>replacing any existing variables<\/strong> with unused variables produces a more parsimonious model.<\/li>\n<li>Once you have your chosen set of predictors \u00e0 <strong>optimise the domains of those variables <\/strong>by clipping them to improve the model fit further.<\/li>\n<li>Finally, consider the need to <strong>minimise GLM smearing<\/strong>. Potentially apply a transformation to address this.<\/li>\n<\/ol>\n<p>This iterative process avoids the need to run all possible model permutations to find the global minimum of the model space, allowing researchers to systematically explore variables without overwhelming computational power or time. It\u2019s a practical solution that balances thorough exploration with efficiency, ensuring high-quality model selection.<\/p>\n<p><strong>Who Can Benefit from This Approach in Our Lab<\/strong><\/p>\n<p>This variable selection method has become especially relevant to two members of our research team, Yicheng Shen and Connor Mackenzie, who are currently building GLM models to study fire patterns. In their work, understanding the relationships between environmental variables, whether that be leaf traits or human predictors, and fire incidence is critical. Learning about the importance of thoughtful statistical choices, including variable selection, has enhanced their ability to approach their modelling more systematically. The iterative nature of the method helps them avoid common pitfalls in model building, such as overfitting.<\/p>\n<p><strong>Key Takeaways for GLM Model Building<\/strong><\/p>\n<p>One of the most important takeaways from this session was the importance of knowing your model space. Having a clear understanding of not only what your predictors are, but the relationship you hypothesise they might have, is crucial to avoid fishing around for a good model fit. The relationships between response and predictor variables will not only define any transformations you make, but the distribution you choose, and the link function associated. Developing a more in-depth understanding of these statistical processes will allow researchers to create models that are rooted in their hypotheses.<\/p>\n<p><strong>Learn More in the Manuscript<\/strong><\/p>\n<p>For those interested in a deeper dive into this method and its applications, all of this information can be found in greater detail in Theo\u2019s paper. A big thank you to Theo, on behalf of the SPECIAL Group, for helping us all to understand this process in greater detail.<\/p>\n<p>Keeping, T., Harrison, S.P., Prentice, I.C. Modelling the daily probability of wildfire occurrence in the contiguous United States. 2024.\u00a0<em>Environmental Research Letters,<\/em> 19: 024036,\u00a0<a href=\"https:\/\/doi.org\/10.1088\/1748-9326\/ad21b0\"> https:\/\/doi.org\/10.1088\/1748-9326\/ad21b0<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>One of the benefits of belonging to a large research group such as SPECIAL, is the opportunity to draw upon, and learn from, the skills that other members of the&#8230;<a class=\"read-more\" href=\"&#104;&#116;&#116;&#112;&#115;&#58;&#47;&#47;&#114;&#101;&#115;&#101;&#97;&#114;&#99;&#104;&#46;&#114;&#101;&#97;&#100;&#105;&#110;&#103;&#46;&#97;&#99;&#46;&#117;&#107;&#47;&#112;&#97;&#108;&#97;&#101;&#111;&#99;&#108;&#105;&#109;&#97;&#116;&#101;&#47;&#100;&#105;&#103;&#103;&#105;&#110;&#103;&#45;&#105;&#110;&#116;&#111;&#45;&#118;&#97;&#114;&#105;&#97;&#98;&#108;&#101;&#45;&#115;&#101;&#108;&#101;&#99;&#116;&#105;&#111;&#110;&#45;&#109;&#101;&#116;&#104;&#111;&#100;&#115;&#45;&#116;&#101;&#97;&#109;&#45;&#116;&#114;&#97;&#105;&#110;&#105;&#110;&#103;&#45;&#115;&#101;&#115;&#115;&#105;&#111;&#110;&#47;\">Read More ><\/a><\/p>\n","protected":false},"author":959,"featured_media":1228,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"__cvm_playback_settings":[],"__cvm_video_id":"","footnotes":""},"categories":[22],"tags":[],"class_list":["post-1227","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-blog"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.8.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Digging into variable selection methods: Team training session - SPECIAL Palaeoclimate<\/title>\n<meta name=\"description\" content=\"A recent training session run by SPECIAL Group PhD student Theo Keeping allowed other members of the group to learn about variable selection methods.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Digging into variable selection methods: Team training session - SPECIAL Palaeoclimate\" \/>\n<meta property=\"og:description\" content=\"A recent training session run by SPECIAL Group PhD student Theo Keeping allowed other members of the group to learn about variable selection methods.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/\" \/>\n<meta property=\"og:site_name\" content=\"SPECIAL Palaeoclimate\" \/>\n<meta property=\"article:published_time\" content=\"2024-11-06T09:22:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-11-06T09:27:25+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-content\/uploads\/sites\/78\/2024\/11\/Theo_Paper_ContiguousUS.png\" \/>\n\t<meta property=\"og:image:width\" content=\"752\" \/>\n\t<meta property=\"og:image:height\" content=\"382\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Sophia Cain\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sophia Cain\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/\",\"url\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/\",\"name\":\"Digging into variable selection methods: Team training session - SPECIAL Palaeoclimate\",\"isPartOf\":{\"@id\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/#website\"},\"datePublished\":\"2024-11-06T09:22:00+00:00\",\"dateModified\":\"2024-11-06T09:27:25+00:00\",\"author\":{\"@id\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/#\/schema\/person\/6ae1715d9ded9f0caa51878ed800cdc9\"},\"description\":\"A recent training session run by SPECIAL Group PhD student Theo Keeping allowed other members of the group to learn about variable selection methods.\",\"breadcrumb\":{\"@id\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Digging into variable selection methods: Team training session\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/#website\",\"url\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/\",\"name\":\"SPECIAL Palaeoclimate\",\"description\":\"Webpage of Sandy&#039;s PalaeoEnvironments and Climate Analysis research group at the University of Reading (UK)\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/#\/schema\/person\/6ae1715d9ded9f0caa51878ed800cdc9\",\"name\":\"Sophia Cain\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/a4b6b71e445f4365e2434d381d6a4e6733a438e4ce1c59fbbd12a0e11cf74672?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/a4b6b71e445f4365e2434d381d6a4e6733a438e4ce1c59fbbd12a0e11cf74672?s=96&d=mm&r=g\",\"caption\":\"Sophia Cain\"},\"url\":\"https:\/\/research.reading.ac.uk\/palaeoclimate\/author\/kw931720\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Digging into variable selection methods: Team training session - SPECIAL Palaeoclimate","description":"A recent training session run by SPECIAL Group PhD student Theo Keeping allowed other members of the group to learn about variable selection methods.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/","og_locale":"en_GB","og_type":"article","og_title":"Digging into variable selection methods: Team training session - SPECIAL Palaeoclimate","og_description":"A recent training session run by SPECIAL Group PhD student Theo Keeping allowed other members of the group to learn about variable selection methods.","og_url":"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/","og_site_name":"SPECIAL Palaeoclimate","article_published_time":"2024-11-06T09:22:00+00:00","article_modified_time":"2024-11-06T09:27:25+00:00","og_image":[{"width":752,"height":382,"url":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-content\/uploads\/sites\/78\/2024\/11\/Theo_Paper_ContiguousUS.png","type":"image\/png"}],"author":"Sophia Cain","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Sophia Cain","Estimated reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/","url":"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/","name":"Digging into variable selection methods: Team training session - SPECIAL Palaeoclimate","isPartOf":{"@id":"https:\/\/research.reading.ac.uk\/palaeoclimate\/#website"},"datePublished":"2024-11-06T09:22:00+00:00","dateModified":"2024-11-06T09:27:25+00:00","author":{"@id":"https:\/\/research.reading.ac.uk\/palaeoclimate\/#\/schema\/person\/6ae1715d9ded9f0caa51878ed800cdc9"},"description":"A recent training session run by SPECIAL Group PhD student Theo Keeping allowed other members of the group to learn about variable selection methods.","breadcrumb":{"@id":"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/research.reading.ac.uk\/palaeoclimate\/digging-into-variable-selection-methods-team-training-session\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/research.reading.ac.uk\/palaeoclimate\/"},{"@type":"ListItem","position":2,"name":"Digging into variable selection methods: Team training session"}]},{"@type":"WebSite","@id":"https:\/\/research.reading.ac.uk\/palaeoclimate\/#website","url":"https:\/\/research.reading.ac.uk\/palaeoclimate\/","name":"SPECIAL Palaeoclimate","description":"Webpage of Sandy&#039;s PalaeoEnvironments and Climate Analysis research group at the University of Reading (UK)","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/research.reading.ac.uk\/palaeoclimate\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/research.reading.ac.uk\/palaeoclimate\/#\/schema\/person\/6ae1715d9ded9f0caa51878ed800cdc9","name":"Sophia Cain","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/research.reading.ac.uk\/palaeoclimate\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/a4b6b71e445f4365e2434d381d6a4e6733a438e4ce1c59fbbd12a0e11cf74672?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a4b6b71e445f4365e2434d381d6a4e6733a438e4ce1c59fbbd12a0e11cf74672?s=96&d=mm&r=g","caption":"Sophia Cain"},"url":"https:\/\/research.reading.ac.uk\/palaeoclimate\/author\/kw931720\/"}]}},"cc_featured_image_caption":{"caption_text":"","source_text":"","source_url":""},"_links":{"self":[{"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/posts\/1227","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/users\/959"}],"replies":[{"embeddable":true,"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/comments?post=1227"}],"version-history":[{"count":1,"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/posts\/1227\/revisions"}],"predecessor-version":[{"id":1229,"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/posts\/1227\/revisions\/1229"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/media\/1228"}],"wp:attachment":[{"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/media?parent=1227"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/categories?post=1227"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/research.reading.ac.uk\/palaeoclimate\/wp-json\/wp\/v2\/tags?post=1227"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}