{"id":1062,"date":"2023-07-03T01:17:55","date_gmt":"2023-07-03T00:17:55","guid":{"rendered":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/?page_id=1062"},"modified":"2023-07-03T01:17:55","modified_gmt":"2023-07-03T00:17:55","slug":"abstract186","status":"publish","type":"page","link":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/","title":{"rendered":"Abstract 186"},"content":{"rendered":"<p>[vc_row][vc_column][vc_column_text]<strong>Abstract ID:<\/strong> 186<\/p>\n<h2 style=\"text-align: center\">Introducing the AI4S2S project; open-source python packages to make data-driven pipelines for S2S forecasting more efficient, transparent, and scalable<\/h2>\n<p style=\"text-align: center\"><span data-contrast=\"auto\"><strong>Lead Author:<\/strong> Sem Vijverberg<br \/>\nVrije Universiteit, Institute for Environmental Studies, Netherlands<br \/>\n<\/span><\/p>\n<p><strong>Keywords:<\/strong> open-source software, Machine Learning, S2S forecasting<\/p>\n<p><strong>Abstract: <\/strong>Reliable S2S forecasts remain a huge scientific challenge. The lead-time is too long such that the memory from the atmosphere\u2019s initial condition is lost, but too short for the atmosphere\u2019s boundary conditions to be felt strongly. Only for specific \u2018windows of predictability\u2019 (i.e. specific regions, timescales and climatic background states), skillful forecasts are possible, in an otherwise largely unpredictable future. The interest in machine learning (ML) is growing fast due to a number of successes in S2S forecasting. However, we argue there is a need for more standardization, consensus on best practices, higher efficiency, and higher reproducibility. Typical S2S ML use-cases, such as (1) pure statistical forecasting based on observations, (2) transfer learning, and (3) post-processing of dynamical model ensembles, require a large coding and preprocessing effort. Such experiments are not trivial to set up, and without sufficient experience and expertise there is a large risk of improper cross-validation and\/or improper and non-standard verification.<\/p>\n<p>Within a 3-year project, a dedicated team of software engineers and researchers are working on light-weight Python packages that make the construction of ML-based pipelines for S2S forecasting much more efficient, transparent, and scalable.<\/p>\n<p>We developed the python package \u201clilio\u201d to handle user-defined sequences of precursor and target periods, to be able to reliably and repeatedly resample raw input data to these periods. The \u201cs2spy\u201d package continues where \u201clilio\u201d leaves off, and facilitates orchestrating full S2S machine learning pipelines, from preprocessing and cross-validation, to dimensionality reduction, model fitting and model interpretation. Flexibility for the user is an important pillar of s2spy, leaving as much flexibility as possible for the user to insert their own new methods to forecast and\/or reduce dimensionality. Once such a clean ML pipeline has been designed, it becomes both more transparent, reproducible, as well as easily scalable to any HPC system and climate data platform.<\/p>\n<p>The AI4S2S project aims to contribute to a higher reproducibility and works towards a wider acceptance of ML standards and best practices. We will present our vision and the capabilities of our package, show-casing that we can build a model from raw climate data up to verification in only a few lines of code.<\/p>\n<p><strong>Co-authors:<br \/>\n<\/strong>Bart Schilperoort (Netherlands eScience Center)<br \/>\nYang Liu (Netherlands eScience Center)<br \/>\nJannes van Ingen (Vrije Universiteit, Institute for Environmental Studies)<br \/>\nPeter Kalverla (Netherlands eScience Center)<br \/>\nFakhereh (Sarah) Alidoost (Netherlands eScience Center)<br \/>\nDim Coumou (Vrije Universiteit, Institute for Environmental Studies)[\/vc_column_text][vc_separator][\/vc_column][\/vc_row][vc_row][vc_column width=&#8221;1\/6&#8243;][vc_single_image image=&#8221;344&#8243; img_size=&#8221;full&#8221; onclick=&#8221;custom_link&#8221; link=&#8221;http:\/\/s2sprediction.net\/&#8221;][\/vc_column][vc_column width=&#8221;1\/6&#8243;][vc_single_image image=&#8221;345&#8243; img_size=&#8221;full&#8221; onclick=&#8221;custom_link&#8221; link=&#8221;https:\/\/public.wmo.int\/en&#8221;][\/vc_column][vc_column width=&#8221;1\/6&#8243;][vc_single_image image=&#8221;346&#8243; img_size=&#8221;full&#8221; onclick=&#8221;custom_link&#8221; link=&#8221;https:\/\/community.wmo.int\/activity-areas\/wwrp&#8221;][\/vc_column][vc_column width=&#8221;1\/6&#8243;][vc_single_image image=&#8221;347&#8243; img_size=&#8221;full&#8221; onclick=&#8221;custom_link&#8221; link=&#8221;https:\/\/www.wcrp-climate.org\/&#8221;][\/vc_column][vc_column width=&#8221;1\/6&#8243;][vc_single_image image=&#8221;348&#8243; img_size=&#8221;full&#8221;][\/vc_column][vc_column width=&#8221;1\/6&#8243;][vc_single_image image=&#8221;349&#8243; img_size=&#8221;full&#8221; onclick=&#8221;custom_link&#8221; link=&#8221;https:\/\/www.reading.ac.uk\/&#8221;][\/vc_column][\/vc_row]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>[vc_row][vc_column][vc_column_text]Abstract ID: 186 Introducing the AI4S2S project; open-source python packages to make data-driven pipelines for S2S forecasting more efficient, transparent, and scalable Lead Author: Sem Vijverberg Vrije Universiteit, Institute for&#8230;<a class=\"read-more\" href=\"&#104;&#116;&#116;&#112;&#115;&#58;&#47;&#47;&#114;&#101;&#115;&#101;&#97;&#114;&#99;&#104;&#46;&#114;&#101;&#97;&#100;&#105;&#110;&#103;&#46;&#97;&#99;&#46;&#117;&#107;&#47;&#115;&#50;&#115;&#45;&#115;&#117;&#109;&#109;&#105;&#116;&#50;&#48;&#50;&#51;&#47;&#112;&#114;&#111;&#103;&#114;&#97;&#109;&#109;&#101;&#47;&#97;&#98;&#115;&#116;&#114;&#97;&#99;&#116;&#49;&#56;&#54;&#47;\">Read More ><\/a><\/p>\n","protected":false},"author":145,"featured_media":0,"parent":528,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"__cvm_playback_settings":[],"__cvm_video_id":"","footnotes":""},"coauthors":[13],"class_list":["post-1062","page","type-page","status-publish","hentry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.8.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Abstract 186 - WWRP\/WCRP S2S Summit 2023<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Abstract 186 - WWRP\/WCRP S2S Summit 2023\" \/>\n<meta property=\"og:description\" content=\"[vc_row][vc_column][vc_column_text]Abstract ID: 186 Introducing the AI4S2S project; open-source python packages to make data-driven pipelines for S2S forecasting more efficient, transparent, and scalable Lead Author: Sem Vijverberg Vrije Universiteit, Institute for...Read More &gt;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/\" \/>\n<meta property=\"og:site_name\" content=\"WWRP\/WCRP S2S Summit 2023\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"3 minutes\" \/>\n\t<meta name=\"twitter:label2\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data2\" content=\"Robert Lee\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/\",\"url\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/\",\"name\":\"Abstract 186 - WWRP\/WCRP S2S Summit 2023\",\"isPartOf\":{\"@id\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/#website\"},\"datePublished\":\"2023-07-03T00:17:55+00:00\",\"dateModified\":\"2023-07-03T00:17:55+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Programme\",\"item\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Abstract 186\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/#website\",\"url\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/\",\"name\":\"WWRP\/WCRP S2S Summit 2023\",\"description\":\"Advancing Sub-seasonal to Seasonal Predictions and their Applications\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/research.reading.ac.uk\/s2s-summit2023\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-GB\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Abstract 186 - WWRP\/WCRP S2S Summit 2023","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/","og_locale":"en_GB","og_type":"article","og_title":"Abstract 186 - WWRP\/WCRP S2S Summit 2023","og_description":"[vc_row][vc_column][vc_column_text]Abstract ID: 186 Introducing the AI4S2S project; open-source python packages to make data-driven pipelines for S2S forecasting more efficient, transparent, and scalable Lead Author: Sem Vijverberg Vrije Universiteit, Institute for...Read More >","og_url":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/","og_site_name":"WWRP\/WCRP S2S Summit 2023","twitter_card":"summary_large_image","twitter_misc":{"Estimated reading time":"3 minutes","Written by":"Robert Lee"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/","url":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/","name":"Abstract 186 - WWRP\/WCRP S2S Summit 2023","isPartOf":{"@id":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/#website"},"datePublished":"2023-07-03T00:17:55+00:00","dateModified":"2023-07-03T00:17:55+00:00","breadcrumb":{"@id":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/abstract186\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/"},{"@type":"ListItem","position":2,"name":"Programme","item":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/programme\/"},{"@type":"ListItem","position":3,"name":"Abstract 186"}]},{"@type":"WebSite","@id":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/#website","url":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/","name":"WWRP\/WCRP S2S Summit 2023","description":"Advancing Sub-seasonal to Seasonal Predictions and their Applications","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-GB"}]}},"_links":{"self":[{"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/pages\/1062","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/users\/145"}],"replies":[{"embeddable":true,"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/comments?post=1062"}],"version-history":[{"count":2,"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/pages\/1062\/revisions"}],"predecessor-version":[{"id":1091,"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/pages\/1062\/revisions\/1091"}],"up":[{"embeddable":true,"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/pages\/528"}],"wp:attachment":[{"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/media?parent=1062"}],"wp:term":[{"taxonomy":"author","embeddable":true,"href":"https:\/\/research.reading.ac.uk\/s2s-summit2023\/wp-json\/wp\/v2\/coauthors?post=1062"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}