Open In Colab   Open in Kaggle

Project title

Content creators: Names & Surnames

Content reviewers: Names & Surnames

Content editors: Konstantinos Tsafatinos

Production editors: Konstantinos Tsafatinos

Our 2024 Sponsors: Sponsors


The notebook should have the following outline:

  • Project background (video + template outline)

  • Imports and functions common to multiple data sources

  • Data source 1

  • Data source background

  • Imports and functions common to multiple datasets from the same data source

  • Dataset 1:

    Dataset background (if needed)

    Imports and functions specific to the dataset

    Code to load the dataset

    Code to pre-process the dataset (if not taught and likely difficult for students)

    Code to visualise the data table shape/content

  • Dataset N: same as for Dataset 1 …

  • Further reading


We encourage you to explain the key parts of your code through comments and text cells as you see fit.


If the same or analogous dataset is covered in the curriculum tutorials, we can expect students to know how to load that data. In that case, please provide only the dataset background and refer students to the respective curriculum day while maintaining the structural hierarchy of the notebook.


If you wish to include any plots in your template, those should follow the standard figure formatting. Please include the Figure settings cell from the tutorial notebook template. Let us know if you have any questions about this or are unsure how to proceed!

To include videos:

# @title Project Background

from ipywidgets import widgets
from IPython.display import YouTubeVideo
from IPython.display import IFrame
from IPython.display import display


class PlayVideo(IFrame):
  def __init__(self, id, source, page=1, width=400, height=300, **kwargs):
    self.id = id
    if source == 'Bilibili':
      src = f'https://player.bilibili.com/player.html?bvid={id}&page={page}'
    elif source == 'Osf':
      src = f'https://mfr.ca-1.osf.io/render?url=https://osf.io/download/{id}/?direct%26mode=render'
    super(PlayVideo, self).__init__(src, width, height, **kwargs)


def display_videos(video_ids, W=400, H=300, fs=1):
  tab_contents = []
  for i, video_id in enumerate(video_ids):
    out = widgets.Output()
    with out:
      if video_ids[i][0] == 'Youtube':
        video = YouTubeVideo(id=video_ids[i][1], width=W,
                             height=H, fs=fs, rel=0)
        print(f'Video available at https://youtube.com/watch?v={video.id}')
      else:
        video = PlayVideo(id=video_ids[i][1], source=video_ids[i][0], width=W,
                          height=H, fs=fs, autoplay=False)
        if video_ids[i][0] == 'Bilibili':
          print(f'Video available at https://www.bilibili.com/video/{video.id}')
        elif video_ids[i][0] == 'Osf':
          print(f'Video available at https://osf.io/{video.id}')  
      display(video)
    tab_contents.append(out)
  return tab_contents


video_ids = [('Youtube', '<video_id_1>'), ('Bilibili', '<video_id_2>'), ('Osf', '<video_id_3>')]
tab_contents = display_videos(video_ids, W=854, H=480)
tabs = widgets.Tab()
tabs.children = tab_contents
for i in range(len(tab_contents)):
  tabs.set_title(i, video_ids[i][0])
display(tabs)

To include slides:

# @title Tutorial slides
# @markdown These are the slides for the videos in all tutorials today

from IPython.display import IFrame
#link_id = "n7wdy"

Provide a description of what students will do in the project

Project Template#

[You can add an image with the main questions/project goals: https://raw.githubusercontent.com/ClimateMatchAcademy/course-content/main/projects/template-images/heatwaves_template_map.svg]

Data Exploration Notebook#

Project Setup#

# google colab installs

# !mamaba install netCDF4
# Imports

#Import only the libraries/objects that are necessary for more than one dataset.
#Dataset-specific imports should be in the respective notebook section.

#If any external library has to be installed, !pip install library --quiet
#follow this order: numpy>matplotlib.
#import widgets in hidden Figure settings cell

import random
import numpy as np
import matplotlib.pyplot as plt
# @title Helper functions

# If any helper functions you want to hide for clarity (that has been seen before
# or is simple/uniformative), add here
# If helper code depends on libraries that aren't used elsewhere,
# import those libaries here, rather than in the main import cell
# @title Figure settings

import ipywidgets as widgets       # interactive display
%config InlineBackend.figure_format = 'retina'
plt.style.use("https://raw.githubusercontent.com/ClimateMatchAcademy/course-content/main/cma.mplstyle")

[datasource-1]#

Data source description. You may provide some background that motivated the generation of the associated datasets using hyperlinks to reference relevant papers. Please outline how the data has been collected or generated! Remember to provide credit to dataset authors in line with their copyright policy!

# Data source-specific imports
# Data source-specific functions

[datasource-1]: Dataset 1:#

Further background on the particular dataset if applicable.

# Dataset-specific imports
# Dataset-specific functions
# Code to retrieve and load the data

Give advice on the required preprocessing. If applicable, refer to specific tutorials and let the students preprocess the data themselves. Otherwise, explain/illustrate the reasons for preprocessing and provide either the code itself (if students are likely to struggle on their own) or written guidance on the functions, data filtering thresholds, etc they should use.

# Code to preprocess data

We can now visualize the content of the dataset.

# Code to print the shape, array names, etc of the dataset

Consider adding some encouraging words along the lines of “Now you are all set to address the questions you are interested in!”

[datasource 1]: Dataset 2:#

Further background on the particular datafile/variables if applicable.

[datasource 2]:#

Data source description. You may provide some background that motivated the generation of the associated datasets using hyperlinks to reference relevant papers. Please outline how the data has been collected or generated! Remember to provide credit to dataset authors in line with their copyright policy!

Further Reading#

Use bullet points to list further readings