Project title
Content creators: Names & Surnames
Content reviewers: Names & Surnames
Content editors: Konstantinos Tsafatinos
Production editors: Konstantinos Tsafatinos
Our 2024 Sponsors: Sponsors
The notebook should have the following outline:
Project background (video + template outline)
Imports and functions common to multiple data sources
Data source 1
Data source background
Imports and functions common to multiple datasets from the same data source
Dataset 1:
Dataset background (if needed)
Imports and functions specific to the dataset
Code to load the dataset
Code to pre-process the dataset (if not taught and likely difficult for students)
Code to visualise the data table shape/content
…
Dataset N: same as for Dataset 1 …
Further reading
We encourage you to explain the key parts of your code through comments and text cells as you see fit.
If the same or analogous dataset is covered in the curriculum tutorials, we can expect students to know how to load that data. In that case, please provide only the dataset background and refer students to the respective curriculum day while maintaining the structural hierarchy of the notebook.
If you wish to include any plots in your template, those should follow the standard figure formatting. Please include the Figure settings cell from the tutorial notebook template. Let us know if you have any questions about this or are unsure how to proceed!
To include videos:
# @title Project Background
from ipywidgets import widgets
from IPython.display import YouTubeVideo
from IPython.display import IFrame
from IPython.display import display
class PlayVideo(IFrame):
def __init__(self, id, source, page=1, width=400, height=300, **kwargs):
self.id = id
if source == 'Bilibili':
src = f'https://player.bilibili.com/player.html?bvid={id}&page={page}'
elif source == 'Osf':
src = f'https://mfr.ca-1.osf.io/render?url=https://osf.io/download/{id}/?direct%26mode=render'
super(PlayVideo, self).__init__(src, width, height, **kwargs)
def display_videos(video_ids, W=400, H=300, fs=1):
tab_contents = []
for i, video_id in enumerate(video_ids):
out = widgets.Output()
with out:
if video_ids[i][0] == 'Youtube':
video = YouTubeVideo(id=video_ids[i][1], width=W,
height=H, fs=fs, rel=0)
print(f'Video available at https://youtube.com/watch?v={video.id}')
else:
video = PlayVideo(id=video_ids[i][1], source=video_ids[i][0], width=W,
height=H, fs=fs, autoplay=False)
if video_ids[i][0] == 'Bilibili':
print(f'Video available at https://www.bilibili.com/video/{video.id}')
elif video_ids[i][0] == 'Osf':
print(f'Video available at https://osf.io/{video.id}')
display(video)
tab_contents.append(out)
return tab_contents
video_ids = [('Youtube', '<video_id_1>'), ('Bilibili', '<video_id_2>'), ('Osf', '<video_id_3>')]
tab_contents = display_videos(video_ids, W=854, H=480)
tabs = widgets.Tab()
tabs.children = tab_contents
for i in range(len(tab_contents)):
tabs.set_title(i, video_ids[i][0])
display(tabs)
To include slides:
# @title Tutorial slides
# @markdown These are the slides for the videos in all tutorials today
from IPython.display import IFrame
#link_id = "n7wdy"
Provide a description of what students will do in the project
Project Template#
[You can add an image with the main questions/project goals: https://raw.githubusercontent.com/ClimateMatchAcademy/course-content/main/projects/template-images/heatwaves_template_map.svg]
Data Exploration Notebook#
Project Setup#
# google colab installs
# !mamaba install netCDF4
# Imports
#Import only the libraries/objects that are necessary for more than one dataset.
#Dataset-specific imports should be in the respective notebook section.
#If any external library has to be installed, !pip install library --quiet
#follow this order: numpy>matplotlib.
#import widgets in hidden Figure settings cell
import random
import numpy as np
import matplotlib.pyplot as plt
# @title Helper functions
# If any helper functions you want to hide for clarity (that has been seen before
# or is simple/uniformative), add here
# If helper code depends on libraries that aren't used elsewhere,
# import those libaries here, rather than in the main import cell
# @title Figure settings
import ipywidgets as widgets # interactive display
%config InlineBackend.figure_format = 'retina'
plt.style.use("https://raw.githubusercontent.com/ClimateMatchAcademy/course-content/main/cma.mplstyle")
[datasource-1]#
Data source description. You may provide some background that motivated the generation of the associated datasets using hyperlinks to reference relevant papers. Please outline how the data has been collected or generated! Remember to provide credit to dataset authors in line with their copyright policy!
# Data source-specific imports
# Data source-specific functions
[datasource-1]: Dataset 1:#
Further background on the particular dataset if applicable.
# Dataset-specific imports
# Dataset-specific functions
# Code to retrieve and load the data
Give advice on the required preprocessing. If applicable, refer to specific tutorials and let the students preprocess the data themselves. Otherwise, explain/illustrate the reasons for preprocessing and provide either the code itself (if students are likely to struggle on their own) or written guidance on the functions, data filtering thresholds, etc they should use.
# Code to preprocess data
We can now visualize the content of the dataset.
# Code to print the shape, array names, etc of the dataset
Consider adding some encouraging words along the lines of “Now you are all set to address the questions you are interested in!”
[datasource 1]: Dataset 2:#
Further background on the particular datafile/variables if applicable.
[datasource 2]:#
Data source description. You may provide some background that motivated the generation of the associated datasets using hyperlinks to reference relevant papers. Please outline how the data has been collected or generated! Remember to provide credit to dataset authors in line with their copyright policy!
Further Reading#
Use bullet points to list further readings