European Cloud Cover
A dataset containing reanalysis data from ERA5 and satellite retrievals from METeosat Second Generation.
Climate change is stated as one of the biggest challenges of our time resulting in many unwanted effects. The response of cloud fractional cover (CFC), i.e. the portion of the sky covered by clouds, to future climate is associated with high uncertainties. CFC will affect the rate of global warming and different parts of the society such as agriculture and solar energy production.
However unfortunately, projection of future CFC is challenging. Here we present the European Cloud Cover dataset that connects variables that usually are well projected by numerical climate models (temperature, pressure and specific and relative humidity) and CFC. The dataset can thus be used to potentially improve the projections of future CFC and again improve projection of future global warming and other climate effects. The data to compute the CFC are obtained from the The European Organisation for the Exploitation of Meteorological Satellites data portal and the other variables from European Centre for Medium-Range Weather Forecasts.
Given the large amounts of data (17 GB) with high spatial and temporal resolution and the complexities of cloud formations, we believe that machine learning can be useful to learn the relation between CRC and the other variables. To the best of our knowledge this is the first dataset of this kind.
Dataset Details
KvasirCapsule-SEG is an enchanced subset of Kvasir-Capsule which contains polyp images, their segmentation ground truth and bounding box information. The polyp class of Kvasir-Capsule has only 55 images which we annotated with the help of an expert gastroenterologist. As can be seen in the figure below, we have generated both ground truth and bounding box information with separate folders for images, ground truth, and images with bounding boxes. Examples of polyps and their corresponding masks from KvasirCapsule-SEG can be found here.
Suggested Evaluation Metrics
For evaluation purposes, we suggest standard computer vision metrics such as dice coefficient (DSC), mean intersection over union (mIoU), Precision, Recall, Specificity, Accuracy, and FPS. A detailed description of these metrics can be found in our paper.
Download
File | Description | Size | Download |
---|---|---|---|
ecc.zip | The ecc.zip archive containing the dataset. | 17GB |
All files can be viewed and downloaded in our OSF repository available here: https://osf.io/kqdgx
Cite
If you use this dataset in your research, Please cite the following paper:
@article{svennevik2021prediction,
title={Prediction of cloud fractional cover using machine learning},
author={
Svennevik, Hanna and Riegler, Michael A and Hicks, Steven and
Storelvmo, Trude and Hammer, Hugo L
},
journal={Big Data and Cognitive Computing},
volume={5},
number={4},
pages={62},
year={2021},
publisher={Multidisciplinary Digital Publishing Institute}
}
Terms of use
To download the dataset you must agree that the dataset will only be used for personal, educational or research purposes. More detailed information about terms of use is given here:
Terms of use: https://eoportal.eumetsat.int/userMgmt/terms.faces
Data Usage: https://eoportal.eumetsat.int/userMgmt/dataUsageHelp.faces
Contact
Email michael (at) simula (dot) no if you have any questions about the dataset and our research activities. We always welcome collaboration and joint research!