Probabilistic Deep Forecast
PyTorch implementation of the paper "Probabilistic Deep Learning to Quantify Uncertainty in Air Quality Forecasting".
Introduction
In this work, we develop a set of deep probabilistic models for air quality forecasting that quantify both aleatoric and epistemic uncertainties, and we study how to represent and manipulate their predictive uncertainties. In particular:

* We conduct a broad empirical comparison and exploratory assessment of state-of-the-art techniques in deep probabilistic learning applied to air quality forecasting. Through exhaustive experiments, we describe training these models and evaluating their predictive uncertainties using various metrics for regression and classification tasks.
* We improve uncertainty estimation using adversarial training to smooth the conditional output distribution locally around training data points.
* We apply uncertainty-aware models that exploit the temporal and spatial correlation inherent in air quality data using recurrent and graph neural networks.
* We introduce a new state-of-the-art example for air quality forecasting by defining the problem setup and selecting proper input features and models.
Decision score as a function of normalized aleatoric and epistemic confidence thresholds. See the animation video here.
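Several of the models quantify uncertainty by keeping dropout active at test time (MC dropout) and decomposing the predictive variance into its aleatoric and epistemic parts. The following is a minimal sketch of that standard decomposition, assuming a model whose forward pass returns a predictive mean and log-variance; it is illustrative, not the repository's exact API:

```python
import torch

def mc_dropout_predict(model, x, n_samples=100):
    """Aggregate stochastic forward passes into a predictive mean and
    a decomposition of uncertainty into aleatoric and epistemic parts."""
    model.train()  # keep dropout layers active at test time
    means, variances = [], []
    with torch.no_grad():
        for _ in range(n_samples):
            mu, log_var = model(x)        # heteroscedastic head: mean and log-variance
            means.append(mu)
            variances.append(log_var.exp())
    means, variances = torch.stack(means), torch.stack(variances)
    pred_mean = means.mean(dim=0)
    aleatoric = variances.mean(dim=0)     # average predicted data noise
    epistemic = means.var(dim=0)          # spread of means across dropout samples
    return pred_mean, aleatoric, epistemic
```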
Installation
Install probabilistic_forecast locally in "editable" mode (any changes to the original package will be reflected directly in your environment, so you don't have to re-install the package every time you make changes):
pip install -e .
Use the requirements file requirements.txt to install the packages required to run this project:
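pip install -r requirements.txt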
File Structure
.
├── probabilistic_forecast/
│ ├── bnn.py (class definition for the Bayesian neural networks model)
│ ├── ensemble.py (class definition for the deep ensemble model)
│ ├── gnn_mc.py (class definition for the graph neural network model with MC dropout)
│ ├── lstm_mc.py (class definition for the LSTM model with MC dropout)
│ ├── nn_mc.py (class definition for the standard neural network model with MC dropout)
│ ├── nn_standard.py (class definition for the standard neural network model without MC dropout)
│ ├── swag.py (class definition for the SWAG model)
│ └── utils/
│ ├── data_utils.py (utility functions for data loading and pre-processing)
│ ├── gnn_utils.py (utility functions for GNN)
│ ├── plot_utils.py (utility functions for plotting training and evaluation results)
│ ├── swag_utils.py (utility functions for SWAG)
│ └── torch_utils.py (utility functions for torch dataloader, checking if CUDA is available)
├── dataset/
│ ├── air_quality_measurements.csv (dataset of air quality measurements)
│ ├── street_cleaning.csv (dataset of street cleaning records)
│ ├── traffic.csv (dataset of traffic volumes)
│ ├── weather.csv (dataset of weather observations)
│ └── visualize_data.py (script to visualize all datasets)
├── main.py (main function with argument parsing to load data, build a model, and train or evaluate it)
├── tests/
│ ├── confidence_reliability.py (script to evaluate the reliability of confidence estimates of pretrained models)
│ └── epistemic_vs_aleatoric.py (script to show the impact of quantifying both epistemic and aleatoric uncertainties)
├── plots/ (folder containing all evaluation plots)
├── pretrained/ (folder containing pretrained models and training curve plots)
├── evaluate_all_models.sh (bash script for evaluating all models at once)
└── train_all_models.sh (bash script for training all models at once)
Evaluating Pretrained Models
Evaluate a pretrained model, for example:
python main.py --model=SWAG --task=regression --mode=evaluate --adversarial_training
or evaluate all models:
bash evaluate_all_models.sh
PM-value regression using a graph neural network with MC dropout.
Threshold-exceedance prediction
Threshold-exceedance prediction using a Bayesian neural network (BNN).
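With a probabilistic regression forecast, the exceedance probability follows in closed form from the predictive distribution. A minimal sketch assuming a Gaussian predictive distribution (variable names and the threshold value are illustrative, not the repository's API):

```python
import torch
from torch.distributions import Normal

def exceedance_probability(mu, sigma, threshold):
    """P(PM > threshold) under a Gaussian predictive distribution N(mu, sigma^2)."""
    return 1.0 - Normal(mu, sigma).cdf(torch.as_tensor(threshold))

# Example: a mean forecast of 45 ug/m3 with std 10 and a threshold of 50
p = exceedance_probability(torch.tensor(45.0), torch.tensor(10.0), 50.0)
print(p)  # ~0.31
```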
Confidence Reliability
To evaluate the confidence reliability of the considered probabilistic models, run the following command:
python tests/confidence_reliability.py
It will generate the following plots:
Confidence reliability of probabilistic models on the PM-value regression task across all monitoring stations.
Confidence reliability of probabilistic models on the threshold-exceedance prediction task across all monitoring stations.
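One common way to assess confidence reliability in the regression task is to compare the empirical coverage of the predictive intervals with their nominal confidence level; a well-calibrated model lies on the diagonal. A sketch of that check under a Gaussian predictive distribution (the general recipe, not necessarily the script's exact implementation):

```python
import numpy as np
from scipy.stats import norm

def empirical_coverage(y_true, mu, sigma, confidences=np.linspace(0.1, 0.99, 10)):
    """Fraction of observations falling inside the central Gaussian
    predictive interval at each nominal confidence level."""
    coverage = []
    for c in confidences:
        z = norm.ppf(0.5 + c / 2.0)  # interval half-width in standard deviations
        coverage.append(np.mean(np.abs(y_true - mu) <= z * sigma))
    return confidences, np.array(coverage)
```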
Epistemic and aleatoric uncertainties in decision making
To evaluate the impact of quantifying both epistemic and aleatoric uncertainties in decision making, run the following command:
python tests/epistemic_vs_aleatoric.py
It will generate the following plots:
Decision score in a non-probabilistic model as a function of aleatoric confidence only.
Decision score in a probabilistic model as a function of both epistemic and aleatoric confidences.
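The exact decision-score definition lives in the script; the underlying idea is that a probabilistic model can abstain from low-confidence predictions along both uncertainty axes before being scored. A rough illustrative sketch (all names and the scoring rule here are hypothetical):

```python
import numpy as np

def decision_score(errors, aleatoric_conf, epistemic_conf, t_ale, t_epi):
    """Act only on predictions that clear both confidence thresholds,
    then score the retained decisions (here: negative mean absolute error)."""
    keep = (aleatoric_conf >= t_ale) & (epistemic_conf >= t_epi)
    if not keep.any():
        return np.nan  # no prediction is confident enough to act on
    return -np.abs(errors[keep]).mean()
```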
It will also generate a .vtp file, which can be used to generate a 3D plot with detailed rendering and lighting in ParaView.
Training Models
Train a single model, for example:
python main.py --model=SWAG --task=regression --mode=train --n_epochs=3000 --adversarial_training
or train all models:
bash train_all_models.sh
Learning curves from training a BNN model to forecast PM-values. Left: negative log-likelihood loss. Center: KL loss estimated using MC sampling. Right: exponentially decaying learning rate.
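For reference, a minimal sketch of what one training step could look like for the MC-dropout models: a heteroscedastic Gaussian NLL with optional FGSM-style adversarial smoothing of the inputs, paired with an exponentially decaying learning rate as in the plot above. The BNN additionally adds the MC-estimated KL term; names and the eps value are illustrative, not the repository's exact code:

```python
import torch
import torch.nn.functional as F

def train_step(model, optimizer, x, y, adversarial=True, eps=0.01):
    """One step with a Gaussian NLL loss and optional adversarial smoothing."""
    x = x.detach().clone().requires_grad_(True)
    mu, log_var = model(x)
    nll = F.gaussian_nll_loss(mu, y, log_var.exp())
    if adversarial:
        # Perturb inputs along the loss gradient and add the adversarial NLL,
        # smoothing the conditional output distribution around training points.
        grad = torch.autograd.grad(nll, x, retain_graph=True)[0]
        mu_adv, log_var_adv = model(x + eps * grad.sign())
        nll = nll + F.gaussian_nll_loss(mu_adv, y, log_var_adv.exp())
    optimizer.zero_grad()
    nll.backward()
    optimizer.step()
    return nll.item()

# Exponential learning-rate decay, stepped once per epoch:
# scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.999)
```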
Dataset
Run the following command to visualize all datasets:
python dataset/visualize_data.py
It will generate plots in the dataset/ folder. For example:
Air quality level over two years at one representative monitoring station (Elgeseter) in Trondheim, Norway.
Attribution
- Parts of the SWAG code are based on the official code for the paper "A Simple Baseline for Bayesian Uncertainty in Deep Learning": https://github.com/wjmaddox/swa_gaussian.
- Parts of the GNN code are based on the official code for the paper "Spectral Temporal Graph Neural Network for Multivariate Time-series Forecasting": https://github.com/microsoft/StemGNN.
- Parts of the BNN code are based on https://github.com/JavierAntoran/Bayesian-Neural-Networks.
- The function h5_to_vtp in epistemic_vs_aleatoric.py, which converts h5 files to vtp files for use in ParaView, is based on https://github.com/tomgoldstein/loss-landscape.
- The air quality dataset is part of the open database of air quality measurements offered by the Norwegian Institute for Air Research (NILU): https://www.nilu.com/open-data/.
- The meteorological data are based on historical weather and climate data offered by the Norwegian Meteorological Institute: https://frost.met.no.
- The traffic data are based on aggregated traffic volumes offered by the Norwegian Public Roads Administration: https://www.vegvesen.no/trafikkdata/start/om-api.