This repository is the official implementation of Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Rickard Karlsson

Last update: Aug 19, 2022

Related tags

Deep Learning LearningUsingPrivilegedTimeSeries

Overview

Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Link to paper

Abstract

We study prediction of future outcomes with supervised models that use privileged information during learning. The privileged information comprises samples of time series observed between the baseline time of prediction and the future outcome; this information is only available at training time which differs from the traditional supervised learning. Our question is when using this privileged data leads to more sample-efficient learning of models that use only baseline data for predictions at test time. We give an algorithm for this setting and prove that when the time series are drawn from a non-stationary Gaussian-linear dynamical system of fixed horizon, learning with privileged information is more efficient than learning without it. On synthetic data, we test the limits of our algorithm and theory, both when our assumptions hold and when they are violated. On three diverse real-world datasets, we show that our approach is generally preferable to classical learning, particularly when data is scarce. Finally, we relate our estimator to a distillation approach both theoretically and empirically.

Requirements

Required libraries found in requirements.txt

Models

Baseline and LUPTS are implemented using sklearn, the code is found in /src/model/

Evaluation

Synthethic

To re-produce experiments, run /notebooks/synthetic.ipynb Necessary experiment code is found in /src/synthetic/

Forecasting Air Quality

To re-produce experiments, run /notebooks/fivecities.ipynb Necessary experiment code is found in /src/fivecities/

The data is found in /data/fivecities/, but can also be downloaded from here.

Modeling Progression of Chronic Disease

Note: For the Alzheimer’s and Multiple myeloma progression modeling tasks, the data is not publicly available, but the code which produced the results is still found in this repository.

Alzheimer's progression modelling

Code is found in /notebooks/ADNI.ipynb and /src/adni/

Multiple myeloma progression modelling

Code is found in /notebooks/mm-prfs.ipynb and /notebooks/mm-tr.ipynb

Library for implementing reservoir computing models (echo state networks) for multivariate time series classification and clustering.

Framework overview This library allows to quickly implement different architectures based on Reservoir Computing (the family of approaches popularized

249 Dec 21, 2022

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)

FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction

18 Sep 1, 2022

Official repository for "Exploiting Session Information in BERT-based Session-aware Sequential Recommendation", SIGIR 2022 short.

Session-aware BERT4Rec Official repository for "Exploiting Session Information in BERT-based Session-aware Sequential Recommendation", SIGIR 2022 shor

22 Dec 13, 2022

Doge-Prediction - Coding Club prediction ig

Doge-Prediction Coding Club prediction ig Basically: Create an application that

1 Jan 10, 2022

This repository provides an efficient PyTorch-based library for training deep models.

An Efficient Library for Training Deep Models This repository provides an efficient PyTorch-based library for training deep models. Installation Make

123 Jan 5, 2023

Official implementation of "Motif-based Graph Self-Supervised Learning forMolecular Property Prediction"

Motif-based Graph Self-Supervised Learning for Molecular Property Prediction Official Pytorch implementation of NeurIPS'21 paper "Motif-based Graph Se

71 Dec 20, 2022

Official repository for MixFaceNets: Extremely Efficient Face Recognition Networks

MixFaceNets This is the official repository of the paper: MixFaceNets: Extremely Efficient Face Recognition Networks. (Accepted in IJCB2021) https://i

51 Dec 13, 2022

Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Restormer: Efficient Transformer for High-Resolution Image Restoration Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan,

906 Dec 30, 2022

LONG-TERM SERIES FORECASTING WITH QUERYSELECTOR – EFFICIENT MODEL OF SPARSEATTENTION

Query Selector Here you can find code and data loaders for the paper https://arxiv.org/pdf/2107.08687v1.pdf . Query Selector is a novel approach to sp

62 Dec 17, 2022

This repository is the official implementation of Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Related tags

Overview

Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Link to paper

Abstract

Requirements

Models

Evaluation

Synthethic

Forecasting Air Quality

Modeling Progression of Chronic Disease

Alzheimer's progression modelling

Multiple myeloma progression modelling

You might also like...

Library for implementing reservoir computing models (echo state networks) for multivariate time series classification and clustering.

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)

Official repository for "Exploiting Session Information in BERT-based Session-aware Sequential Recommendation", SIGIR 2022 short.

Doge-Prediction - Coding Club prediction ig

This repository provides an efficient PyTorch-based library for training deep models.

Official implementation of "Motif-based Graph Self-Supervised Learning forMolecular Property Prediction"

Official repository for MixFaceNets: Extremely Efficient Face Recognition Networks

Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

LONG-TERM SERIES FORECASTING WITH QUERYSELECTOR – EFFICIENT MODEL OF SPARSEATTENTION

Owner

Rickard Karlsson

An efficient and effective learning to rank algorithm by mining information across ranking candidates. This repository contains the tensorflow implementation of SERank model. The code is developed based on TF-Ranking.

Contra is a lightweight, production ready Tensorflow alternative for solving time series prediction challenges with AI

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."

TAug :: Time Series Data Augmentation using Deep Generative Models

This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.

The GitHub repository for the paper: “Time Series is a Special Sequence: Forecasting with Sample Convolution and Interaction“.

Price-Prediction-For-a-Dream-Home - A machine learning based linear regression trained model for house price prediction.

ICML 21 - Voice2Series: Reprogramming Acoustic Models for Time Series Classification