This is the replication package for paper submission: Towards Training Reproducible Deep Learning Models.

Last update: Feb 2, 2022

Related tags

Deep Learning ICSE2022Rep

Overview

This is the replication package for paper submission: Towards Training Reproducible Deep Learning Models.

This replication package contains the following parts:

experiment results/ contains the experimental results for the six open source models
implementation/ contains the code for training the six open source models
record-and-replay/ contains the binary format of the record-and-replay tool
Time.xlsx contains the table for the time overhead comparison

To use the record-and-replay tool, in Linux, point the absolute location to LD_PRELOAD and start the training process as usual. Check the system log: cat /var/log/syslog

For the semi-formal interview:

We worked closely with ~20 practitioners, who are either senior software developers or ML scientists with Ph.D. degrees. Their tasks are to prototype DL models and/or productionalize DL models. We have conducted two separate interviews with them and each round lasted for about 2 hours. During the interview, we presented our survey on the state-of-the-art techniques towards reproducibility and gathered their feedback (reported in Section 2.3).

You might also like...

Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms

Open-L2O This repository establishes the first comprehensive benchmark efforts of existing learning to optimize (L2O) approaches on a number of proble

161 Jan 2, 2023

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

InfoPro-Pytorch The Information Propagation algorithm for training deep networks with local supervision. (ICLR 2021) Revisiting Locally Supervised Lea

78 Dec 27, 2022

Lightweight, Python library for fast and reproducible experimentation :microscope:

Steppy What is Steppy? Steppy is a lightweight, open-source, Python 3 library for fast and reproducible experimentation. Steppy lets data scientist fo

134 Jul 10, 2022

Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

Rubicon Purpose Rubicon is a data science tool that captures and stores model training and execution information, like parameters and outcomes, in a r

97 Jan 3, 2023

Towards Interpretable Deep Metric Learning with Structural Matching

This is the replication package for paper submission: Towards Training Reproducible Deep Learning Models.

Related tags

Overview

You might also like...

Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Lightweight, Python library for fast and reproducible experimentation :microscope:

Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

Towards Interpretable Deep Metric Learning with Structural Matching

(under submission) Bayesian Integration of a Generative Prior for Image Restoration

A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains (IJCV submission)

A supplementary code for Editable Neural Networks, an ICLR 2020 submission.

Submission to Twitter's algorithmic bias bounty challenge

Owner

This is the code for our KILT leaderboard submission to the T-REx and zsRE tasks. It includes code for training a DPR model then continuing training with RAG.

This is the repository for paper NEEDLE: Towards Non-invertible Backdoor Attack to Deep Learning Models.

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax

Replication attempt for the Protein Folding Model

Replication of Pix2Seq with Pretrained Model

Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

The source code for Generating Training Data with Language Models: Towards Zero-Shot Language Understanding.

This is the code of NeurIPS'21 paper "Towards Enabling Meta-Learning from Target Models".

This is the replication package for paper submission: Towards Training Reproducible Deep Learning Models.

Related tags

Overview

You might also like...

Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Lightweight, Python library for fast and reproducible experimentation :microscope:

Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

Towards Interpretable Deep Metric Learning with Structural Matching

(under submission) Bayesian Integration of a Generative Prior for Image Restoration

A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains (IJCV submission)

A supplementary code for Editable Neural Networks, an ICLR 2020 submission.

Submission to Twitter's algorithmic bias bounty challenge

Owner

This is the code for our KILT leaderboard submission to the T-REx and zsRE tasks. It includes code for training a DPR model then continuing training with RAG.

This is the repository for paper NEEDLE: Towards Non-invertible Backdoor Attack to Deep Learning Models.

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax

Replication attempt for the Protein Folding Model

Replication of Pix2Seq with Pretrained Model

Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

This is the pytorch implementation for the paper: *Learning Accurate Performance Predictors for Ultrafast Automated Model Compression*, which is in submission to TPAMI

The source code for Generating Training Data with Language Models: Towards Zero-Shot Language Understanding.

This is the code of NeurIPS'21 paper "Towards Enabling Meta-Learning from Target Models".

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI