DANet for Tabular data classification/ regression.

Ronnie Rocket

Last update: Sep 14, 2022

Related tags

Deep Learning DANet

Overview

Deep Abstract Networks

A PyTorch code implemented for the submission DANets: Deep Abstract Networks for Tabular Data Classification and Regression.

Downloads

Dataset

Download the datasets from the following links:

(Optional) Before starting the program, you may change the file format to .pkl by using svm2pkl() or csv2pkl() in ./data/data_util.py

Weights for inference models

The demo weights for Forest Cover Type dataset is available in the folder "./Weights/".

How to use

Setting

Clone or download this repository, and cd the path where you clone it.
Build a working python environment. Python 3.7 is fine for this repository.
Install packages in requirements.txt, e.g., by pip install -r requirements.txt.
The default hyperparameters are in ./config/default.py.

Training

Set the hyperparameters in config file (./config/default.py or ./config/*.yaml).
Notably, the hyperparameters in .yaml file will cover those in default.py.
Run python main.py --c [config_path] --g [gpu_id].
- -c: The config file path
- -g: GPU device ID
The checkpoint models and best models will be saved at ./logs.

Inference

Replace the resume_dir path by the file path of model/weight.
Run codes by using python predict.py -d [dataset_name] -m [model_file_path] -g [gpu_id].
- -d: Dataset name
- -m: Model path for loading
- -g: GPU device ID

Config Hyperparameters

Normal parameters

dataset: str
Dataset name must match those in ./data/dataset.py.
task: str
Using 'classification' or 'regression'.
resume_dir: str
The log path containing the checkpoint models.
logname: str
The directory names of the models save at ./logs.
seed: int
Random seed.

Model parameters

layer: int (default=20)
Number of abstract layers to stack
k: int (default=5)
Number of masks
base_outdim: int (default=64)
The output feature dimension in abstract layer.
drop_rate: float (default=0.1) Dropout rate in shortcut module

Fit parameters

lr: float (default=0.008)
Learning rate
max_epochs: int (default=5000)
Maximum number of epochs for training.
patience: int (default=1500)
Number of consecutive epochs without improvement before performing early stopping. If patience is set to 0, then no early stopping will be performed.
batch_size: int (default=8192)
Number of examples per batch.
virtual_batch_size: int (default=256)
Size of the mini batches used for "Ghost Batch Normalization". virtual_batch_size must divide batch_size

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

Website | Documentation | Tutorials | Installation | Release Notes CatBoost is a machine learning method based on gradient boosting over decision tree

5.7k Feb 12, 2021

tsai is an open-source deep learning package built on top of Pytorch & fastai focused on state-of-the-art techniques for time series classification, regression and forecasting.

Time series Timeseries Deep Learning Pytorch fastai - State-of-the-art Deep Learning with Time Series and Sequences in Pytorch / fastai

2.8k Jan 8, 2023

Hl classification bc - A Network-Based High-Level Data Classification Algorithm Using Betweenness Centrality

A Network-Based High-Level Data Classification Algorithm Using Betweenness Centr

3 Dec 1, 2022

Calculates carbon footprint based on fuel mix and discharge profile at the utility selected. Can create graphs and tabular output for fuel mix based on input file of series of power drawn over a period of time.

carbon-footprint-calculator Conda distribution ~/anaconda3/bin/conda install anaconda-client conda-build ~/anaconda3/bin/conda config --set anaconda_u

Seattle university Renewable energy research

7 Sep 26, 2022

[NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets

[NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets Introduction This repo contains the source code accompanying the paper: Well-tuned Sim

52 Jan 4, 2023

PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021)

OCT-GAN: Neural ODE-based Conditional Tabular GANs (OCT-GAN) Code for reproducing the experiments in the paper: Jayoung Kim*, Jinsung Jeon*, Jaehoon L

7 Dec 27, 2022

Research on Tabular Deep Learning (Python package & papers)

Research on Tabular Deep Learning For paper implementations, see the section "Papers and projects". rtdl is a PyTorch-based package providing a user-f

510 Dec 30, 2022

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

A tour through tensorflow with financial data I present several models ranging in complexity from simple regression to LSTM and policy networks. The s

195 Dec 7, 2022

Code for "CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds" @ICRA2021

CloudAAE This is an tensorflow implementation of "CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds" Files log:

35 Nov 14, 2022

DANet for Tabular data classification/ regression.

Related tags

Overview

Deep Abstract Networks

Downloads

Dataset

Weights for inference models

How to use

Setting

Training

Inference

Config Hyperparameters

Normal parameters

Model parameters

Fit parameters

You might also like...

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

tsai is an open-source deep learning package built on top of Pytorch & fastai focused on state-of-the-art techniques for time series classification, regression and forecasting.

Hl classification bc - A Network-Based High-Level Data Classification Algorithm Using Betweenness Centrality

Calculates carbon footprint based on fuel mix and discharge profile at the utility selected. Can create graphs and tabular output for fuel mix based on input file of series of power drawn over a period of time.

[NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets

PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021)

Research on Tabular Deep Learning (Python package & papers)

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Code for "CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds" @ICRA2021

Owner

Ronnie Rocket

A standard framework for modelling Deep Learning Models for tabular data

Implementation of TabTransformer, attention network for tabular data, in Pytorch

Boosted neural network for tabular data

The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

deep-table implements various state-of-the-art deep learning and self-supervised learning algorithms for tabular data using PyTorch.

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

A framework for attentive explainable deep learning on tabular data

Job-Recommend-Competition - Vectorwise Interpretable Attentions for Multimodal Tabular Data

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.