Recurrent Conditional Query Learning

Dongda

Last update: Nov 28, 2022

Related tags

Deep Learning RCQL

Overview

Recurrent Conditional Query Learning (RCQL)

This repository contains the Pytorch implementation of

One Model Packs Thousands of Items with Recurrent Conditional Query Learning

Dongda Li, Zhaoquan Gu, Yuexuan Wang, Changwei Ren, Francis C.M. Lau

We propose a Recurrent Conditional Query Learning (RCQL) method to solve both 2D and 3D packing problems. We first embed states by a recurrent encoder, and then adopt attention with conditional queries from previous actions. The conditional query mechanism fills the information gap between learning steps, which shapes the problem as a Markov decision process. Benefiting from the recurrence, a single RCQL model is capable of handling different sizes of packing problems. Experiment results show that RCQL can effectively learn strong heuristics for offline and online strip packing problems (SPPs), out- performing a wide range of baselines in space utilization ratio. RCQL reduces the average bin gap ratio by 1.83% in offline 2D 40-box cases and 7.84% in 3D cases compared with state-of-the-art methods. Meanwhile, our method also achieves 5.64% higher space utilization ratio for SPPs with 1000 items than the state of the art.

Usage

Preparation

Install conda
Run conda env create -f environment.yml

Train

Modify the config file in config.py as you need.
Run python main.py.

You might also like...

the code of the paper: Recurrent Multi-view Alignment Network for Unsupervised Surface Registration (CVPR 2021)

RMA-Net This repo is the implementation of the paper: Recurrent Multi-view Alignment Network for Unsupervised Surface Registration (CVPR 2021). Paper

205 Nov 9, 2022

PyTorch implementation DRO: Deep Recurrent Optimizer for Structure-from-Motion

DRO: Deep Recurrent Optimizer for Structure-from-Motion This is the official PyTorch implementation code for DRO-sfm. For technical details, please re

56 Dec 12, 2022

Stacked Recurrent Hourglass Network for Stereo Matching

SRH-Net: Stacked Recurrent Hourglass Introduction This repository is supplementary material of our RA-L submission, which helps reviewers to understan

28 Jan 3, 2023

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Recurrent Fast Weight Programmers This is the official repository containing the code we used to produce the experimental results reported in the pape

36 Nov 15, 2022

Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"

Easy-To-Hard The official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks". Gett

52 Sep 8, 2022

An implementation of DeepMind's Relational Recurrent Neural Networks in PyTorch.

Comments

run error

when I run the main.py, I got below error

line 178, in update_rotate rotate_mask[i] = rotate.squeeze(-1).eq(i) NameError: name 'rotate_mask' is not defined

opened by Xiong5Heng 3
runtimeerror

您好，我在运行python main.py时遇到了这样的问题RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument mask in method wrapper__masked_select)

opened by 13086628579 2
Simply run main.py with default params, avg_reward get smaller and smaller

| actor_loss | 0.0489 | | alpha_loss | -29.1 | | avg_rewards | 272 | | entropy | 3.32 | | epoch | 0 | | explained_variance | -0.000163 | | gap_ratio | 0.85 | | value_loss | 41.3 | | var_gap_ratio | 8.53e-05 |

...

| actor_loss | 0.00868 | | alpha_loss | -17.8 | | avg_rewards | -21.5 | | entropy | 0.358 | | epoch | 2.28e+03 | | explained_variance | 0.883 | | gap_ratio | 0.47 | | value_loss | 0.0347 | | var_gap_ratio | 0.000503 |

opened by Jetcodery 2

Recurrent Conditional Query Learning

Related tags

Overview

Recurrent Conditional Query Learning (RCQL)

Usage

Preparation

Train

You might also like...

the code of the paper: Recurrent Multi-view Alignment Network for Unsupervised Surface Registration (CVPR 2021)

PyTorch implementation DRO: Deep Recurrent Optimizer for Structure-from-Motion

Stacked Recurrent Hourglass Network for Stereo Matching

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"

An implementation of DeepMind's Relational Recurrent Neural Networks in PyTorch.

Pytorch implementation of the Variational Recurrent Neural Network (VRNN).

PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM

RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching

Comments

run error

runtimeerror

Simply run main.py with default params, avg_reward get smaller and smaller

| actor_loss | 0.0489 | | alpha_loss | -29.1 | | avg_rewards | 272 | | entropy | 3.32 | | epoch | 0 | | explained_variance | -0.000163 | | gap_ratio | 0.85 | | value_loss | 41.3 | | var_gap_ratio | 8.53e-05 |

| actor_loss | 0.00868 | | alpha_loss | -17.8 | | avg_rewards | -21.5 | | entropy | 0.358 | | epoch | 2.28e+03 | | explained_variance | 0.883 | | gap_ratio | 0.47 | | value_loss | 0.0347 | | var_gap_ratio | 0.000503 |

Owner

Dongda

Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources

Official implementation for NIPS'17 paper: PredRNN: Recurrent Neural Networks for Predictive Learning Using Spatiotemporal LSTMs.

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Official implementation of the paper DeFlow: Learning Complex Image Degradations from Unpaired Data with Conditional Flows

A deep learning based semantic search platform that computes similarity scores between provided query and documents

Official implementation for "QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation" (CVPR 2022)

Code for the ECCV2020 paper "A Differentiable Recurrent Surface for Asynchronous Event-Based Data"

Implementation of our paper 'RESA: Recurrent Feature-Shift Aggregator for Lane Detection' in AAAI2021.

OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)