# Torch-gather
A mini library that implements several useful functions for PyTorch, with bindings written in C++.
## What does gather do? Why do we need it?
When dealing with batches of variable-length sequences, a common approach is to pad every sequence to the maximum length. As lengths vary, this introduces considerable redundancy and wastes both computation and memory. `gather` simply removes the padding, so subsequent computation runs only on the real data.
## Install

```bash
python setup.py install
```
## Docs

Note that all input tensors must be on a CUDA device.
- `gather.gathercat(x_padded: torch.FloatTensor, lx: torch.IntTensor)`

  Returns a concatenation of the given padded tensor `x_padded` according to its lengths `lx`.

  Input:

  - `x_padded` (`torch.float`): padded tensor of size `(N, L, V)`, where `L = max(lx)`.
  - `lx` (`torch.int`): lengths of size `(N, )`.

  Return:

  - `x_gather` (`torch.float`): the gathered tensor without padding, of size `(lx[0]+lx[1]+...+lx[N-1], V)`.

  Example:
  ```python
  >>> import torch
  >>> from gather import gathercat
  >>> lx = torch.randint(3, 20, (5, ), dtype=torch.int32, device='cuda')
  >>> x_padded = torch.randn((5, lx.max(), 64), device='cuda')
  >>> x_padded.size(), lx.size()
  (torch.Size([5, 19, 64]), torch.Size([5]))
  >>> x_gather = gathercat(x_padded, lx)
  >>> x_gather.size()
  torch.Size([81, 64])
  # another example, with V=1
  >>> x_padded = torch.tensor([[1., 2., 3.], [1., 2., 0.]], device='cuda').unsqueeze(2)
  >>> lx = torch.tensor([3, 2], dtype=torch.int32, device='cuda')
  >>> x_padded
  tensor([[[1.],
           [2.],
           [3.]],

          [[1.],
           [2.],
           [0.]]], device='cuda:0')
  >>> lx
  tensor([3, 2], device='cuda:0', dtype=torch.int32)
  >>> gathercat(x_padded, lx)
  tensor([[1.],
          [2.],
          [3.],
          [1.],
          [2.]], device='cuda:0')
  ```
  This function is easy to implement with built-in PyTorch ops like `torch.cat()`; `gathercat()`, however, is customized for this task and is more efficient. A pure-PyTorch equivalent is sketched below.
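  For reference, a plain-PyTorch equivalent might look like the following. This is a minimal sketch for checking shapes and values; the `gathercat_reference` name is ours and not part of the library.

  ```python
  import torch

  def gathercat_reference(x_padded: torch.Tensor, lx: torch.Tensor) -> torch.Tensor:
      """Sketch of gathercat(): drop each sequence's padding, then concatenate.

      x_padded: (N, L, V) padded batch; lx: (N, ) int lengths.
      Returns a tensor of size (sum(lx), V).
      """
      return torch.cat([x_padded[i, :lx[i]] for i in range(lx.size(0))], dim=0)
  ```

  The Python loop slices one sequence at a time, which is exactly the per-sequence overhead the fused CUDA kernel avoids.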
- `gather.gathersum(xs: torch.FloatTensor, ys: torch.FloatTensor, lx: torch.IntTensor, ly: torch.IntTensor)`

  Returns a sequence-matched broadcast sum of the paired gathered tensors `xs` and `ys`. For a pair of sequences `xs_i` and `ys_i` from `xs` and `ys`, `gathersum()` broadcasts them so that they can be added up. In plain PyTorch, the broadcast step can be understood as `(xs_i.unsqueeze(1) + ys_i.unsqueeze(0)).reshape(-1, V)` (see the sketch after the example below).

  Input:
  - `xs` (`torch.float`): gathered tensor of size `(ST, V)`, where `ST = sum(lx)`.
  - `ys` (`torch.float`): gathered tensor of size `(SU, V)`, where `SU = sum(ly)`.
  - `lx` (`torch.int`): lengths of size `(N, )`; `lx[i]` denotes the length of the $i$-th sequence in `xs`.
  - `ly` (`torch.int`): lengths of size `(N, )`; `ly[i]` denotes the length of the $i$-th sequence in `ys`.

  Return:

  - `gathered_sum` (`torch.float`): the gathered sequence-matched sum, of size `(lx[0]*ly[0]+lx[1]*ly[1]+...+lx[N-1]*ly[N-1], V)`.
  Example:

  ```python
  >>> import torch
  >>> from gather import gathersum
  >>> N, T, U, V = 5, 4, 4, 3
  >>> lx = torch.randint(1, T, (N, ), dtype=torch.int32, device='cuda')
  >>> ly = torch.randint(1, U, (N, ), dtype=torch.int32, device='cuda')
  >>> xs = torch.randn((lx.sum(), V), device='cuda')
  >>> ys = torch.randn((ly.sum(), V), device='cuda')
  >>> xs.size(), ys.size(), lx.size(), ly.size()
  (torch.Size([11, 3]), torch.Size([10, 3]), torch.Size([5]), torch.Size([5]))
  >>> gathered_sum = gathersum(xs, ys, lx, ly)
  >>> gathered_sum.size()
  torch.Size([20, 3])
  # let's see how the size 20 comes out
  >>> lx.tolist(), ly.tolist()
  ([2, 2, 1, 3, 3], [3, 1, 3, 1, 2])
  # still unclear? how about this?
  >>> (lx * ly).sum().item()
  20
  ```
  This function may look like it does something odd at first; please refer to the discussion page for a concrete usage example.
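  For reference, a plain-PyTorch equivalent might look like the following. This is a minimal sketch; the `gathersum_reference` name is ours, and the row ordering within each pair follows the broadcast formula above, which is our assumption rather than the library's documented layout.

  ```python
  import torch

  def gathersum_reference(xs: torch.Tensor, ys: torch.Tensor,
                          lx: torch.Tensor, ly: torch.Tensor) -> torch.Tensor:
      """Sketch of gathersum(): per-pair broadcast sum over gathered tensors.

      xs: (sum(lx), V); ys: (sum(ly), V); lx, ly: (N, ) int lengths.
      Returns a tensor of size (sum(lx * ly), V).
      """
      out, px, py = [], 0, 0
      for t, u in zip(lx.tolist(), ly.tolist()):
          xs_i = xs[px:px + t]   # (T_i, V)
          ys_i = ys[py:py + u]   # (U_i, V)
          # (T_i, 1, V) + (1, U_i, V) -> (T_i, U_i, V), then flatten to (T_i*U_i, V)
          out.append((xs_i.unsqueeze(1) + ys_i.unsqueeze(0)).reshape(-1, xs.size(1)))
          px, py = px + t, py + u
      return torch.cat(out, dim=0)
  ```

  This mirrors the kind of joint-network sum used in RNN-T models (cf. the warp-rnnt reference below), where padding-free gathering avoids materializing the full `(N, T, U, V)` tensor.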
## Reference
- The PyTorch binding code refers to [1ytic/warp-rnnt](https://github.com/1ytic/warp-rnnt).
- For the specific usage of these functions, please refer to this discussion.