A PyTorch implementation of a Factorization Machine module in cython.

Jack Hessel

Last update: Jul 6, 2022

Related tags

Deep Learning fmpytorch

Overview

fmpytorch

A library for factorization machines in pytorch. A factorization machine is like a linear model, except multiplicative interaction terms between the variables are modeled as well.

The input to a factorization machine layer is a vector, and the output is a scalar. Batching is fully supported.

This is a work in progress. Feedback and bugfixes welcome! Hopefully you find the code useful.

Usage

The factorization machine layers in fmpytorch can be used just like any other built-in module. Here's a simple feed-forward model using a factorization machine that takes in a 50-D input, and models interactions using k=5 factors.

import torch
from fmpytorch.second_order.fm import FactorizationMachine

class MyModel(torch.nn.Module):
    def __init__(self):
        super(MyModel, self).__init__()
        self.linear = torch.nn.Linear(100, 50)
        self.dropout = torch.nn.Dropout(.5)
	# This makes a fm layer mapping from 50-D to 1-D.
	# The number of factors is 5.
        self.fm = FactorizationMachine(50, 5)

    def forward(self, x):
        x = self.linear(x)
        x = self.dropout(x)
        x = self.fm(x)
        return x

See examples/toy.py or examples/regression.py for fuller examples.

Installation

This package requires pytorch, numpy, and cython.

To install, you can run:

cd fmpytorch
sudo python setup.py install

Factorization Machine brief intro

A linear model, given a vector x models its output y as

where w are the learnable weights of the model.

However, the interactions between the input variables x_i are purely additive. In some cases, it might be useful to model the interactions between your variables, e.g., x_i * x_j. You could add terms into your model like

However, this introduces a large number of w2 variables. Specifically, there are O(n^2) parameters introduced in this formulation, one for each interaction pair. A factorization machine approximates w2 using low dimensional factors, i.e.,

where each v_i is a low-dimensional vector. This is the forward pass of a second order factorization machine. This low-rank re-formulation has reduced the number of additional parameters for the factorization machine to O(k*n). Magically, the forward (and backward) pass can be reformulated so that it can be computed in O(k*n), rather than the naive O(k*n^2) formulation above.

Currently supported features

Currently, only a second order factorization machine is supported. The forward and backward passes are implemented in cython. Compared to the autodiff solution, the cython passes run several orders of magnitude faster. I've only tested it with python 2 at the moment.

TODOs

Support for sparse tensors.
More interesting useage examples
More testing, e.g., with python 3, etc.
Make sure all of the code plays nice with torch-specific stuff, e.g., GPUs
Arbitrary order factorization machine support
Better organization/code cleaning

Thanks to

Vlad Niculae (@vene) for his sage wisdom.

The original factorization machine citation, which this layer is based off of, is

@inproceedings{rendle2010factorization,
	       title={Factorization machines},
    	       author={Rendle, Steffen},
      	       booktitle={ICDM},
               pages={995--1000},
	       year={2010},
	       organization={IEEE}
}

A Pytorch implementation of CVPR 2021 paper "RSG: A Simple but Effective Module for Learning Imbalanced Datasets"

RSG: A Simple but Effective Module for Learning Imbalanced Datasets (CVPR 2021) A Pytorch implementation of our CVPR 2021 paper "RSG: A Simple but Eff

120 Dec 12, 2022

Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, as a standalone package for Pytorch

Triangle Multiplicative Module - Pytorch Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or c

22 Oct 28, 2022

Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)

Pytorch implementation of Relational Networks - A simple neural network module for relational reasoning Implemented & tested on Sort-of-CLEVR task. So

800 Dec 5, 2022

Implementation of the Chamfer Distance as a module for pyTorch

Chamfer Distance for pyTorch This is an implementation of the Chamfer Distance as a module for pyTorch. It is written as a custom C++/CUDA extension.

205 Jan 5, 2023

PyTorch implementation for the visual prior component (i.e. perception module) of the Visually Grounded Physics Learner [Li et al., 2020].

VGPL-Visual-Prior PyTorch implementation for the visual prior component (i.e. perception module) of the Visually Grounded Physics Learner (VGPL). Give

8 Dec 29, 2022

Objective of the repository is to learn and build machine learning models using Pytorch. 30DaysofML Using Pytorch

30 Days Of Machine Learning Using Pytorch Objective of the repository is to learn and build machine learning models using Pytorch. List of Algorithms

119 Nov 24, 2022

PyTorch module to use OpenFace's nn4.small2.v1.t7 model

OpenFace for Pytorch Disclaimer: This codes require the input face-images that are aligned and cropped in the same way of the original OpenFace. * I m

176 Dec 12, 2022

Neural Module Network for VQA in Pytorch

Neural Module Network (NMN) for VQA in Pytorch Note: This is NOT an official repository for Neural Module Networks. NMN is a network that is assembled

111 Nov 24, 2022

Torch-mutable-modules - Use in-place and assignment operations on PyTorch module parameters with support for autograd

Torch Mutable Modules Use in-place and assignment operations on PyTorch module p

7 Jun 6, 2022

Comments

Tips for some optimizations

Nice work with the code in this repo! I really liked to see the use of Cython with PyTorch!

One quick suggestion, for the second order comparison, it's possible to use some matrix-multiply operations in order to avoid the triple-nested for loop. Something like the following:

# This code supposes PyTorch v0.2, as it uses broadcasting
class SecondOrderInteraction(nn.Module):
    def __init__(self, n_feats, n_factors):
        super(SecondOrderInteraction, self).__init__()
        self.v = nn.Parameter(torch.Tensor(n_feats, n_factors))
        self.v.data.uniform_(-0.01, 0.01)
        
    def forward(self, x):
        self.batch_size = x.size(0)
        all_interactions = torch.mm(self.v, self.v.t())
        pairwise = torch.bmm(x[:, :, None], x[:, None]) * all_interactions[None]
        mask = torch.ones(pairwise[0].size(), out=x.data.new()).triu(1)
        output = pairwise * Variable(mask)
        res = output.sum(1).sum(1,keepdim=True)
        return res

One advantage is that it works for CUDA out-of-the-box. But in other cases, using matrix-multiplies can become very memory-expensive, so the use of dedicated C/Cython code can be a much better approach. Thanks!

opened by fmassa 3

Installation issue

Hi

After running python setup.py install and using FactorizationMachine layer in a simple example, I got ModuleNotFoundError: No module names 'second_order_fast' in from second_order_fast import SecondOrderInteraction continuing with 'second_order_naive' ModuleNotFoundError!

I'm on Ubuntu 14.04, Anaconda4.4.0, python=3.6 and Cython=0.26 (gcc 4.8.4).

opened by ehsanmok 3

A PyTorch implementation of a Factorization Machine module in cython.

Related tags

Overview

fmpytorch

Usage

Installation

Factorization Machine brief intro

Currently supported features

TODOs

Thanks to

You might also like...

A Pytorch implementation of CVPR 2021 paper "RSG: A Simple but Effective Module for Learning Imbalanced Datasets"

Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, as a standalone package for Pytorch

Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)

Implementation of the Chamfer Distance as a module for pyTorch

PyTorch implementation for the visual prior component (i.e. perception module) of the Visually Grounded Physics Learner [Li et al., 2020].

Objective of the repository is to learn and build machine learning models using Pytorch. 30DaysofML Using Pytorch

PyTorch module to use OpenFace's nn4.small2.v1.t7 model

Neural Module Network for VQA in Pytorch

Torch-mutable-modules - Use in-place and assignment operations on PyTorch module parameters with support for autograd

Comments

Tips for some optimizations

Installation issue

Owner

Jack Hessel

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

PyTorch framework, for reproducing experiments from the paper Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks

Implementation of SSMF: Shifting Seasonal Matrix Factorization

Neural Factorization of Shape and Reflectance Under An Unknown Illumination

TuckER: Tensor Factorization for Knowledge Graph Completion

Implementation of Invariant Point Attention, used for coordinate refinement in the structure module of Alphafold2, as a standalone Pytorch module

计算机视觉中用到的注意力模块和其他即插即用模块PyTorch Implementation Collection of Attention Module and Plug&Play Module

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.