GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

Jiaxuan

Last update: Dec 29, 2022

Related tags

Deep Learning graph-generation

Overview

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Model

This repository is the official PyTorch implementation of GraphRNN, a graph generative model using auto-regressive model.

Jiaxuan You*, Rex Ying*, Xiang Ren, William L. Hamilton, Jure Leskovec, GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Model (ICML 2018)

Installation

Install PyTorch following the instuctions on the official website. The code has been tested over PyTorch 0.2.0 and 0.4.0 versions.

conda install pytorch torchvision cuda90 -c pytorch

Then install the other dependencies.

pip install -r requirements.txt

Test run

python main.py

Code description

For the GraphRNN model: main.py is the main executable file, and specific arguments are set in args.py. train.py includes training iterations and calls model.py and data.py create_graphs.py is where we prepare target graph datasets.

For baseline models:

B-A and E-R models are implemented in baselines/baseline_simple.py.
Kronecker graph model is implemented in the SNAP software, which can be found in https://github.com/snap-stanford/snap/tree/master/examples/krongen (for generating Kronecker graphs), and https://github.com/snap-stanford/snap/tree/master/examples/kronfit (for learning parameters for the model).
MMSB is implemented using the EDWARD library (http://edwardlib.org/), and is located in baselines.
We implemented the DeepGMG model based on the instructions of their paper in main_DeepGMG.py.
We implemented the GraphVAE model based on the instructions of their paper in baselines/graphvae.

Parameter setting: To adjust the hyper-parameter and input arguments to the model, modify the fields of args.py accordingly. For example, args.cuda controls which GPU is used to train the model, and args.graph_type specifies which dataset is used to train the generative model. See the documentation in args.py for more detailed descriptions of all fields.

Outputs

There are several different types of outputs, each saved into a different directory under a path prefix. The path prefix is set at args.dir_input. Suppose that this field is set to ./:

./graphs contains the pickle files of training, test and generated graphs. Each contains a list of networkx object.
./eval_results contains the evaluation of MMD scores in txt format.
./model_save stores the model checkpoints
./nll saves the log-likelihood for generated graphs as sequences.
./figures is used to save visualizations (see Visualization of graphs section).

Evaluation

The evaluation is done in evaluate.py, where user can choose which settings to evaluate. To evaluate how close the generated graphs are to the ground truth set, we use MMD (maximum mean discrepancy) to calculate the divergence between two sets of distributions related to the ground truth and generated graphs. Three types of distributions are chosen: degree distribution, clustering coefficient distribution. Both of which are implemented in eval/stats.py, using multiprocessing python module. One can easily extend the evaluation to compute MMD for other distribution of graphs.

We also compute the orbit counts for each graph, represented as a high-dimensional data point. We then compute the MMD between the two sets of sampled points using ORCA (see http://www.biolab.si/supp/orca/orca.html) at eval/orca. One first needs to compile ORCA by

g++ -O2 -std=c++11 -o orca orca.cpp`

in directory eval/orca. (the binary file already in repo works in Ubuntu).

To evaluate, run

python evaluate.py

Arguments specific to evaluation is specified in class evaluate.Args_evaluate. Note that the field Args_evaluate.dataset_name_all must only contain datasets that are already trained, by setting args.graph_type to each of the datasets and running python main.py.

Visualization of graphs

The training, testing and generated graphs are saved at 'graphs/'. One can visualize the generated graph using the function utils.load_graph_list, which loads the list of graphs from the pickle file, and util.draw_graph_list, which plots the graph using networkx.

Misc

Jesse Bettencourt and Harris Chan have made a great slide introducing GraphRNN in Prof. David Duvenaud’s seminar course Learning Discrete Latent Structure.

Comments

GraphRNN with node feature

Firstly, thanks a lot for your perfect work, especially the comprehensive baselines. However, as you mentioned in appendix A.6, GraphRNN can be used as a node and edge feature generation. I wonder if you have finished this part in this repository?

opened by Skylark0924 1
Covariate Adjustment + Single Graph + CPU Usage

I know how to convert these torch tensors to CPU, I am curious though why the user does not have the choice to run CUDA.

Also, I have a custom graph with some covariates, how can I train on these to compare to ERGM? Particularly, what would you recommend for using covariates for graphRNN generation?

I'm just having trouble locating where to input a single graph for this pipeline, as the paper said that single or multiple graphs can be used.

opened by jlevy44 0
How can I use graphRNN code to generate graphs with costume node labels or with node attributes?

Hello, How can I use graphRNN code to generate graphs with costume node labels or with node attributes? Here is my problem: by modifying 'create_graphs.py', I created a training set of grid2D graphs where instead of default labels such as (0,0), (0,1), (1,0),...the nodes have costume labels such as a,b,c,... and they have also a node attribute. However when I train graphRNN with these graphs, then the graphs generated by graphRNN have default labels (0,0), (0,1), (1,0),...and they have no node attributes. Do you have any solution for this problem? Thanks

opened by thesaturdayafternoon 1
MMD computation - square root missing

The compute_mmd function in mmd.py should compute square root of disc(s1, s1) + disc(s2, s2) - 2*disc(s1, s2) right? Referring equation 10 of the GraphRNN paper.

opened by tathagatv 0
How does the GraphRNN-S deal with variable length?

I was wondering if you just set the output to make possible edges for the current graph? My guess that is the case based on:

https://github.com/JiaxuanYou/graph-generation/blob/3444b8ad2fd7ecb6ade45086b4c75f8e2e9f29d1/main.py#L124

can anyone confirm?

opened by brando90 0
Issue of generating reciprocal edges in directed graph

Hi, thanks for open source the code! The framework can learn and generate undirected graph with high quality. But when I train the model (modified based on the suggestion given in the appendix of the paper) with directed graph, I find it cannot generate similar amount of reciprocal edges compare to the training graph I have.

Specifically, The modification I have done is double the length of edge sequence of node i as (A_1i, A_i1, A_2i, A_i2 ...), where A is the adj matrix. I do know this leads to a sparser sequence.

The graph I'm training has around 200 nodes, 659 one sided directed edges and 58 reciprocal edges, while the generated graph in average has less than 520 directed edges and 15 to 20 reciprocal edges. So the model is generating sparser graph.

I wonder if anyone has experience using this framework with directed graph, and give any advise on dealing with my issue?

Thanks in advance.

opened by c752334430 3
The implemented baseline issues
Many thanks for releasing the code. I have some questions regarding the GraphVAE.

It seems that the encoder is not using the graph neural network, the GCN layers are commented at lines 113-118 of baselines/graphvae/model.py. Am I right?

The binary assignment matrix has not been used after matching graphs, since self.permute_adj is also commented. if that's the case, the model is a simple Variational Graph Auto-Encoders implemented by kipf.

I uncommented these parts and still did not get the reported result in the GrapgRNN paper. I am not sure what could be the problem. I was wondering if this implementation of the GrapgVAE model used in the paper.

Many thanks for your help and answer.
opened by kiarashza 5

Owner

Jiaxuan

GitHub

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

45 Dec 8, 2022

《Towards High Fidelity Face Relighting with Realistic Shadows》(CVPR 2021)

Towards High Fidelity Face-Relighting with Realistic Shadows Andrew Hou, Ze Zhang, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu. In CVPR, 2021. T

114 Dec 10, 2022

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021 [WIP] The code for CVPR 2021 paper 'Disentangled Cycle Consistency for H

94 Dec 11, 2022

Pytorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

StackGAN-v2 StackGAN-v1: Tensorflow implementation StackGAN-v1: Pytorch implementation Inception score evaluation Pytorch implementation for reproduci

809 Dec 16, 2022

In this project we investigate the performance of the SetCon model on realistic video footage. Therefore, we implemented the model in PyTorch and tested the model on two example videos.

Contrastive Learning of Object Representations Supervisor: Prof. Dr. Gemma Roig Institutions: Goethe University CVAI - Computational Vision & Artifici

6 Dec 8, 2022

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

PyTorch SRResNet Implementation of Paper: "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"(https://arxiv.org/abs

436 Jan 9, 2023

Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images (ICCV 2021)

Table of Content Introduction Getting Started Datasets Installation Experiments Training & Testing Pretrained models Texture fine-tuning Demo Toward R

42 Dec 5, 2022

An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics.

Sketch Simulator An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics. See

12 Dec 18, 2022

Full body anonymization - Realistic Full-Body Anonymization with Surface-Guided GANs

Realistic Full-Body Anonymization with Surface-Guided GANs This is the official

30 Nov 18, 2022

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

Colorizer The point of this project is to write a program capable of taking a black and white / grayscale image, and generating a realistic or plausib

1 Jan 6, 2022

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

LDL Paper | Supplementary Material Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution Jie Liang*, Hu

150 Dec 26, 2022

QA-GNN: Question Answering using Language Models and Knowledge Graphs

QA-GNN: Question Answering using Language Models and Knowledge Graphs This repo provides the source code & data of our paper: QA-GNN: Reasoning with L

434 Jan 4, 2023

Language models are open knowledge graphs ( non official implementation )

language-models-are-knowledge-graphs-pytorch Language models are open knowledge graphs ( work in progress ) A non official reimplementation of Languag

132 Dec 18, 2022

Some tentative models that incorporate label propagation to graph neural networks for graph representation learning in nodes, links or graphs.

1 Nov 18, 2021

Deep learning with dynamic computation graphs in TensorFlow

TensorFlow Fold TensorFlow Fold is a library for creating TensorFlow models that consume structured data, where the structure of the computation graph

1.8k Dec 28, 2022

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos. By adopting a unified pipeline-based API design, PyKale enforces standardization and minimalism, via reusing existing resources, reducing repetitions and redundancy, and recycling learning models across areas.

370 Dec 27, 2022

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

Related tags

Overview

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Model

Installation

Test run

Code description

Outputs

Evaluation

Visualization of graphs

Misc

Comments

GraphRNN with node feature

Covariate Adjustment + Single Graph + CPU Usage

How can I use graphRNN code to generate graphs with costume node labels or with node attributes?

MMD computation - square root missing

How does the GraphRNN-S deal with variable length?

Issue of generating reciprocal edges in directed graph

The implemented baseline issues

Owner

Jiaxuan

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

《Towards High Fidelity Face Relighting with Realistic Shadows》(CVPR 2021)

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

Pytorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

In this project we investigate the performance of the SetCon model on realistic video footage. Therefore, we implemented the model in PyTorch and tested the model on two example videos.

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images (ICCV 2021)

An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics.

Full body anonymization - Realistic Full-Body Anonymization with Surface-Guided GANs

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

QA-GNN: Question Answering using Language Models and Knowledge Graphs

Language models are open knowledge graphs ( non official implementation )

Some tentative models that incorporate label propagation to graph neural networks for graph representation learning in nodes, links or graphs.

Deep learning with dynamic computation graphs in TensorFlow

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos

Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper

The source code for Generating Training Data with Language Models: Towards Zero-Shot Language Understanding.

Generating Anime Images by Implementing Deep Convolutional Generative Adversarial Networks paper