Procedural 3D data generation pipeline for architecture

Related tags

Overview

Synthetic Dataset Generator

Authors:

This is a tool that generates a dataset of synthetic buildings of different typologies.

Arxiv Website Samples

The generated data includes:

Mesh files of generated buildings, .obj format
Rendered images of the mesh, .png format
Rendered segmentation masks, .png format
Depth annotation, .png and .exr format
Surface normals annotation, .png format
Point cloud files, .ply format (the number of points by default is 2048, can be changed in dataset_config.py)

How To Use

Install Blender>=2.90. After installation make sure to add blender as an Environment variable.
Download the package as a .zip file or:

git clone https://github.com/CDInstitute/CompoNET

*Navigate to the Building-Dataset-Generator folder.

pip install -r requirements.txt

To create completely synthetic buildings use:

run.bat

Or:

blender setup.blend --python dataset.py

Unfortunately, it is not possible to use Blender in background mode as it will not render the image masks correctly.

Note: all the parameters related to the dataset (including any specific parameters for your buildings (e.g. max and min height / width / length)) are to be provided in dataset_config.py. Default values adhere to international standards (min) and most common European values (max):

minimum height 3m
minimum length and width 6m
maximum length, width, height 30 m Other values to set:
number of dataset samples
building types
component materials
rendered image dimensions
number of points in the point clouds
paths to store the generated data
option to save the .exr files

Annotation structure

{'img': 'images/0.png', 'category': 'building', 'img_size': (256, 256), '2d_keypoints': [], 'mask': 'masks/0.png', 'img_source': 'synthetic', 'model': 'models/0.obj', 'point_cloud': 'PointCloud/0.ply', 'model_source': 'synthetic', 'trans_mat': 0, 'focal_length': 35.0, 'cam_position': (0.0, 0.0, 0.0), 'inplane_rotation': 0, 'truncated': False, 'occluded': False, 'slightly_occluded': False, 'bbox': [0.0, 0.0, 0.0, 0.0], 'material': ['concrete', 'brick']}

Performance

We ran the dataset generation algorithm for 100 model samples with different input parameters on Windows 10 OS on CPU and GPU using AMD Ryzen 7 3800-X 8-Core Processor and GeForce GTX 1080. Here we report the results for the multiview generation (3 views per model):

GPU	Multiview	Time (h)
		1.7
	✅	2.7
✅		0.34
✅	✅	0.8

Citation

Bibtex format

@inproceedings{fedorova2021synthetic,
      title={Synthetic 3D Data Generation Pipeline for Geometric Deep Learning in Architecture}, 
      author={Stanislava Fedorova and Alberto Tono and Meher Shashwat Nigam and Jiayao Zhang and Amirhossein Ahmadnia and Cecilia Bolognesi and Dominik L. Michels},
      year={2021},
}

Generated Image Samples

Comments

Getting it to work on Linux

I have installed the dependencies (including Blender) but I am not able to run the generation script because setup.txt is not found. There is a comment hinting that it should be in a Windows-based Anaconda env directory. I am also using Anaconda, but on Ubuntu Linux. I tried looking in the environment folder which I created for this project, but I can't find a setup.txt anywhere.

But because the path is relative I would assume that it should be found in the repository root. So maybe it is just missing?

Any help is appreciated.

Full stacktrace:

(building-dataset-generator) isaek@demoman1:~/PycharmProjects/Building-Dataset-Generator$ blender setup.blend --python dataset.py
Read prefs: /home/isaek/.config/blender/2.92/config/userpref.blend
Read blend: /home/isaek/PycharmProjects/Building-Dataset-Generator/setup.blend
Traceback (most recent call last):
  File "/home/isaek/PycharmProjects/Building-Dataset-Generator/dataset.py", line 13, in <module>
    from annotation import Annotation
  File "/home/isaek/PycharmProjects/Building-Dataset-Generator/annotation.py", line 10, in <module>
    from dataset_config import *
  File "/home/isaek/PycharmProjects/Building-Dataset-Generator/dataset_config.py", line 54, in <module>
    SCRIPT_PATH = open('setup.txt').read()[:-1] # '"D:\ProgramFiles\Anaconda\envs\py37\Lib\site-packages"
FileNotFoundError: [Errno 2] No such file or directory: 'setup.txt'

opened by iboates 4

A Sklearn-like Framework for Hyperparameter Tuning and AutoML in Deep Learning projects. Finally have the right abstractions and design patterns to properly do AutoML. Let your pipeline steps have hyperparameter spaces. Enable checkpoints to cut duplicate calculations. Go from research to production environment easily.

Neuraxle Pipelines Code Machine Learning Pipelines - The Right Way. Neuraxle is a Machine Learning (ML) library for building machine learning pipeline

555 Dec 24, 2022

Automates Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning :rocket:

MLJAR Automated Machine Learning Documentation: https://supervised.mljar.com/ Source Code: https://github.com/mljar/mljar-supervised Table of Contents

2.4k Dec 31, 2022

We have implemented shaDow-GNN as a general and powerful pipeline for graph representation learning. For more details, please find our paper titled Deep Graph Neural Networks with Shallow Subgraph Samplers, available on arXiv (https//arxiv.org/abs/2012.01380).

Deep GNN, Shallow Sampling Hanqing Zeng, Muhan Zhang, Yinglong Xia, Ajitesh Srivastava, Andrey Malevich, Rajgopal Kannan, Viktor Prasanna, Long Jin, R

117 Dec 20, 2022

git《Pseudo-ISP: Learning Pseudo In-camera Signal Processing Pipeline from A Color Image Denoiser》(2021) GitHub: [fig5]

TorchX is a library containing standard DSLs for authoring and running PyTorch related components for an E2E production ML pipeline.

TorchX is a library containing standard DSLs for authoring and running PyTorch related components for an E2E production ML pipeline

193 Dec 22, 2022

Procedural 3D data generation pipeline for architecture

Related tags

Overview

Synthetic Dataset Generator

Authors:

How To Use

Annotation structure

Performance

Citation

Generated Image Samples

You might also like...

Automates Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning :rocket:

We have implemented shaDow-GNN as a general and powerful pipeline for graph representation learning. For more details, please find our paper titled Deep Graph Neural Networks with Shallow Subgraph Samplers, available on arXiv (https//arxiv.org/abs/2012.01380).

git《Pseudo-ISP: Learning Pseudo In-camera Signal Processing Pipeline from A Color Image Denoiser》(2021) GitHub: [fig5]

[CVPR-2021] UnrealPerson: An adaptive pipeline for costless person re-identification

Clairvoyance: a Unified, End-to-End AutoML Pipeline for Medical Time Series

Evaluation Pipeline for our ECCV2020: Journey Towards Tiny Perceptual Super-Resolution.

This repository contains the code for our fast polygonal building extraction from overhead images pipeline.

TorchX is a library containing standard DSLs for authoring and running PyTorch related components for an E2E production ML pipeline.

Comments

Getting it to work on Linux

Owner

Computational Design Institute

code for paper "Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?"

Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Loop Story Generation"

A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision for Visual Scene Graph Generation''

Image-generation-baseline - MUGE Text To Image Generation Baseline

This demo showcase the use of onnxruntime-rs with a GPU on CUDA 11 to run Bert in a data pipeline with Rust.

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

DWIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.

A practical ML pipeline for data labeling with experiment tracking using DVC.

torchlm is aims to build a high level pipeline for face landmarks detection, it supports training, evaluating, exporting, inference(Python/C++) and 100+ data augmentations

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".