Calling Julia from Python - an experiment on data loading

Abel Siqueira

Last update: Jun 7, 2022

Related tags

Deep Learning call-julia-from-python-experiments

Overview

Calling Julia from Python - an experiment on data loading

See the slides.

TLDR

After reading Patrick's blog post, we decided to try to replace C++ with Julia to check:

How easy/hard it is
How much improvement can be gained with a basic version
How much improvement can be gained with an optimized version

A basic version is already an improvement over the pure Python version, and an optimized version was faster than the C++ version.

Reproduction

Follow Patrick's blog post to install the C++ part.
Install Julia (We've used Julia 1.6.3)
- I recommend using Jill
- We'll refer to this Julia as path/to/julia.
Install Python
- Ideally, one dynamically linked to libpython.
- To test it, use ldd path/to/python and look for libpython3.9. It should exist for the shared version.
- If you don't have, look into workarounds here
- Tip: Archlinux's system Python is dynamically linked.
- We've used Python 3.9.7 from Archlinux.
Open Julia and enter the following commands:
- ENV["PYTHON"] = "path/to/python"
- using Pkg
- Pkg.add("PyCall")
- This will make sure that the packages we are installing use the correct Python version
Install juliapy with path/to/python -m pip install julia
Run path/to/python and enter
- import julia
- julia.install("julia=path/to/julia")
Download dataset and store in gen-data folder:
Run scalability_test.py - it should take several hours (over 10) and consume a moderate amount of memory.
Run scalability_analysis.py.

You might also like...

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Apache MXNet (incubating) for Deep Learning Master Docs License Apache MXNet (incubating) is a deep learning framework designed for both efficiency an

29 Nov 16, 2022

Numba-accelerated Pythonic implementation of MPDATA with examples in Python, Julia and Matlab

PyMPDATA PyMPDATA is a high-performance Numba-accelerated Pythonic implementation of the MPDATA algorithm of Smolarkiewicz et al. used in geophysical

Atmospheric Cloud Simulation Group @ Jagiellonian University

15 Nov 23, 2022

Python and Julia in harmony.

PythonCall & JuliaCall Bringing Python® and Julia together in seamless harmony: Call Python code from Julia and Julia code from Python via a symmetric

414 Jan 7, 2023

Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab

PySDM PySDM is a package for simulating the dynamics of population of particles. It is intended to serve as a building block for simulation systems mo

32 Oct 18, 2022

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

Comments

Fix python versions ~~using poetry~~

To prevent this pull request from becoming too large, I'll merge this and create a new issue to set the python versions.

Originally posted by @abelsiqueira in https://github.com/abelsiqueira/call-julia-from-python-experiments/issues/1#issuecomment-987970132

opened by abelsiqueira 1
Improve docker-10
Fixes: #10

Changes Ubuntu version to 21.10

Adds extra environment variables

Removes the Python virtual environment

Add make flags to compile the tools faster

Remove the downloaded tar files

Uninstall dev dependencies
opened by fdiblen 0

Calling Julia from Python - an experiment on data loading

Related tags

Overview

Calling Julia from Python - an experiment on data loading

TLDR

Reproduction

You might also like...

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Numba-accelerated Pythonic implementation of MPDATA with examples in Python, Julia and Matlab

Python and Julia in harmony.

Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

Small-bets - Ergodic Experiment With Python

Perspective: Julia for Biologists

MacroTools provides a library of tools for working with Julia code and expressions.

✔️ Visual, reactive testing library for Julia. Time machine included.

Comments

Fix python versions using poetry

Improve docker-10

Releases(v0.3.0)

v0.3.0(Jan 4, 2022)

v0.2.0(Dec 10, 2021)

v0.1.0(Nov 17, 2021)

v0.1.0-rc2(Nov 17, 2021)

v0.1.0-rc(Nov 17, 2021)

Owner

Abel Siqueira

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

A practical ML pipeline for data labeling with experiment tracking using DVC.

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

Python code for loading the Aschaffenburg Pose Dataset.

Simple tools for logging and visualizing, loading and training

Pytorch implementation of MLP-Mixer with loading pre-trained models.

dyld_shared_cache processing / Single-Image loading for BinaryNinja

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Calling Julia from Python - an experiment on data loading

Related tags

Overview

Calling Julia from Python - an experiment on data loading

TLDR

Reproduction

You might also like...

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Numba-accelerated Pythonic implementation of MPDATA with examples in Python, Julia and Matlab

Python and Julia in harmony.

Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

Small-bets - Ergodic Experiment With Python

Perspective: Julia for Biologists

MacroTools provides a library of tools for working with Julia code and expressions.

✔️ Visual, reactive testing library for Julia. Time machine included.

Comments

Fix python versions ~~using poetry~~

Improve docker-10

Releases(v0.3.0)

v0.3.0(Jan 4, 2022)

v0.2.0(Dec 10, 2021)

v0.1.0(Nov 17, 2021)

v0.1.0-rc2(Nov 17, 2021)

v0.1.0-rc(Nov 17, 2021)

Owner

Abel Siqueira

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

A practical ML pipeline for data labeling with experiment tracking using DVC.

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

Python code for loading the Aschaffenburg Pose Dataset.

Simple tools for logging and visualizing, loading and training

Pytorch implementation of MLP-Mixer with loading pre-trained models.

dyld_shared_cache processing / Single-Image loading for BinaryNinja

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Fix python versions using poetry