Vertex AI: Serverless framework for MLOps (ESP / ENG)

Hernán Escudero

Last update: Apr 28, 2022

Related tags

Deep Learning vertex-pipelines-kfp-chicago

Overview

Español

¿Qué es esto?

Este repo contiene un pipeline end-to-end diseñado con el SDK de Kubeflow Pipelines (KFP). Usando Vertex AI como solución principal, la idea es construir una arquitectura de machine learning lo más automatizada posible, integrando algunos de los principales servicios de Google Cloud Platform (GCP), tales como BigQuery (data warehousing), Google Cloud Storage (almacenamiento de objetos) y Container Registry (repositorio de imágenes de Docker).

¿Cómo lo corro?

Primero, ejecutar la notebook pipeline_setup.ipynb. Contiene la configuración de la infraestructura que será utilizada: se crean datasets en BigQuery y buckets en GCS, y se instalan las librerías necesarias. Además, se crean imágenes de Docker y se pushean a Container Registry para los jobs de ajuste de hiperparámetros.
Segundo, dentro de la carpeta components se encuentra la notebook components_definition.ipynb, que deberá ejecutarse para generar los archivos .yaml que serán invocados en la notebook principal de ejecución.
Por último, seguir los pasos indicados en pipeline_run.ipynb. Algunos parámetros, como la cantidad de trials de hiperparámetros o los tipos de máquina deseados para algunos pasos, pueden modificarse fácilmente.

TO-DO

Agregar costo estimado
Permisos

English

What is this?

This repo contains an end-to-end pipeline built with the Kubeflow Pipelines (KFP) SDK. With Vertex AI as the main solution, the idea is to build a machine learning architecture that is as automated as possible, integrating some of the main Google Cloud Platform (GCP) services: BigQuery (data warehousing), Google Cloud Storage (object storage) and Container Registry (Docker image repository).
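
As a rough illustration of how the three services fit together, the sketch below derives the identifiers each one expects from a single configuration object. The class name `PipelineConfig`, its fields, the `pipeline_root` folder and the example values are all hypothetical and not taken from the repo; only the identifier formats (`gs://` URIs, `project.dataset` IDs, `gcr.io/project/image:tag` image names) follow the usual GCP conventions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PipelineConfig:
    """Hypothetical settings shared by the setup and run notebooks."""

    project_id: str
    bucket_name: str
    dataset_name: str
    image_name: str
    image_tag: str = "latest"

    @property
    def pipeline_root(self) -> str:
        # GCS location where pipeline artifacts would be written.
        return f"gs://{self.bucket_name}/pipeline_root"

    @property
    def bq_dataset_id(self) -> str:
        # Fully qualified BigQuery dataset ID (project.dataset).
        return f"{self.project_id}.{self.dataset_name}"

    @property
    def image_uri(self) -> str:
        # Container Registry image that a tuning job would pull.
        return f"gcr.io/{self.project_id}/{self.image_name}:{self.image_tag}"


# Placeholder values, for illustration only.
config = PipelineConfig(
    project_id="my-project",
    bucket_name="my-bucket",
    dataset_name="my_dataset",
    image_name="hp-tuning",
)
print(config.pipeline_root)
print(config.bq_dataset_id)
print(config.image_uri)
```

Keeping these values in one object means the bucket, dataset and image names only need to be changed in a single place when moving to another project.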

How do I run it?

First, run the pipeline_setup.ipynb notebook. It sets up the infrastructure to be used: it creates BigQuery datasets and GCS buckets and installs the necessary libraries. It also builds Docker images and pushes them to Container Registry for the hyperparameter tuning jobs.
Second, run the components_definition.ipynb notebook in the components folder to generate the .yaml files that the main execution notebook invokes.
Last, follow the steps in pipeline_run.ipynb. Some parameters, such as the number of hyperparameter tuning trials or the machine types used for certain steps, can be easily modified.
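
The adjustable parameters mentioned in the last step could be gathered in one place before the run is submitted. The following is a minimal sketch under assumptions: the parameter names (`max_trials`, `parallel_trials`, `training_machine_type`), their defaults, and the helper `build_parameter_values` are hypothetical and do not come from pipeline_run.ipynb.

```python
from typing import Any, Dict, Optional

# Hypothetical defaults; the real notebook may use other names and values.
DEFAULT_PARAMETERS: Dict[str, Any] = {
    "max_trials": 4,
    "parallel_trials": 2,
    "training_machine_type": "n1-standard-4",
}


def build_parameter_values(
    overrides: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Merge user overrides into the defaults and run basic sanity checks."""
    overrides = overrides or {}

    # Reject names that the (assumed) pipeline does not declare.
    unknown = set(overrides) - set(DEFAULT_PARAMETERS)
    if unknown:
        raise KeyError(f"Unknown pipeline parameters: {sorted(unknown)}")

    values = {**DEFAULT_PARAMETERS, **overrides}
    if values["max_trials"] < 1:
        raise ValueError("max_trials must be at least 1")
    if values["parallel_trials"] > values["max_trials"]:
        raise ValueError("parallel_trials cannot exceed max_trials")
    return values


# Override only the number of trials; everything else keeps its default.
parameter_values = build_parameter_values({"max_trials": 10})
print(parameter_values)
```

A dictionary like this is the kind of value that the `parameter_values` argument of `google.cloud.aiplatform.PipelineJob` accepts, assuming the compiled pipeline declares parameters with matching names.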

To-do

Estimated cost
Roles

Comments

Quick XGB question

Hi, is there a reason you seemed to go down the XGB route but then decided against it? (I can see some XGB lines commented out.)
I was thinking you likely decided it was not needed to illustrate what you wanted. Or was there another reason (like it didn't work)? Thanks!

opened by david-dirring 2

Missing components_definition.ipynb?

Great repo, thanks for the details on how you leveraged Vertex in this example. I was not able to find the components_definition.ipynb file. Can you take a look? Thanks!

opened by david-dirring 2