Open MLOps - A Production-focused Open-Source Machine Learning Framework

Data Revenue

Last update: Dec 28, 2022

Related tags

Machine Learning OpenMLOps

Overview

Open MLOps - A Production-focused Open-Source Machine Learning Framework

Open MLOps is a set of open-source tools carefully chosen to ease user experience of conducting machine learning experiments and deploying machine learning models. Read our announcement blog post with more background here.

We also provide a step by step set-up guide and some other getting started tutorials.

In this repository, we provide these applications as Terraform modules with which the user will be able to install them into a Kubernetes cluster. The tools we provide are the following:

Prefect for data flow automation
Jupyter Hub for experimenting lab
Dask for distributed computing
Feast for Feature Store and Serving
MLFlow for model registry and experiment tracking
Seldon for model deployment

Other repositories

Modules

Jupyter Hub
Prefect
Dask
MLFlow
Feast
Seldon

Jupyter Hub

With the Jupyter Hub, we enabled a multi-user environment in which each of them can spawn a Jupyter server to do their experiments. Users can work on different environments being able to install any library necessary to meet their needs.

We provide a default Jupyter server image that comes with most of the data science packages installed. Users can use their own Jupyter server images as well.

Configuration

Below we provide a lists of the configurable parameters available and their default values.

Parameter (* required parameter)	Description	Default
`jupyterhub_namespace`	Namespace to install jupyterhb	`jhub`

Proxy configuration

The proxy receives the requests from the client’s browser and forwards all requests to the Hub. In the JupyterHub docs you can find a more in-depth explanation.

* Required parameters

Parameter	Description	Default
`jhub_proxy_https_enabled`	Indicator to set whether HTTPS should be enabled or not on the proxy	`false`
`jhub_proxy_https_hosts`	You domains in list form. Required for automatic HTTPS	`[]`
`jhub_proxy_secret_token *`	A 32-byte cryptographically secure randomly generated string used to secure communications between the hub and the configurable-http-proxy (for example, generated by `openssl rand -hex 32`)	`nil`
`jhub_proxy_https_letsencrypt_contact_email`	The contact email to be used for automatically provisioned HTTPS certificates by Let’s Encrypt	`""`

Authentication configuration

JupyterHub’s OAuthenticator has support for enabling your users to authenticate via a third-party OAuth2 identity provider such as GitHub.

You can configure authentication using GitHub accounts and restrict what users are authorized based on membership in a GitHub organization.

See details on how to set up a GitHub Oauth here.

If you choose not to use GitHub to authenticate users, the DummyAuthenticator will be used as default. The Dummy Authenticator lets any user log in with the given password.

The dummy password is: a-shared-secret-password.

* Required parameters ** Required when oauth_github_enable is enabled

Parameter	Description	Default
`oauth_github_enable`	Defines whether the authentication will be handled by github oauth. Required when `oauth_github_enable` is enabled	`false`
`oauth_github_client_id **`	Github client id used on GitHubOAuthenticator.	`""`
`oauth_github_client_secret **`	Github secret used to authenticate with github.	`""`
`oauth_github_admin_users`	List of github user names to allow as administrator	`[]`
`oauth_github_callback_url`	The URL that people are redirected to after they authorize your GitHub App to act on their behalf	`""`
`oauth_github_allowed_organizations`	List of Github organization to restrict access to the members	`[""]`

User configuration

Single user configuration refers to the default settings for each user logged in the JupyterHub.

A user can choose a Docker image to spawn a new Jupyter server. Each Docker image can have different libraries and environments installed. We use the singleuser_profile_list parameter to set up a list of default images available to the user. This parameter receives a list of maps that describes the image details such as the image location and description.

See an example:

[{
  display_name = "Prefect"
  description  = "Notebook with prefect installed"
  default      = true
  kubespawner_override = {
    image = "drtools/prefect:notebook-prefect"
  }
}]

You must pass the image pull secret if you provide an image located in a private container registry. The image pull secret parameter is defined as below:

default = [{
    name = ""
}]

Parameter	Description	Default
`singleuser_profile_list`	List of images which the user can select to spawn a server
`singleuser_image_pull_secrets`	List of image secrets	nil
`singleuser_image_pull_policy`	Image pull policy	`Always`
`singleuser_memory_guarantee`	How much memory will be guarateed to the user	`1G`
`singleuser_storage_capacity`	How much storage capacity a user will have	`1G`
`singleuser_storage_mount_path`	Storage mount path	`/home/jovyan/persistent`

Prefect

...

Parameter	Description	Default
`namespace`	Namespace name to deploy the application	`prefect
`prefect_version_tag`	Configures the default tag for prefect images	`latest`

Agent

According to Prefect docs, Agents are lightweight processes for orchestrating flow runs. Agents run inside a user's architecture, and are responsible for starting and monitoring flow runs. During operation the agent process queries the Prefect API for any scheduled flow runs, and allocates resources for them on their respective deployment platforms.

Parameter	Description	Default
`agent_enabled`	determines if the Prefect Kubernetes agent is deployed	`True`
`agent_prefect_labels`	Defines what scheduling labels (not K8s labels) should be associated with the agent	`[""]`
`agent_image_name`	Defines the prefect agent image name	`prefecthq/prefect`
`agent_image_tag`	Defines agent image tag	`"`
`agent_image_pull_policy`	Defines the image pull policy	`Always`

Postgresql

Parameter	Description	Default
`postgresql_database`	Defines the postgresql database name	`prefect`
`postgresql_username`	Defines the username to authenticate with	`prefect`
`postgresql_existing_secret`	Configures which secret should be referenced for access to the database.	`""`
`postgresql_service_port`	Configures the port that the database should be accessed at	`5432`
`postgresql_external_hostname`	Defines the address to contact an externally managed postgres database instance at	`""`
`postgresql_use_subchart`	Determines if a this chart should deploy a user-manager postgres database or use an externally managed postgres instance	`true`
`postgresql_persistence_enabled`	Enables a PVC that stores the database between deployments. If making changes to the database deployment, this PVC will need to be deleted for database changes to take effect. This is especially notable when the authentication password changes on redeploys.	`false`
`postgresql_persistence_size`	Defines the persistence storage size for postgres	`8G`
`postgresql_init_user`	Defines the initial db username	`postgres`

Dask

...

Parameter	Description	Default
`namespace`	Namespace name to deploy the application	`dask`
`worker_name`	Dask worker name	`worker`
`worker_replicas`	Default number of workers	`3`
`worker_image_repository`	Containe image repository	`daskdev/dask`
`worker_image_tag`	Container image tag	`2.30.0`
`worker_image_pull_policy`	Container image pull policy.	`IfNotPresent`
`worker_image_dask_worker_command`	ask worker command. E.g `dask-cuda-worker` for GPU worker.	`dask-worker`
`worker_image_pull_secret`	Container image pull secrets	`[{name: ""}]`
`worker_environment_variables`	Environment variables. See values.yaml for example values.	`[{}]`

Feast

...

Parameter	Description	Default
`namespace`	Namespace name to deploy the application	`feast`
`feast_core_enabled`	Defines whether to install feast core	`True`
`feast_online_serving_enabled`	Defines whether to install feast server	`True`
`feast_jupyter_enabled`	Defines whether to install feast jupyther hub	`False`
`feast_jobservice_enabled`	Defines whether to install feast job service	`True`
`feast_posgresql_enabled`	Defines whether to enable postgresql	`True`
`feast_postgresql_password *`	Postgress password	`""`
`feast_kafka_enabled`	Defines whether to enable kafka	`False`
`feast_redis_enabled`	Defines whether to enable redis	`True`
`feast_redis_use_password`	Defines whether to enable redis password	`False`
`feast_prometheus_enabled`	Defines whether to install prometheys	`False`
`feast_prometheus_statsd_exporter_enabled`	Defines whether to enable statsd exporter	`False`
`feast_grafana_enabled`	Defines whether to enable grafana	`True`

MLFlow

...

Parameter	Description	Default
`namespace`	Namespace name to deploy the application	`mlflow`
`db_host`	Database host address	``
`db_username`	Database username	`mlflow`
`db_password *`	Database password	``
`database_name`	Database name	`mlflow`
`db_port`	Database port	`5432`
`default_artifact_root`	local or remote filepath to store model artifacts. It is mandatory when specifying a database backend store	`/tmp`
`image_pull_policy`	Docker image pull policy	`IfNotPresent`
`image_repository`	Docker image repository	`drtools/mlflow`
`image_tag`	Docker image tag	`1.13.1`
`service_type`	Kubernetes service type	`NodePort`
`docker_registry_server`	Docker Registry Server	``
`docker_auth_key`	Base64 Enconded combination of {registry_username}:{registry_password}. Can be found in ~/.docker/config.json	``
`docker_private_repo`	Whether the MLFlow's image comes from a private repository or not. If `true`, `docker_registry_server` and `docker_auth_key` will be required	`false`

Note: The variables docker_registry_server and docker_auth_key are optional and should only be used when pulling MLFlow's image from a private repository.

Seldon

Parameter	Description	Default
`namespace`	Namespace name to deploy the application	`mlflow`
`istio_enabled`	Whether to install istio as ingress controller	`true`
`usage_metrics_enabled`	Whether to enable usage metrics	`true`

Exposing Services

In order to access the services from outside the cluster, we need to expose them. Usually, this is done through Kubernetes Ingress resources. In this project, since we rely on Seldon to expose our prediction endpoints, we use Ambassador API Gateway as our ingress controller. Seldon Core works well with Ambassador, allowing a single ingress to be used to expose ambassador and running machine learning deployments can then be dynamically exposed through seldon-created ambassador configurations.

Ambassador

Ambassador is a Kubernetes-native API Gateway built on the Envoy Proxy. In addition to the classical routing capabilities of an ingress, it can perform sophisticated traffic management functions, such as load balancing, circuit breakers, rate limits, and automatic retries. Also, it has support for independent authentication systems, such as the ORY ecosystem.

Exposing a service in Ambassador

Ambassador is designed around a declarative, self-service management model. The core resource used to support application development teams who need to manage the edge with Ambassador is the Mapping resource. This resource allows us to define custom routing rules to our services. This routing configuration can achieved by applying a custom Kubernetes Resource like the following

# mapping.yaml
---
apiVersion: getambassador.io/v2
kind:  Mapping
metadata:
  name:  httpbin-mapping
spec:
  prefix: /httpbin/
  service: httpbin.httpbin_namespace

By applying this configuration with kubectl apply -f httpbin-mapping.yaml.

Terraform

Since this project uses Terraform to manage resources and, with the current version, it's still not possible to apply custom Kubernetes resource definitions, we need to add this YAML file inside the services annotation. One way to do this is by using Service's Metadata field

resource "kubernetes_service" "httpbin" {
  metadata {
    ...
    annotations = {
      "getambassador.io/config" = <<YAML
---
apiVersion: getambassador.io/v2
kind: Mapping
name: httpbin-mapping
service: httpbin.httpbin_namespace
prefix: /httpbin/
YAML
    }
  }
}

This will produce the same behaviour as applying the custom yaml file described above.

Authentication

Since we're exposing our services in the Internet, we need an Authentication and Authorization system to prevent unwanted users to accessing our services. Ambassador API Gateway can control the access by using an External Authentication Service resource (AuthService). An AuthService is an API that has a verification endpoint, which determines if the user can access this resource (returning 200 or not, 401). In this project, we rely on ORY ecosystem to enable authentication. ORY is an open-source ecosystem of services with clear boundaries that solve authentication and authorization.

Session Lifespan

The session lifespan of authenticated users can be managed through the /ory/kratos/values.yaml file. By default, the session lifespan is 24h, but it is currently set to 30 days.

kratos:
  config:
  ...
    session:
      cookie:
        domain: ${cookie_domain}
      lifespan: 720h

ORY Oathkeeper

ORY Oathkeeper is an Identity and Access Proxy. It functions as a centralized way to manage different Authentication and Authorization methods, and inform the gateway, whether the HTTP request is allowed or not. The Oathkeeper serves perfectly as an Ambassador's External AuthService.

Zero-Trust and Unauthorized Resources

Oathkeeper is rooted in the principle of "never trust, always verify,". This means that if no additional configuration is provided, the Oathkeeper will always block the incoming request. In practice, all endpoints exposed in Ambassador will be blocked by external requests, until further configuration is made.

Access Rules

To configure an access rule to ORY Oathkeeper, the file access-rule-oathkeeper.yaml is used. Example:

Allow all incoming requests

- id: oathkeeper-access-rule
  match:
    url: <{http,https}>://${hostname}/allowed-service/<**>
    methods:
      - GET
  authenticators:
    - handler: anonymous
  authorizer:
    handler: allow
  mutators:
    - handler: noop
  credentials_issuer:
    handler: noop

This configuration will register all the incoming requests as a guest user, thus, not performing any credentials validation.

Authorize on KRATOS

- id: httpbin-access-rule
  match:
    url: <{http,https}>://${hostname}/blocked-service/<**>
    methods:
      - GET
  authenticators:
    - handler: cookie_session
  authorizer:
    handler: allow
  mutators:
    - handler: id_token
  credentials_issuer:
    handler: noop
  errors:
    - handler: redirect
      config:
        to: http://${hostname}/auth/login

This configuration will force authenticating all incoming requests by checking a cookie_session, which configuration is specified in config-oathkeeper.yaml

Comments

Error creating S3 bucket: BucketAlreadyOwnedByYou

When I try to follow the instruction at https://github.com/datarevenue-berlin/OpenMLOps/blob/master/tutorials/set-up-open-source-production-mlops-architecture-aws.md

I got to the step running: terraform apply -var-file=my_vars.tfvars

But I got: Error creating S3 bucket: BucketAlreadyOwnedByYou: Your previous request to create the named bucket succeeded and you already own it. │ │ with aws_s3_bucket.mlflow_artifact_root, │ on main.tf line 11, in resource "aws_s3_bucket" "mlflow_artifact_root": │ 11: resource "aws_s3_bucket" "mlflow_artifact_root"

If I delete the S3 bucket and rerun the command, I got: Error loading state: S3 bucket does not exist.

So maybe it was using that bucket, but then later try to create the bucket again?

opened by NhatAnh 12
Deployment problem

Hello, I have a problem following the steps in the tutorial, during the first terraform apply in the creation of the resource : module.mlops-architecture-eks.helm_release.autoscaler I always get the following error:

│ Error: Post "http://localhost/api/v1/namespaces/kube-system/configmaps": dial tcp 127.0.0.1.1:80: connect: connection refused. │ │ with module.mlops-architecture-eks.module.eks.kubernetes_config_map.aws_auth[0], │ on .terraform/modules/mlops-architecture-eks.eks/aws_auth.tf line 63, in resource "kubernetes_config_map" "aws_auth": │ 63: resource "kubernetes_config_map" "aws_auth" {. │ ╵

opened by Pfriasf 6
Unsupported Kubernetes Version Error

Hello, and thank you for this great resource. I'm running into an error while going through the architecture set-up tutorial for AWS. I'm able to successfully run everything including "terraform plan -var-file=my_vars.tfvars". I get the following error when I run "terraform apply -var-file=my_vars.tfvars". Let me know if there's any other helpful information I could provide:

opened by Fadi-Almachraki 4

Local minikube tutorial failure

I am trying to follow the tutorial Set up your local minikube machine learning architecture for experimenting hands-on with the components of OpenMLOps. Unfortunately, I am not able to follow all the instructions to the end.

Disclaimer: I am new to Terraform and Kubernetes.

To begin with, I tried to start minikube in two ways,

with minikube start --kubernetes-version=v1.17.17
and without a specified kubernetes version.

In case 1., I could not get minikube to start running. Here is the entire output of the command.

In the latter (without specifying the version), I believe minikube defaults to v1.22.2 as per the output I get from kubectl version. In this case, I am able to get minikube running and start a tunnel as well:

Status:
        machine: minikube
        pid: 10105
        route: 10.96.0.0/12 -> 192.168.49.2
        minikube: Running
        services: [mlflow]
    errors: 
                minikube: no errors
                router: no errors
                loadbalancer emulator: no errors

All ok until I get to the step terraform apply -var-file=my_vars.tfvars, which outputs the following at the end (the only 3 changes not able to complete):

module.ambassador[0].helm_release.ambassador[0]: Creating...
module.dask-jupyterhub.helm_release.dask-jupyterhub: Creating...
module.prefect-server.helm_release.prefect-server: Creating...
╷
│ Error: failed to install CRD crds/filter.yaml: unable to recognize "": no matches for kind "CustomResourceDefinition" in version "apiextensions.k8s.io/v1beta1"
│ 
│   with module.ambassador[0].helm_release.ambassador[0],
│   on modules/ambassador/main.tf line 1, in resource "helm_release" "ambassador":
│    1: resource "helm_release" "ambassador" {
│ 
╵
╷
│ Error: failed to install CRD crds/daskclusters.yaml: unable to recognize "": no matches for kind "CustomResourceDefinition" in version "apiextensions.k8s.io/v1beta1"
│ 
│   with module.dask-jupyterhub.helm_release.dask-jupyterhub,
│   on modules/dask-jupyterhub/main.tf line 1, in resource "helm_release" "dask-jupyterhub":
│    1: resource "helm_release" "dask-jupyterhub" {
│ 
╵
╷
│ Error: unable to build kubernetes objects from release manifest: unable to recognize "": no matches for kind "RoleBinding" in version "rbac.authorization.k8s.io/v1beta1"
│ 
│   with module.prefect-server.helm_release.prefect-server,
│   on modules/prefect-server/main.tf line 1, in resource "helm_release" "prefect-server":
│    1: resource "helm_release" "prefect-server" {
│ 
╵

Why is this happening?

opened by bgalvao 4

How to configure running this framework on local Kubernetes cluster?

Thanks for sharing your OpenMLOps framework. I hope to find some guidance to configure this framework to make it work on our local Kubernetes cluster. What modifications need to be done for each component?

opened by fadishaar84 4
MLOPS 349: Seldon storage initializer image and values.yaml

I have moved the set value from main.tf to the values.yaml file and added a different image to the storageInitializer thing, so that we don't have to change it after deploying using kubectl get configmaps -n seldon

opened by bernardolk 4
A few questions after following the setup guide
I was following this guide: https://github.com/datarevenue-berlin/OpenMLOps/blob/master/tutorials/set-up-open-source-production-mlops-architecture-aws.md#configuring-the-my_varstfvars-file and have a few questions.

Do I need to set additional_aws_users field in my_vars.tfvars to an empty list if I just have only one user (me)?

If I am the only user, do I need to fill in oauth2_providers? Or do I just need to make it an empty list? If I need to create a new OAuth application, what should I fill in the Authorization callback URL text box?

If I set both 2 attributes above to be empty lists and run terraform init, it asks me to enter an S3 bucket name. What should I fill in this? If it's related to the bucket_name field in the my_vars.tfvars file, I've already set it.
opened by off99555 3
error creating EKS Cluster (eks-mlops): InvalidParameterException: unsupported Kubernetes version

I followed steps and I got this error when I do:

terraform apply -var-file=my_vars.tfvars

Error: error creating EKS Cluster (eks-mlops): InvalidParameterException: unsupported Kubernetes version

I dont know what is the problem here and how to debug

opened by oristides 2
OpenMLOps has no license

OpenMLOps has no license, according to GitHub[1]:

...without a license, the default copyright laws apply, meaning that you (Data Revenue) retain all rights to your source code and no one may reproduce, distribute, or create derivative works from your work.

So no one external to Data Revenue can use this repo.

opened by KellyJDavis 2
Add prefect pull image secret

This will allow prefect to pull images from private repositories. For this to work with current Prefect version though, we have to set this line image_pull_secrets=["regcred"] into the KubernetesRun flow config init method, because it seems like this Prefect helm chart has quite a few bugs in it.

opened by bernardolk 1

Fix warnings in documentation rst files.

Fixes the current warnings in docs/**/*rst files

The "inline strong" and "Bullet list" warnings both affect the rendered output.
The "Title underline" warning does not affect the rendered output.

docs/components/ory/technical.rst:23: WARNING: Bullet list ends without a blank line; unexpected unindent.
docs/components/ory/usage.rst:14: WARNING: Inline strong start-string without end-string.
docs/components/ory/usage.rst:33: WARNING: Inline strong start-string without end-string.
docs/components/ory/usage.rst:56: WARNING: Title underline too short.

Identities and Providers
~~~~~~~~~~~~~~~~~~~~
docs/components/ory/usage.rst:56: WARNING: Title underline too short.

Identities and Providers
~~~~~~~~~~~~~~~~~~~~

hacktoberfest-accepted

opened by spazm 1

minikube local installation error
i am following this tutorial and getting this error in this statement "terraform apply -var-file=my_vars.tfvars"

https://github.com/datarevenue-berlin/OpenMLOps/blob/master/tutorials/set-up-minikube-cluster.md

following the versions:

Terraform v1.0.8 kubectl server version v1.21.5 and client version v1.21.5, which includes: minikube version: v1.23.2

Minikube start --kubernetes-version=v1.21.5 --vm-driver = hyperv

@sixhobbits @spazm @pedrocwb @timotk
opened by Charanv09 0
Support for Ray

Hi, I was wondering whether there is plan to support Ray for distributed training. I want an ML platform that supports both deep learning and traditional models, and looks like Dask does not have a very support for distributed deep learning. Thanks

opened by huanvo88 0
error creating Kubernetes cluster with AWS tutorial

Hi all, I am trying to follow the AWS tutorial on setting OpenMLOps and I am getting the bellow error when I run terraform apply.

any help with that? Thanks in advance

opened by LameesKadhim 1

Releases(0.1.0)

0.1.0(Aug 16, 2021)

First release of OpenMLOPs
Source code(tar.gz)
Source code(zip)

Owner

Data Revenue

GitHub

Evidently helps analyze machine learning models during validation or production monitoring

Evidently helps analyze machine learning models during validation or production monitoring. The tool generates interactive visual reports and JSON profiles from pandas DataFrame or csv files. Currently 6 reports are available.

3.1k Jan 7, 2023

Production Grade Machine Learning Service

This project is made to help you scale from a basic Machine Learning project for research purposes to a production grade Machine Learning web service

10 Apr 4, 2022

Machine learning that just works, for effortless production applications

16 Sep 2, 2022

Iris-Heroku - Putting a Machine Learning Model into Production with Flask and Heroku

Puesta en Producción de un modelo de aprendizaje automático con Flask y Heroku L

1 Jun 3, 2022

⏳ Tempo: The MLOps Software Development Kit

Tempo provides a unified interface to multiple MLOps projects that enable data scientists to deploy and productionise machine learning systems.

36 Jun 20, 2021

This is a public repo where code samples are stored for the book Practical MLOps.

[Book-2021] Practical MLOps O'Reilly Book

421 Dec 31, 2022

End to End toy example of MLOps

churn_model MLOps Toy Example End to End You might find below links useful Connect VSCode to Git MLFlow Port Heroku App Project Organization ├── LICEN

6 Feb 6, 2022

ClearML - Auto-Magical Suite of tools to streamline your ML workflow. Experiment Manager, MLOps and Data-Management

ClearML - Auto-Magical Suite of tools to streamline your ML workflow Experiment Manager, MLOps and Data-Management ClearML Formerly known as Allegro T

4k Jan 9, 2023

MLOps pipeline project using Amazon SageMaker Pipelines

This project shows steps to build an end to end MLOps architecture that covers data prep, model training, realtime and batch inference, build model registry, track lineage of artifacts and model drift detection. It utilizes SageMaker Pipelines that offers machine learning (ML) to orchestrate SageMaker jobs and author reproducible ML pipelines.

3 Sep 16, 2022

Azure MLOps (v2) solution accelerators.

Azure MLOps (v2) solution accelerator Welcome to the MLOps (v2) solution accelerator repository! This project is intended to serve as the starting poi

233 Jan 1, 2023

A demo project to elaborate how Machine Learn Models are deployed on production using Flask API

This is a salary prediction website developed with the help of machine learning, this makes prediction of salary on basis of few parameters like interview score, experience test score.

1 Feb 10, 2022

Pytools is an open source library containing general machine learning and visualisation utilities for reuse

pytools is an open source library containing general machine learning and visualisation utilities for reuse, including: Basic tools for API developmen

26 Nov 6, 2022

Data Version Control or DVC is an open-source tool for data science and machine learning projects

Continuous Machine Learning project integration with DVC Data Version Control or DVC is an open-source tool for data science and machine learning proj

2 Jul 29, 2021

SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker.

SageMaker Python SDK SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker. With the S

1.8k Jan 1, 2023

MLReef is an open source ML-Ops platform that helps you collaborate, reproduce and share your Machine Learning work with thousands of other users.

The collaboration platform for Machine Learning MLReef is an open source ML-Ops platform that helps you collaborate, reproduce and share your Machine

1.4k Dec 27, 2022

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Karate Club is an unsupervised machine learning extension library for NetworkX. Please look at the Documentation, relevant Paper, Promo Video, and Ext

1.8k Jan 3, 2023

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Ray provides a simple, universal API for building distributed applications. Ray is packaged with the following libraries for accelerating machine lear

23.3k Dec 31, 2022

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Master status: Development status: Package information: TPOT stands for Tree-based Pipeline Optimization Tool. Consider TPOT your Data Science Assista

8.9k Jan 9, 2023

Python Extreme Learning Machine (ELM) is a machine learning technique used for classification/regression tasks.

Python Extreme Learning Machine (ELM) Python Extreme Learning Machine (ELM) is a machine learning technique used for classification/regression tasks.

84 Nov 25, 2022

Open MLOps - A Production-focused Open-Source Machine Learning Framework

Related tags

Overview

Open MLOps - A Production-focused Open-Source Machine Learning Framework

Other repositories

Modules

Jupyter Hub

Configuration

Proxy configuration

Authentication configuration

User configuration

Prefect

Agent

Postgresql

Dask

Feast

MLFlow

Seldon

Exposing Services

Ambassador

Exposing a service in Ambassador

Terraform

Authentication

Session Lifespan

ORY Oathkeeper

Zero-Trust and Unauthorized Resources

Access Rules

Allow all incoming requests

Authorize on KRATOS

Comments

Releases(0.1.0)

0.1.0(Aug 16, 2021)

Owner

Data Revenue

Evidently helps analyze machine learning models during validation or production monitoring

Production Grade Machine Learning Service

Machine learning that just works, for effortless production applications

Iris-Heroku - Putting a Machine Learning Model into Production with Flask and Heroku

⏳ Tempo: The MLOps Software Development Kit

This is a public repo where code samples are stored for the book Practical MLOps.

End to End toy example of MLOps

ClearML - Auto-Magical Suite of tools to streamline your ML workflow. Experiment Manager, MLOps and Data-Management

MLOps pipeline project using Amazon SageMaker Pipelines

Azure MLOps (v2) solution accelerators.

A demo project to elaborate how Machine Learn Models are deployed on production using Flask API

Pytools is an open source library containing general machine learning and visualisation utilities for reuse

Data Version Control or DVC is an open-source tool for data science and machine learning projects

SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker.

MLReef is an open source ML-Ops platform that helps you collaborate, reproduce and share your Machine Learning work with thousands of other users.

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Python Extreme Learning Machine (ELM) is a machine learning technique used for classification/regression tasks.