slim-python is a package to learn customized scoring systems for decision-making problems.

Berk Ustun

Last update: Nov 2, 2022

Overview

slim-python is a package to learn customized scoring systems for decision-making problems.

These are simple decision aids that let users make yes-no predictions by adding and subtracting a few small numbers.
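To make the idea concrete, here is a minimal sketch of how such a scoring system is applied. The feature names, point values, and threshold are hypothetical and chosen only for illustration; they are not produced by slim-python or taken from any fitted model.

```python
# Hypothetical scoring system: feature names, points, and threshold are
# illustrative only and do not come from slim-python or any fitted model.
POINTS = {
    "age_over_60": 2,
    "hypertension": 1,
    "regular_exercise": -1,
}
THRESHOLD = 1  # predict "yes" when the total score exceeds this value


def total_score(features):
    """Add up the points of every feature that is present (value 1)."""
    return sum(points * features.get(name, 0) for name, points in POINTS.items())


def predict(features):
    """Return True ("yes") if the total score is above the threshold."""
    return total_score(features) > THRESHOLD


patient = {"age_over_60": 1, "hypertension": 1, "regular_exercise": 1}
print("score = %d, prediction = %s" % (total_score(patient), predict(patient)))
# prints: score = 2, prediction = True
```

A user could do the same arithmetic by hand: 2 + 1 - 1 = 2, which is above the threshold of 1, so the prediction is "yes".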

SLIM is designed to learn the most accurate scoring system for a given dataset and set of constraints. These models are produced by solving a hard optimization problem that directly optimizes for accuracy, sparsity, and customized constraints (e.g., hard limits on model size, TPR, FPR).
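The sketch below illustrates the kind of objective described above on a toy problem: it trades off the error rate against the number of nonzero coefficients, under a hard limit on model size. It is not how slim-python works internally. The package relies on CPLEX to solve the optimization problem, whereas this stand-in simply enumerates every small integer model. The data, the penalty `C0`, the coefficient range, and the function names are all assumptions made for illustration.

```python
from itertools import product

# Toy data, invented for illustration: rows are binary features, labels are +1/-1.
X = [(1, 0, 1), (1, 1, 0), (0, 1, 1), (0, 0, 1), (1, 1, 1), (0, 0, 0)]
y = [1, 1, -1, -1, 1, -1]

C0 = 0.01                   # assumed penalty per nonzero coefficient
COEF_VALUES = range(-2, 3)  # small integer coefficients: -2, ..., 2
INTERCEPTS = range(-2, 3)   # small integer intercepts: -2, ..., 2
MAX_MODEL_SIZE = 2          # hard limit on the number of nonzero coefficients


def model_size(coefs):
    """Number of nonzero coefficients (the intercept is not counted)."""
    return sum(1 for c in coefs if c != 0)


def objective(intercept, coefs):
    """Error rate plus a sparsity penalty; a score of exactly 0 counts as an error."""
    errors = sum(
        1
        for xi, yi in zip(X, y)
        if yi * (intercept + sum(c * v for c, v in zip(coefs, xi))) <= 0
    )
    return errors / float(len(X)) + C0 * model_size(coefs)


def brute_force_fit():
    """Enumerate all candidate models and return (objective, intercept, coefs)."""
    best = None
    for intercept in INTERCEPTS:
        for coefs in product(COEF_VALUES, repeat=len(X[0])):
            if model_size(coefs) > MAX_MODEL_SIZE:
                continue  # enforce the hard limit on model size
            value = objective(intercept, coefs)
            if best is None or value < best[0]:
                best = (value, intercept, coefs)
    return best


value, intercept, coefs = brute_force_fit()
print("objective = %.2f, intercept = %d, coefficients = %s" % (value, intercept, coefs))
```

Enumeration is only feasible here because there are three features and five candidate values per coefficient. The search space grows exponentially with the number of features, which is why a dedicated solver is needed for real datasets.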

Requirements

slim-python was developed using Python 2.7.11 and CPLEX 12.6.2.

CPLEX

CPLEX is a cross-platform commercial optimization tool with a Python API. It is freely available to students and faculty members at accredited institutions as part of the IBM Academic Initiative. To get CPLEX:

Join the IBM Academic Initiative. Note that it may take up to a week to obtain approval.
Download IBM ILOG CPLEX Optimization Studio V12.6.1 (or higher) from the software catalog.
Install the file on your computer. Note that Mac/Unix users will need to install a .bin file.
Set up the CPLEX Python modules as described here.

Please check the CPLEX user manual or the CPLEX forums if you have problems installing CPLEX.
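Once the steps above are done, a quick way to confirm that the CPLEX Python modules are visible to your interpreter is to try importing them. The sketch below assumes the module is importable under the name `cplex`, which is the name used by the CPLEX Python API; the helper function name is hypothetical.

```python
import importlib


def cplex_is_available():
    """Return True if the `cplex` Python module can be imported."""
    try:
        importlib.import_module("cplex")
    except ImportError:
        return False
    return True


if cplex_is_available():
    print("CPLEX Python module found")
else:
    print("CPLEX Python module not found; check your CPLEX installation")
```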

Citation

If you use SLIM for academic research, please cite our paper!

@article{
    ustun2015slim,
    year = {2015},
    issn = {0885-6125},
    journal = {Machine Learning},
    doi = {10.1007/s10994-015-5528-6},
    title = {Supersparse linear integer models for optimized medical scoring systems},
    url = {http://dx.doi.org/10.1007/s10994-015-5528-6},
    publisher = {Springer US},
    author = {Ustun, Berk and Rudin, Cynthia},
    pages = {1-43},
    language = {English}
}

Comments

Multiclass prediction

Hi!

This package is amazing. Wonder if there are any plans or research for multiclass prediction? Will the optimization functions need to be updated if we changed from binary to multi-class?

Best, Yolanda

opened by yolimonsta 0

Licensing and CPLEX

I just saw the talk and this looks great. Would it be possible to change the licensing to BSD / MIT? Those are the standard in the data science Python community and will get you more adoption.

Also, how much worse will this get if you use an open source MIP solver? It would be great to be able to choose a solver to be independent from IBM.

Andy

opened by amueller 2
