scikit-learn: machine learning in Python

pip install -U scikit-learn

conda install -c conda-forge scikit-learn

git clone https://github.com/scikit-learn/scikit-learn.git

pytest sklearn

        - An iterable that generates (train, test) splits as arrays of indices.
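The option above (an iterable of (train, test) index arrays) can be passed directly as `cv`. The following is a minimal sketch, assuming a small synthetic dataset and two hand-built splits; the data, the estimator, and the split boundaries are illustrative choices, not part of the original text.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Synthetic data, for illustration only.
X, y = make_classification(n_samples=60, n_features=5, random_state=0)
indices = np.arange(len(y))

# Each element is a (train_indices, test_indices) pair of integer arrays.
splits = [
    (indices[:40], indices[40:]),
    (indices[20:], indices[:20]),
]

# One score is returned per (train, test) pair.
scores = cross_val_score(LogisticRegression(), X, y, cv=splits)
print(scores)
```

A plain list works here because it can be iterated more than once; a one-shot generator is also accepted but is consumed after a single pass.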

pytest sklearn/tests/test_docstrings.py -k sklearn._config.config_context

pytest maint_tools/test_docstrings.py -k StandardScaler- 

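import difflib
import inspect

from IPython.display import HTML


def show_func_diff(func_a, func_b):
    """Render a side-by-side HTML diff of the source of two functions."""
    lines_a = inspect.getsourcelines(func_a)[0]
    lines_b = inspect.getsourcelines(func_b)[0]
    return HTML(difflib.HtmlDiff().make_file(lines_a, lines_b))


# sklearn.cross_validation was deprecated in 0.18 and removed in 0.20,
# so this comparison only runs on scikit-learn 0.18 or 0.19.
from sklearn.cross_validation import cross_val_score as cross_val_score_old
from sklearn.model_selection import cross_val_score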

show_func_diff(cross_val_score, cross_val_score_old)
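Outside a notebook, the same comparison can be done as a plain-text diff. This is a minimal sketch using only the standard library; the two source strings are made-up stand-ins for the old and new function bodies, not scikit-learn code.

```python
import difflib

# Hypothetical "old" and "new" sources, kept as lists of lines.
old_src = """def scale(x):
    return x * 2
""".splitlines(keepends=True)

new_src = """def scale(x, factor=2):
    return x * factor
""".splitlines(keepends=True)

# unified_diff yields the diff line by line, like `diff -u`.
diff = list(difflib.unified_diff(old_src, new_src, fromfile="old", tofile="new"))
print("".join(diff))
```

For real functions, the line lists could come from `inspect.getsourcelines` as in the notebook helper, provided the source files are available.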

('Eigenfaces - PCA using randomized SVD',
 decomposition.PCA(n_components=n_components, svd_solver='lobpcg',
                   whiten=True),
 True),
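The entry above looks like one item in a list of named estimators. A minimal sketch of fitting such a PCA estimator follows. It assumes `svd_solver='randomized'`, the solver named in the label, because `'lobpcg'` may not be accepted by every scikit-learn release; the random data and the sizes are invented for illustration.

```python
import numpy as np
from sklearn import decomposition

# Random stand-in for image data: 100 samples with 64 features each.
rng = np.random.RandomState(0)
X = rng.normal(size=(100, 64))

n_components = 6
pca = decomposition.PCA(
    n_components=n_components,
    svd_solver="randomized",  # assumption: solver taken from the label
    whiten=True,
    random_state=0,
)
pca.fit(X)

# One row per component, one column per input feature.
print(pca.components_.shape)
```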

from scipy.sparse import csr_matrix
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(10000, n_features=200)
X = csr_matrix(X, copy=True)

clf = RandomForestClassifier(n_jobs=-1)

cross_val_score(clf, X, y)

ValueError: 
All the 5 fits failed.
It is very likely that your model is misconfigured.
You can try to debug the error by setting error_score='raise'.

Below are more details about the failures:
--------------------------------------------------------------------------------
5 fits failed with the following error:
joblib.externals.loky.process_executor._RemoteTraceback: 
"""
Traceback (most recent call last):
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/joblib/externals/loky/process_executor.py", line 428, in _process_worker
    r = call_item()
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/joblib/externals/loky/process_executor.py", line 275, in __call__
    return self.fn(*self.args, **self.kwargs)
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/joblib/_parallel_backends.py", line 620, in __call__
    return self.func(*args, **kwargs)
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/joblib/parallel.py", line 288, in __call__
    return [func(*args, **kwargs)
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/joblib/parallel.py", line 288, in <listcomp>
    return [func(*args, **kwargs)
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/sklearn/utils/fixes.py", line 117, in __call__
    return self.function(*args, **kwargs)
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/sklearn/ensemble/_forest.py", line 185, in _parallel_build_trees
    tree.fit(X, y, sample_weight=curr_sample_weight, check_input=False)
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/sklearn/tree/_classes.py", line 889, in fit
    super().fit(
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/sklearn/tree/_classes.py", line 379, in fit
    builder.build(self.tree_, X, y, sample_weight)
  File "sklearn/tree/_tree.pyx", line 147, in sklearn.tree._tree.DepthFirstTreeBuilder.build
  File "sklearn/tree/_tree.pyx", line 173, in sklearn.tree._tree.DepthFirstTreeBuilder.build
  File "sklearn/tree/_splitter.pyx", line 789, in sklearn.tree._splitter.BaseSparseSplitter.init
  File "stringsource", line 660, in View.MemoryView.memoryview_cwrapper
  File "stringsource", line 350, in View.MemoryView.memoryview.__cinit__
ValueError: buffer source array is read-only
"""

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/sklearn/model_selection/_validation.py", line 686, in _fit_and_score
    estimator.fit(X_train, y_train, **fit_params)
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/sklearn/ensemble/_forest.py", line 474, in fit
    trees = Parallel(
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/joblib/parallel.py", line 1098, in __call__
    self.retrieve()
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/joblib/parallel.py", line 975, in retrieve
    self._output.extend(job.get(timeout=self.timeout))
  File "/home/temp/.local/miniconda/lib/python3.10/site-packages/joblib/_parallel_backends.py", line 567, in wrap_future_result
    return future.result(timeout=timeout)
  File "/home/temp/.local/miniconda/lib/python3.10/concurrent/futures/_base.py", line 458, in result
    return self.__get_result()
  File "/home/temp/.local/miniconda/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result
    raise self._exception
ValueError: buffer source array is read-only
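The message above suggests setting error_score='raise' to debug failing fits. The sketch below shows the effect on a deliberately broken estimator: the original exception propagates instead of being wrapped in the "All the N fits failed" summary. The `AlwaysFails` class and the toy data are hypothetical and exist only for this illustration.

```python
import numpy as np
from sklearn.base import BaseEstimator, ClassifierMixin
from sklearn.model_selection import cross_val_score


class AlwaysFails(ClassifierMixin, BaseEstimator):
    """Hypothetical estimator whose fit always raises."""

    def fit(self, X, y):
        raise RuntimeError("deliberate failure inside fit")


# Toy data with two balanced classes.
X = np.arange(40, dtype=float).reshape(20, 2)
y = np.array([0, 1] * 10)

caught = None
try:
    # With error_score="raise", the first failing fit re-raises its own error.
    cross_val_score(AlwaysFails(), X, y, cv=2, error_score="raise")
except RuntimeError as exc:
    caught = exc
    print("original error:", exc)
```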

System:
    python: 3.10.6 | packaged by conda-forge | (main, Aug 22 2022, 20:36:39) [GCC 10.4.0]
executable: /home/temp/.local/miniconda/bin/python3.10
   machine: Linux-5.14.0-1054-oem-x86_64-with-glibc2.31

Python dependencies:
      sklearn: 1.2.0
          pip: 22.3
   setuptools: 65.5.0
        numpy: 1.23.5
        scipy: 1.9.3
       Cython: 0.29.32
       pandas: 1.5.2
   matplotlib: 3.6.2
       joblib: 1.2.0
threadpoolctl: 3.1.0

Built with OpenMP: True

threadpoolctl info:
       user_api: blas
   internal_api: mkl
         prefix: libmkl_rt
       filepath: /home/temp/.local/miniconda/lib/libmkl_rt.so.2
        version: 2022.1-Product
threading_layer: intel
    num_threads: 4

       user_api: openmp
   internal_api: openmp
         prefix: libomp
       filepath: /home/temp/.local/miniconda/lib/libomp.so
        version: None
    num_threads: 8
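A report in the format above can be generated with `sklearn.show_versions()`. The snippet is a minimal sketch; the exact sections and fields printed may differ between scikit-learn releases.

```python
import sklearn

# Short form: just the installed scikit-learn version.
print(sklearn.__version__)

# Full report: system, Python dependencies, and threadpool information.
sklearn.show_versions()
```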

import base64
print(base64.b64decode("dGhvbWFzLmdlcm1lckBoaHUuZGU=").decode("ascii"))

from sklearn.neighbors import VALID_METRICS

for key in VALID_METRICS:
    print(f"'nan_euclidean' in {key}:", 'nan_euclidean' in VALID_METRICS[key])

# Output:
# 'nan_euclidean' in ball_tree: False
# 'nan_euclidean' in kd_tree: False
# 'nan_euclidean' in brute: True
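The metric checked above is also exposed as a pairwise function, `sklearn.metrics.pairwise.nan_euclidean_distances`. The sketch below, on a made-up two-row array, illustrates how missing coordinates are skipped and the remaining squared distance is rescaled by (total coordinates / present coordinates).

```python
import math

import numpy as np
from sklearn.metrics.pairwise import nan_euclidean_distances

# Second row has a missing value in its last coordinate.
X = np.array([[0.0, 1.0], [1.0, np.nan]])

D = nan_euclidean_distances(X, X)
print(D)

# Only the first coordinate is present in both rows:
# sqrt((2 / 1) * (0 - 1) ** 2) = sqrt(2)
expected = math.sqrt(2.0)
print(math.isclose(D[0, 1], expected))
```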

Overview

Installation

Dependencies

User installation

Changelog

Development

Important links

Source code

Contributing

Testing

Submitting a Pull Request

Project History

Help and Support

Documentation

Communication

Citation

Comments

Performance and scores

TODO before merge

Future enhancements

Background / Objective

Validating docstrings in scikit-learn

Note

Steps

Functions to Update

Multi-layer perceptron (MLP)

Code Check out :

Tutorial link:

Sample Benchmark:

TODO:

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Describe the bug

Steps/Code to Reproduce

Expected Results

Actual Results

Versions

Releases (1.2.0)

1.2.0 (Dec 8, 2022)

1.1.3 (Oct 26, 2022)

1.1.2 (Aug 5, 2022)

1.1.1 (May 19, 2022)

1.1.0 (May 12, 2022)

1.0.2 (Dec 25, 2021)

1.0.1 (Oct 25, 2021)

1.0 (Sep 24, 2021)

0.24.2 (Apr 28, 2021)

0.24.1 (Jan 19, 2021)

0.24.0 (Dec 22, 2020)

0.23.2 (Aug 4, 2020)

0.23.1 (May 19, 2020)

0.23.0 (May 12, 2020)

0.22.2.post1 (Mar 4, 2020)

0.22.1 (Jan 2, 2020)

0.22 (Dec 3, 2019)

0.20.4 (Jul 30, 2019)

0.21.3 (Jul 30, 2019)

0.21.2 (May 23, 2019)

0.21.1 (May 15, 2019)

0.21.0 (May 10, 2019)

0.20.3 (Mar 2, 2019)

0.20.2 (Dec 20, 2018)

0.20.1 (Nov 25, 2018)

0.20.0 (Nov 22, 2018)

0.19.2 (Nov 22, 2018)

0.19.1 (Oct 22, 2017)

0.18.1 (Nov 15, 2016)

0.18 (Oct 18, 2016)