My project contrasts K-Nearest Neighbors and Random Forrest Regressors on Real World data

Last update: Oct 28, 2021

Related tags

Machine Learning kNN-vs-RFR

Overview

kNN-vs-RFR

My project contrasts K-Nearest Neighbors and Random Forrest Regressors on Real World data

In many areas, rental bikes have been launched to improve accessibility ease. It is important to have the rented bike ready and open to the public at the appropriate time, as this reduces the amount of time people have to wait. Eventually, ensuring a steady supply of rented bikes for the area becomes a big concern. The most important aspect is predicting the number of rental bikes required at each hour in order to maintain a steady supply. In this project, we discuss the ways in which we can predict the number of bikes needed for the particular day based on the provided data set. These type of prediction systems enable users to borrow a bike from a specific location and return it to a different location. Hence, we use machine learning to predict the number of rental bikes that are needed on a particular day

Background:

In Machine Intelligence, there are many ways in which we can predict the number of bikes that might be needed in a particular day. One of the methods used was to examine the models for predicting hourly rental bike demand and investigate a function filtering method to exclude non-predictive parameters and rate features based on their prediction efficiency. The project was accomplished by using repeated cross validation to train five statistical regression models with their best hyper-parameters, and then evaluating their results. The other method just estimates the cumulative number of rented bikes in the entire bike sharing system. The various data in the data collection were used to manipulate and forecast the final number of rental bikes. Methods such as Ridge Linear Regression, Support Vector Machine for Regression, Random Forest Method for Regression and Gradient Boosted Regression Tree are used for the prediction of rental bikes.

Additional Info:

Feel free to dowload my code which is in main.py. I have also provided a copy of the testing and training data sets used. Lastly, I have also uploaded a copy of the short research paper that I wrote based on this project.

You might also like...

It is a forest of random projection trees

rpforest rpforest is a Python library for approximate nearest neighbours search: finding points in a high-dimensional space that are close to a given

211 Dec 29, 2022

Machine Learning Algorithms ( Desion Tree, XG Boost, Random Forest )

implementation of machine learning Algorithms such as decision tree and random forest and xgboost on darasets then compare results for each and implement ant colony and genetic algorithms on tsp map, play blackjack game and robot in grid world and evaluate reward for it

1 Jan 19, 2022

A project based example of Data pipelines, ML workflow management, API endpoints and Monitoring.

My project contrasts K-Nearest Neighbors and Random Forrest Regressors on Real World data

Related tags

Overview

kNN-vs-RFR

Background:

Additional Info:

You might also like...

It is a forest of random projection trees

Machine Learning Algorithms ( Desion Tree, XG Boost, Random Forest )

A project based example of Data pipelines, ML workflow management, API endpoints and Monitoring.

Real-time stream processing for python

2D fluid simulation implementation of Jos Stam paper on real-time fuild dynamics, including some suggested extensions.

Real-time domain adaptation for semantic segmentation

A data preprocessing package for time series data. Design for machine learning and deep learning.

Data science, Data manipulation and Machine learning package.

Data Version Control or DVC is an open-source tool for data science and machine learning projects

Owner

MBTR is a python package for multivariate boosted tree regressors trained in parameter space.

A framework for building (and incrementally growing) graph-based data structures used in hierarchical or DAG-structured clustering and nearest neighbor search

The project's goal is to show a real world application of image segmentation using k means algorithm

Used Logistic Regression, Random Forest, and XGBoost to predict the outcome of Search & Destroy games from the Call of Duty World League for the 2018 and 2019 seasons.

A toolkit for making real world machine learning and data analysis applications in C++

ANNchor is a python library which constructs approximate k-nearest neighbour graphs for slow metrics.

Neighbourhood Retrieval (Nearest Neighbours) with Distance Correlation.

learn python in 100 days, a simple step could be follow from beginner to master of every aspect of python programming and project also include side project which you can use as demo project for your personal portfolio

ThunderGBM: Fast GBDTs and Random Forests on GPUs

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.