50-days-of-Statistics-for-Data-Science - This repository consists of a 50-day program

Overview

50-days-of-Statistics-for-Data-Science

This repository consists of a 50-day program. All the statistics needed for a complete understanding of data science will be uploaded to this repository.

| Sr No | Notebook Topic | Colab |
|---|---|---|
| 1 | Elements of Structured Data | Open In Colab |
| 2 | Rectangular Data | Open In Colab |
| 3 | Estimates of Location | Open In Colab |
| 4 | Estimates of Variability | Open In Colab |
| 5 | Exploring the Data Distribution | Open In Colab |
| 6 | Exploring Binary and Categorical Data | Open In Colab |
| 7 | Correlation | Open In Colab |
| 8 | Exploring Two or More Variables | Open In Colab |
| 9 | Random Sampling and Sample Bias | Open In Colab |
| 10 | Selection Bias | Open In Colab |
| 11 | Sampling Distribution of a Statistic | Open In Colab |
| 12 | The Bootstrap | Open In Colab |
| 13 | Confidence Intervals | Open In Colab |
| 14 | Normal Distribution | Open In Colab |
| 15 | Long-Tailed Distributions | Open In Colab |
| 16 | Student’s t-Distribution | Open In Colab |
| 17 | Binomial Distribution | Open In Colab |
| 18 | Chi-Square Distribution | Open In Colab |
| 19 | F-Distribution | Open In Colab |
| 20 | Poisson and Related Distributions | Open In Colab |
| 21 | A/B Testing | Open In Colab |
| 22 | Hypothesis Tests | Open In Colab |
| 23 | Resampling | Open In Colab |
| 24 | Statistical Significance and p-Values | Open In Colab |
| 25 | t-Tests | Open In Colab |
| 26 | Multiple Testing | Open In Colab |
| 27 | Degrees of Freedom | Open In Colab |
| 28 | ANOVA | Open In Colab |
| 29 | Chi-Square Test | Open In Colab |
| 30 | Multi-Arm Bandit Algorithm | Open In Colab |
| 31 | Power and Sample Size | Open In Colab |
| 32 | Simple Linear Regression | Open In Colab |
| 33 | Multiple Linear Regression | Open In Colab |
| 34 | Prediction Using Regression | Open In Colab |
| 35 | Factor Variables in Regression | Open In Colab |
| 36 | Interpreting the Regression Equation | Open In Colab |
| 37 | Regression Diagnostics | Open In Colab |
| 38 | Polynomial and Spline Regression | Open In Colab |
| 39 | Naïve Bayes | Open In Colab |
| 40 | Discriminant Analysis | Open In Colab |
| 41 | Logistic Regression | Open In Colab |
| 42 | Evaluating Classification Models | Open In Colab |
| 43 | Strategies for Imbalanced Data | Open In Colab |
| 44 | K-Nearest Neighbors | Open In Colab |
| 45 | Tree Models | Open In Colab |
| 46 | Bagging and the Random Forest | Open In Colab |
| 47 | Boosting | Open In Colab |
| 48 | Principal Components Analysis | Open In Colab |
| 49 | K-Means Clustering | Open In Colab |
| 50 | Hierarchical Clustering | Open In Colab |
| 51 | Model-Based Clustering | Open In Colab |
| 52 | Scaling and Categorical Variables | Open In Colab |

Owner
komal_lamba
👋 Hi, I’m @komal_lamba 👀 I’m interested in ML, DL, NLP, and Audio Analysis 🌱 I’m an AI enthusiast