Machine Learning for Time-Series with Python.Published by Packt

Overview

Machine-Learning-for-Time-Series-with-Python

Become proficient in deriving insights from time-series data and analyzing a model’s performance

Links

Key Features

Explore popular and modern machine learning methods including the latest online and deep learning algorithms Learn to increase the accuracy of your predictions by matching the right model with the right problem Master time-series via real-world case studies on operations management, digital marketing, finance, and healthcare The updated edition enables you to implement evergreen frameworks that will stay relevant as Power BI updates. Get familiar with Power BI development tools and services by going deep into the data connectivity, transformation, modeling, visualization, and analytical capabilities of Power BI. Microsoft Power BI Cookbook, Second Edition enables Power BI’s functional programming languages of DAX and M to come alive to deliver powerful solutions to common business intelligence challenges.

What you will learn

  • Understand the main classes of time-series and learn how to detect outliers and patterns
  • Choose the right method to solve time-series problems
  • Characterize seasonal and correlation patterns through autocorrelation and statistical techniques
  • Get to grips with time-series data visualization
  • Understand classical time-series models like ARMA and ARIMA
  • Implement deep learning models, like Gaussian processes, transformers, and state-of-the-art machine learning models
  • Become familiar with many libraries like Prophet, XGboost, and TensorFlow

Who This Book Is For

This book is ideal for data analysts, data scientists, and Python developers who are looking to perform time-series analysis to effectively predict outcomes. Basic knowledge of the Python language is essential. Familiarity with statistics is desirable.

Table of Contents

  1. Introduction to Time-Series with Python
  2. Time-Series Analysis with Python
  3. Preprocessing Time-Series
  4. Introduction to Machine Learning for Time-Series
  5. Forecasting with Moving Averages and Autoregressive Models
  6. Unsupervised Methods for Time-Series
  7. Machine Learning Models for Time-Series
  8. Online Learning for Time-Series
  9. Probabilistic Models for Time-Series
  10. Deep Learning for Time-Series
  11. Reinforcement Learning for Time-Series
  12. Multivariate Forecasting

Author Notes

I've heard from a few people struggling with tsfresh and featuretools for chapter 3.

My PR for tsfresh was merged mid-December fixing a version incompatibility - featuretools went through many breaking changes with the release of version 1.0.0 (congratulations to the team!). Please see how to fix any problems in the discussion here.

Comments
  • the code is not complete

    the code is not complete

    Hi Ben,

    I have high expectation on your book and am very excited to learn. However, the codes published on Github is incomplete. For example, in chapter 2, where are the codes from page 52 to 53 especially page 53? I am very interested to know how you did the code in page 53. Also some codes in jupyter notebook are not explained either in notebook and book. I am kind of disappointed. Could you please republish your complete jupyter notebook? Thanks

    opened by yongnuan 14
  • Chapter 3 Preprocessing notebook

    Chapter 3 Preprocessing notebook

    Hello, In the Chapter 3 Preprocessing notebook there seem to be several errors in the code.

    When installing 'featuretools[tsfresh]' in colab the following error occurs: WARNING: featuretools 0.27.1 does not provide the extra 'tsfresh'

    When running the first code bracket using feature tools in colab the following error occurs: TypeError: import_optional_dependency() got an unexpected keyword argument 'errors'

    It appears colab installs featuretools version 0.27.1 If I install the most recent version of featuretoolls (i.e., 1.3.0) and run the code bracket below Automated Feature Extraction I get the following error: AttributeError: 'EntitySet' object has no attribute 'entity_from_dataframe'

    If it is versioning that is an issue, please let me know what version of featuretools the code in the book was based on. If it is not versioning, would you be able to advise what the issue is?

    Let me know if you have any questions. Thanks!

    Best, Nils

    opened by nbwosm 3
  • this repo's minor readme issue

    this repo's minor readme issue

    Hi Ben!

    Not a book issue, but one with the readme associated with this repo. The Key Features section starts fine but in the middle it accidentally switches into a description of another Pack book on Power BI. Looking forward to reading your book!

    Alex

    opened by TacheAlexandru 1
  • Fix chapters 6 order as per book and fix a run time issue.

    Fix chapters 6 order as per book and fix a run time issue.

    1. Change the order for chapter-6 as per the book.
    2. issue in the notebook for line intrusion_outliers = od.predict(intrusions["data"][:,0])
    3. Few minor modifications for the chapter 5 notebook.
    opened by sharanharsoor 0
  • Update parameter use in get_season() function

    Update parameter use in get_season() function

    An integer is being used instead of the yearly_periods parameter in the get_season() function. The function works as is, but is not generalizable to seasonality of other yearly_periods values.

    opened by JTanner4 0
  • Hard to believe these are typos

    Hard to believe these are typos

    This is from the documentation of ARCH package "The null hypothesis of the Augmented Dickey-Fuller is that there is a unit root, with the alternative that there is no unit root. If the p-value is above a critical size, then the null cannot be rejected that there and the series appears to be a unit root."

    on p. 152 a p-value of 0.997 is reported (in a figure which references KPSS test while the table shown clearly says that it is an ADF test) and the following conclusion is made "Given the p-value of 0.997, we can reject our null hypothesis of the unit root, and we conclude that our process is weakly stationary."

    In a couple of pages "Please note that we need to set trend="t" here so that the model includes a constant. If not, we would get a spurious regression." In the statsmodels' ARIMA class documentation, it is stated clearly that trend is "Parameter controlling the deterministic trend. Can be specified as a string where ‘c’ indicates a constant term, ‘t’ indicates a linear trend in time, and ‘ct’ includes both."

    It's quite doubtful that the author of this book understands what he is writing about. But at least he could have checked the documentation of the packages used in it.

    opened by niskrev 1
  • Time Series Clustering

    Time Series Clustering

    The coverage of time series clustering is poor. Actually does not exist beyond referencing a few libraries which people can find themselves. Any follow up with examples here would help I think.

    opened by ghost 3
Owner
Packt
Providing books, eBooks, video tutorials, and articles for IT developers, administrators, and users.
Packt
A data preprocessing package for time series data. Design for machine learning and deep learning.

A data preprocessing package for time series data. Design for machine learning and deep learning.

Allen Chiang 152 Jan 7, 2023
Python module for machine learning time series:

seglearn Seglearn is a python package for machine learning time series or sequences. It provides an integrated pipeline for segmentation, feature extr

David Burns 536 Dec 29, 2022
A machine learning toolkit dedicated to time-series data

tslearn The machine learning toolkit for time series analysis in Python Section Description Installation Installing the dependencies and tslearn Getti

null 2.3k Jan 5, 2023
A unified framework for machine learning with time series

Welcome to sktime A unified framework for machine learning with time series We provide specialized time series algorithms and scikit-learn compatible

The Alan Turing Institute 6k Jan 6, 2023
A machine learning toolkit dedicated to time-series data

tslearn The machine learning toolkit for time series analysis in Python Section Description Installation Installing the dependencies and tslearn Getti

null 2.3k Dec 29, 2022
Merlion: A Machine Learning Framework for Time Series Intelligence

Merlion is a Python library for time series intelligence. It provides an end-to-end machine learning framework that includes loading and transforming data, building and training models, post-processing model outputs, and evaluating model performance. I

Salesforce 2.8k Jan 5, 2023
Examples and code for the Practical Machine Learning workshop series

Practical Machine Learning Workshop Series Practical Machine Learning for Quantitative Finance Post conference workshop at the WBS Spring Conference D

CompatibL 21 Jun 25, 2022
Open source time series library for Python

PyFlux PyFlux is an open source time series library for Python. The library has a good array of modern time series models, as well as a flexible array

Ross Taylor 2k Jan 2, 2023
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.

pmdarima Pmdarima (originally pyramid-arima, for the anagram of 'py' + 'arima') is a statistical library designed to fill the void in Python's time se

alkaline-ml 1.3k Dec 22, 2022
Probabilistic time series modeling in Python

GluonTS - Probabilistic Time Series Modeling in Python GluonTS is a Python toolkit for probabilistic time series modeling, built around Apache MXNet (

Amazon Web Services - Labs 3.3k Jan 3, 2023
A python library for easy manipulation and forecasting of time series.

Time Series Made Easy in Python darts is a python library for easy manipulation and forecasting of time series. It contains a variety of models, from

Unit8 5.2k Jan 4, 2023
STUMPY is a powerful and scalable Python library for computing a Matrix Profile, which can be used for a variety of time series data mining tasks

STUMPY STUMPY is a powerful and scalable library that efficiently computes something called the matrix profile, which can be used for a variety of tim

TD Ameritrade 2.5k Jan 6, 2023
A Python package for time series classification

pyts: a Python package for time series classification pyts is a Python package for time series classification. It aims to make time series classificat

Johann Faouzi 1.4k Jan 1, 2023
A Python toolkit for rule-based/unsupervised anomaly detection in time series

Anomaly Detection Toolkit (ADTK) Anomaly Detection Toolkit (ADTK) is a Python package for unsupervised / rule-based time series anomaly detection. As

Arundo Analytics 888 Dec 30, 2022
AtsPy: Automated Time Series Models in Python (by @firmai)

Automated Time Series Models in Python (AtsPy) SSRN Report Easily develop state of the art time series models to forecast univariate data series. Simp

Derek Snow 465 Jan 2, 2023
A python library for Bayesian time series modeling

PyDLM Welcome to pydlm, a flexible time series modeling library for python. This library is based on the Bayesian dynamic linear model (Harrison and W

Sam 438 Dec 17, 2022
A Python implementation of GRAIL, a generic framework to learn compact time series representations.

GRAIL A Python implementation of GRAIL, a generic framework to learn compact time series representations. Requirements Python 3.6+ numpy scipy tslearn

null 3 Nov 24, 2021
PyPOTS - A Python Toolbox for Data Mining on Partially-Observed Time Series

A python toolbox/library for data mining on partially-observed time series, supporting tasks of forecasting/imputation/classification/clustering on incomplete multivariate time series with missing values.

Wenjie Du 179 Dec 31, 2022
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Master status: Development status: Package information: TPOT stands for Tree-based Pipeline Optimization Tool. Consider TPOT your Data Science Assista

Epistasis Lab at UPenn 8.9k Jan 9, 2023