Code repo for the book "Feature Engineering for Machine Learning," by Alice Zheng and Amanda Casari, O'Reilly 2018

Overview

feature-engineering-book

This repo accompanies "Feature Engineering for Machine Learning," by Alice Zheng and Amanda Casari. O'Reilly, 2018.

The repo does not contain the datasets because we do not have the rights to redistribute them. Please follow the URLs given in the book to download the data.
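Because the data must be downloaded separately, it can help to confirm that the files a notebook expects are in place before running it. The following is a minimal sketch, not part of the repo: the function name `missing_data_files`, the `data` directory, and the file name in the usage section are all hypothetical placeholders to be replaced with the paths each notebook actually reads.

```python
from pathlib import Path


def missing_data_files(expected, data_dir="data"):
    """Return the names in `expected` that are not files under `data_dir`."""
    root = Path(data_dir)
    return [name for name in expected if not (root / name).is_file()]


if __name__ == "__main__":
    # Hypothetical file name; substitute the files your notebook loads.
    expected = ["example_dataset.csv"]
    for name in missing_data_files(expected):
        print(f"Missing: {name} (download it from the URL given in the book)")
```

Running the check first gives a clear list of what still needs downloading instead of a file-not-found error partway through a notebook.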

Comments
  • Dataset needing

    Hi everyone, this book uses many datasets, but I didn't see links to their source pages in the text. Could someone share those datasets, please? Thank you!

    opened by HYLCHF 4
  • Data for the examples

    Hello,

    The Jupyter notebooks link to data files in local directories, but the repository contains no such directories, so the notebooks cannot be run as-is. What is the correct way to run them? Do the data files need to be downloaded manually?

    Thank you.

    opened by labdmitriy 2
  • Renamed notebooks using book example ontology

    Walked through the book to match up chapters + example numbers to notebooks.

    This numbering allows readers + non-readers alike to follow through the repo in the order of the book.

    opened by amcasari 0
  • The problem in Tf-idf.ipynb

    Hi, when I run the 04.01-06 sample code:

        # Create the target column--True for Nightlife businesses, and False otherwise
        twobiz_reviews['target'] = twobiz_reviews.apply(lambda x: 'Nightlife' in x['categories'], axis=1)

    I get an error: "ValueError: Cannot set a frame with no defined index and a value that cannot be converted to a Series". I am using the Yelp dataset v13. How can I solve this?

    opened by alexfeng2 0
  • Didn't find 'mag_subset20K.txt' file in the unzipped folder mag_papers_0

    I downloaded the mag_papers_0 file from https://www.openacademic.ai/oag/ but didn't find the mag_subset20K.txt file used in https://github.com/alicezheng/feature-engineering-book/blob/master/09.01-05_%5BEnd-to-End_Example%5D_Recommender_Take_1.ipynb.

    Is the mag_subset20K.txt file reconstructed from one of the source files? If so, how is it built? If not, where is the right link to download it?

    Thanks

    opened by tlemar 1
  • Don't understand 2 lines of code

    Hi Alice,

    Thanks for the great book! There are two lines of code in the last two notebooks for the end-to-end examples that I don't quite understand:

    • if order < 5: order += 1
    • if order < top_n: order += 1

    What was the rationale for using if blocks here?

    Thank you!

    opened by ivanyschen 0
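Regarding the ValueError reported in the Tf-idf.ipynb comment above: one plausible cause (an assumption, since the comment does not show the earlier cells) is that `twobiz_reviews` is empty when the target column is assigned. A row-wise `DataFrame.apply` on an empty frame may return a DataFrame instead of a Series, which then cannot be set as a column. The sketch below builds the target with `Series.apply` on the `categories` column, which returns a Series even for an empty frame. The helper name `add_nightlife_target` is hypothetical, and the check accepts both list-valued and string-valued categories because the format may differ between dataset versions.

```python
import pandas as pd


def add_nightlife_target(reviews: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of `reviews` with a boolean 'target' column.

    target is True when 'Nightlife' appears in the row's categories.
    Missing values (None/NaN) are treated as False.
    """
    out = reviews.copy()
    out["target"] = out["categories"].apply(
        # Works for a list of category names or a comma-separated string.
        lambda c: isinstance(c, (list, str)) and "Nightlife" in c
    )
    return out


if __name__ == "__main__":
    # Tiny made-up frame for illustration only; not real Yelp data.
    demo = pd.DataFrame(
        {"categories": [["Nightlife", "Bars"], "Restaurants, Italian", None]}
    )
    print(add_nightlife_target(demo))
```

If the resulting frame has no rows, the problem most likely lies in an earlier filter or merge step rather than in this assignment, so checking `twobiz_reviews.empty` before this cell is a reasonable first diagnostic.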
Owner
Alice Zheng