This repository is a series of notebooks that show solutions for the projects at Dataquest.io.

Overview

Dataquest Project Solutions

There are always multiple ways to solve any one problem, so these notebooks show just one possible solution.

Comments
  • Can you tell me the issue with my code?

    def member_english(a_string):
        for character in a_string:
            if ord(character) > 127:
                return False
            else:
                return True

    test_app_names = ['Instagram', '爱奇艺PPS -《欢乐颂2》电视剧热播', 'Docs To Go™ Free Office Suite', 'Instachat 😜']
    for app_name in test_app_names:
        eng_or_non_eng = member_english(app_name)
        print(app_name + " is in English : ", eng_or_non_eng)

    It is checking just the first character. What is the difference between putting "return True" inside the loop in an else branch and putting it after the loop? If I put it after the loop, it works fine. I'm getting a little confused here.

    opened by Shivam23Thaman 2
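The difference asked about above can be sketched as follows. This is a minimal illustration, not the official solution; the names `is_english` and `first_char_only` are hypothetical. A `return` inside the loop body ends the function on the first iteration, whichever branch runs, whereas a `return True` placed after the loop is reached only once every character has been checked.

```python
def first_char_only(a_string):
    """Version from the question: both branches return on the first iteration."""
    for character in a_string:
        if ord(character) > 127:
            return False
        else:
            # Executed as soon as the FIRST character is ASCII,
            # so later characters are never inspected.
            return True


def is_english(a_string):
    """Return True only if every character is in the ASCII range (0-127)."""
    for character in a_string:
        if ord(character) > 127:
            # A single non-ASCII character is enough to decide.
            return False
    # Reached only after the loop has inspected every character.
    return True


for app_name in ['Instagram', 'Instachat 😜']:
    print(app_name, first_char_only(app_name), is_english(app_name))
```

For 'Instachat 😜' the first function returns True because 'I' is ASCII, while the second keeps going until it meets the emoji and returns False.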
  • Missing 469

    Hi,

    I could not find the guided project for https://app.dataquest.io/m/469/guided-project%3A-popular-data-science-questions/2/stack-exchange

    https://github.com/dataquestio/solutions/blob/master/Mission469Solutions.ipynb is not found

    Thanks,

    opened by minhtd1981 1
  • Broken links

    The links for the data sets return a 404 error:

    • Google play store: https://www.kaggle.com/lava18/google-play-store-apps/home
    • App store : https://www.kaggle.com/ramamet4/app-store-apple-data-set-10k-apps/home

    They should be:

    • Google play store: https://www.kaggle.com/lava18/google-play-store-apps
    • App store : https://www.kaggle.com/ramamet4/app-store-apple-data-set-10k-apps
    opened by DishinGoyani 1
  • Mission350: Solution mistake?

    When defining under_100_m = [], the aim is to keep only communication applications with fewer than 100 million installs. However, the code in the solution keeps all applications with fewer than 100 million installs. Below is a suggestion for how the code might look:

    under_100_m = []
    
    for app in android_final:
        n_installs = app[5]
        n_installs = n_installs.replace(',', '')
        n_installs = n_installs.replace('+', '')
        if float(n_installs) < 100000000 and (app[1] == 'COMMUNICATION'):
            under_100_m.append(float(n_installs))
            
    sum(under_100_m) / len(under_100_m)
    
    opened by mdrobovtseva 1
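The suggestion above can be tried without the full data set. The sketch below wraps the same filter in a hypothetical helper, `average_installs`, and runs it on made-up rows. It assumes, as in the snippet above, that the category is at index 1 and the install count is at index 5. The sample rows are invented for illustration only.

```python
def average_installs(apps, category, cap=100_000_000):
    """Average installs for apps in `category` with fewer than `cap` installs."""
    installs = []
    for app in apps:
        # Strip the ',' and '+' characters, e.g. '1,000,000+' -> 1000000.0
        n_installs = float(app[5].replace(',', '').replace('+', ''))
        if app[1] == category and n_installs < cap:
            installs.append(n_installs)
    # Avoid dividing by zero when no app matches.
    return sum(installs) / len(installs) if installs else None


# Hypothetical rows: only indexes 1 (category) and 5 (installs) matter here.
sample_rows = [
    ['App A', 'COMMUNICATION', '', '', '', '1,000,000+'],
    ['App B', 'COMMUNICATION', '', '', '', '1,000,000,000+'],
    ['App C', 'GAME', '', '', '', '5,000,000+'],
    ['App D', 'COMMUNICATION', '', '', '', '3,000,000+'],
]

print(average_installs(sample_rows, 'COMMUNICATION'))
```

Here 'App B' is excluded by the install cap and 'App C' by the category check, so only 'App A' and 'App D' contribute to the average.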
  • Couple of issues

    A couple of questions:

    In [8]:
    import sqlite3
    import pandas as pd

    conn = sqlite3.connect("factbook.db")
    cursor = conn.cursor()
    cursor.execute(q1).fetchall()

    Is the definition of q1 missing?

    Also, you have two q7 queries, and the second q7 is identical to q6.

    opened by ualisher1 1
  • ERROR: Line magic function `%mathplotlib` not found.

    I'm using the Jupyter environment provided by Dataquest and I get the error "ERROR: Line magic function %mathplotlib not found". I searched online, and the suggested solutions are variants of installing a more recent version of Python. But since Jupyter resides on your server, I can't do that. Can you please suggest a solution? Thanks.

    opened by ualisher 1
  • Wrong results in the "Customers and Products Analysis Using SQL" Solution

    The solution for calculating low stock (in screen 4) is wrongly sorted in ascending order. The code:

    SELECT productCode, 
           ROUND(SUM(quantityOrdered) * 1.0 / (SELECT quantityInStock
                                                 FROM products p
                                                WHERE od.productCode = p.productCode), 2) AS low_stock
      FROM orderdetails od
     GROUP BY productCode
     ORDER BY low_stock
     LIMIT 10;
    

    wrongly assumes that products with a lower "low_stock" value have a higher priority for restocking. The formula shows that a lower value corresponds to a lower quantityOrdered and/or a higher quantityInStock, which runs against the purpose stated in the project's instructions.

    A simple correction is to sort in descending order (ORDER BY low_stock DESC).

    opened by cheifowo 0
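The effect of the proposed correction can be checked on a toy database. The sketch below is an illustration only: it runs the query from the issue, changed to ORDER BY low_stock DESC, through Python's built-in sqlite3 module against an in-memory database. The table contents (products P1 to P3 and their quantities) are invented, and the tables contain only the columns the query needs.

```python
import sqlite3

# In-memory database with hypothetical data; the real project uses its own file.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE products (productCode TEXT PRIMARY KEY, quantityInStock INTEGER);
    CREATE TABLE orderdetails (productCode TEXT, quantityOrdered INTEGER);
    INSERT INTO products VALUES ('P1', 1000), ('P2', 100), ('P3', 50);
    INSERT INTO orderdetails VALUES ('P1', 10), ('P1', 20), ('P2', 80),
                                    ('P3', 60), ('P3', 40);
""")

# Same query as in the issue, with the sort direction reversed.
query = """
    SELECT productCode,
           ROUND(SUM(quantityOrdered) * 1.0 / (SELECT quantityInStock
                                                 FROM products p
                                                WHERE od.productCode = p.productCode), 2) AS low_stock
      FROM orderdetails od
     GROUP BY productCode
     ORDER BY low_stock DESC
     LIMIT 10;
"""

rows = conn.execute(query).fetchall()
for product_code, low_stock in rows:
    print(product_code, low_stock)
```

With this data, P3 has 100 units ordered against 50 in stock, so it comes first under the descending sort, while the well-stocked P1 comes last.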
  • Problem with Guided Prison Break: Fetch_year function

    Hello, I'm at mission 610 and I have a problem when I try to replace the full date with just the year:

    This is the code I'm using.

    for row in data:
        date = fetch_year(row[0])
        row[0] = date
    

    This is the error I get...

    TypeError                                 Traceback (most recent call last)
    <ipython-input-10-8a078f6cee5c> in <module>
          1 for row in data:
    ----> 2     date = fetch_year(row[0])
          3     row[0] = date
    
    ~/notebook/helper.py in fetch_year(date_string)
         10 
         11 def fetch_year(date_string):
    ---> 12     return int(re.findall("\d{4}", date_string)[0])
         13 
         14 def barplot(list_of_2_element_list):
    
    /dataquest/system/env/python3/lib/python3.8/re.py in findall(pattern, string, flags)
        237 
        238     Empty matches are included in the result."""
    --> 239     return _compile(pattern, flags).findall(string)
        240 
        241 def finditer(pattern, string, flags=0):
    
    TypeError: expected string or bytes-like object
    
    
    opened by MyNameisMatt1 0
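The traceback above shows that `re.findall` received something other than a string. The issue does not show the contents of `data`, so the cause is an assumption, but one plausible explanation is that the loop had already run once, leaving integers in `row[0]`. The sketch below reproduces that situation with invented rows and adds a type check so that re-running the cell is harmless. `fetch_year` copies the helper shown in the traceback.

```python
import re


def fetch_year(date_string):
    """Same logic as the helper shown in the traceback."""
    return int(re.findall(r"\d{4}", date_string)[0])


# Hypothetical rows; the real data set is not shown in the issue.
data = [["August 19, 1971", "Mexico"], ["October 31, 1973", "Ireland"]]


def convert_years(rows):
    """Replace the date in column 0 with its year, skipping converted rows."""
    for row in rows:
        # After a first pass row[0] is an int, and re.findall would raise
        # TypeError on it, so only strings are converted.
        if isinstance(row[0], str):
            row[0] = fetch_year(row[0])


convert_years(data)
convert_years(data)  # second run does nothing instead of failing
print(data)
```

If the values turn out not to be strings even on a fresh run, printing `type(row[0])` for the first few rows should show what the loop is actually receiving.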
  • what parameter did you consider to know if it is an album?

    Regarding Mission191Solutions.ipynb: it seems that every album_id is non-null and refers to a real album. We have sales/listens/plays for 2 tracks by the same artist, which is not a proper album. Could you please explain the last query?

    opened by KC2016 0
  • Found something in Mission350

    Following the guide in Mission350Solutions.ipynb (I really appreciate this rookie-friendly notebook), I have found something about duplicated apps. 🤒

    First, it's not proper to identify duplicated apps just by their names. For example, there are two apps of different types (game and family) that are both named Solitaire, but you will lose this information if you judge duplication only by app name. Second, the app named Cardiac diagnosis (heart rate, arrhythmia) has both a free version and a paid version; if you handle it the same way as Solitaire, you will lose either the free entry or the paid entry, which will influence your analysis of all free apps.

    Not sure if I have made myself clear. Once again, I really appreciate your brilliant work!

    opened by ERICMIAO0817 0
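The point raised above can be illustrated by comparing two deduplication keys. This is a minimal sketch, not the notebook's method. The helper `deduplicate`, the dictionary field names, and the sample rows (including the MEDICAL category) are all hypothetical. It keeps the first row seen for each key.

```python
def deduplicate(apps, key_fields=("name", "category", "type")):
    """Keep the first app seen for each distinct combination of key_fields."""
    seen = set()
    unique = []
    for app in apps:
        key = tuple(app[field] for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(app)
    return unique


# Invented rows modelled on the two examples mentioned in the issue.
sample_apps = [
    {"name": "Solitaire", "category": "GAME", "type": "Free"},
    {"name": "Solitaire", "category": "FAMILY", "type": "Free"},
    {"name": "Cardiac diagnosis (heart rate, arrhythmia)", "category": "MEDICAL", "type": "Free"},
    {"name": "Cardiac diagnosis (heart rate, arrhythmia)", "category": "MEDICAL", "type": "Paid"},
    {"name": "Solitaire", "category": "GAME", "type": "Free"},  # true duplicate
]

by_name = deduplicate(sample_apps, key_fields=("name",))
by_name_category_type = deduplicate(sample_apps)

print(len(by_name), len(by_name_category_type))
```

Keying on the name alone collapses the five rows to two, while keying on name, category, and type removes only the one exact repeat and keeps four.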
Owner
Dataquest
Learn data science online