Web Scraping COVID 19 Meta Portal with Python

Aarif Munwar Jahan

Last update: Jan 4, 2022

Related tags

Web Crawling Web-Scraping-COVID-19-Meta-Portal-with-Python

Overview

Web-Scraping-COVID-19-Meta-Portal-with-Python

Requests API and Beautiful Soup to scrape real-time COVID statistics from worldometer website and perform data cleaning and visual analysis in Jupyter notebook.

Data Preparation Notebook

In the first module, web scraping techniques using requests, beautifulsoup packages are utilized to collect and manipulate COVID related data from the worldometer website

The notebook has a total of five code blocks.

The first four code blocks provide the following data:

Summary Data for ALL Global COVID Cases
Summary Data for ACTIVE Global COVID Cases
Summary Data for CLOSED Global COVID Cases
Tabular Data for COVID Cases by Country

The fifth and final code block provides an interactive interface for exporting each of these four tables

Data Analysis Notebook

In the second module, data analysis techniques using pandas, numpy, seaborn and statsmodels packages are utilized to collect effective insights from the data and plot necessary graphs. The raw csv data is the same table we collected in Part A of the project taken from the worldometer website regarding COVID cases tabulated by country.

The notebook has a total of twelve code blocks.

Importing a CSV file, reading it and counting no. of rows and columns
Using the to_numeric method to ensure all numerical columns get passed as numeric
Using the describe function to display and analyze basic statistical data on the numerical columns of the imported data
Working with a smaller set of imported data - Top 20 countries with most cases
Horizontal bar chart to analyze total cases in the top 20 countries
Vertical bar chart to analyze total deaths in the top 20 countries with most cases
Distribution plot to analyze spread of data for Deaths/1M Population of the 20 countries
Using the describe function to display basic statistical data on the numerical columns of the REDUCED dataset
Comparing and analyzing mean and standard deviation between population of the Full dataset and the Reduced dataset
Using regression scatter plot to check for data independence between tests/million people and the size of the population
Finding and analyzing correlations between the variables in the dataset
Applying a statistical model to collect useful information about Total Cases and Total Deaths in the full data set

-- Aarif M Jahan -- May 08, 2021

You might also like...

Linkedin webscraping - Linkedin web scraping with python

linkedin_webscraping This is the first step of a full project called "LinkedIn J

4 Apr 24, 2022

Basic-html-scraper - A complete how to of web scraping with Python for beginners

basic-html-scraper Code from YT Video This video includes a complete how to of w

12 Oct 22, 2022

A training task for web scraping using python multithreading and a real-time-updated list of available proxy servers.

Parallel web scraping The project is a training task for web scraping using python multithreading and a real-time-updated list of available proxy serv

1 Feb 10, 2022

A Python Covid-19 cases tracker that scrapes data off the web and presents the number of Cases, Recovered Cases, and Deaths that occurred because of the pandemic.

1 Nov 13, 2021

A web scraping pipeline project that retrieves TV and movie data from two sources, then transforms and stores data in a MySQL database.

New to Streaming Scraper An in-progress web scraping project built with Python, R, and SQL. The scraped data are movie and TV show information. The go

1 Mar 28, 2022

Web Scraping COVID 19 Meta Portal with Python

Related tags

Overview

Web-Scraping-COVID-19-Meta-Portal-with-Python

Data Preparation Notebook

Data Analysis Notebook

You might also like...

Linkedin webscraping - Linkedin web scraping with python

Basic-html-scraper - A complete how to of web scraping with Python for beginners

A training task for web scraping using python multithreading and a real-time-updated list of available proxy servers.

A Python Covid-19 cases tracker that scrapes data off the web and presents the number of Cases, Recovered Cases, and Deaths that occurred because of the pandemic.

Web Scraping Framework

Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments)

🥫 The simple, fast, and modern web scraping library

A Web Scraping Program.

A web scraping pipeline project that retrieves TV and movie data from two sources, then transforms and stores data in a MySQL database.

Owner

Aarif Munwar Jahan

Scrapy, a fast high-level web crawling & scraping framework for Python.

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Async Python 3.6+ web scraping micro-framework based on asyncio

Transistor, a Python web scraping framework for intelligent use cases.

Web Scraping Practica With Python

Here I provide the source code for doing web scraping using the python library, it is Selenium.

Web Scraping OLX with Python and Bsoup.

Demonstration on how to use async python to control multiple playwright browsers for web-scraping

Web Scraping images using Selenium and Python

Web-scraping - A bot using Python with BeautifulSoup that scraps IRS website by form number and returns the results as json