A script that will warn you, by opening a new browser tab, when there are new content in your favourite websites.

Jaime Álvarez

Last update: Mar 15, 2022

Related tags

Miscellaneous webscraping begginer

Overview

web check

A script that will warn you, by opening a new browser tab, when there are new content in your favourite websites.

What it does

The script will check, when run, if there are any changes in the websites. If any changes are found, it will open a new browser tab.

Not every website can be scrap.

How does it work?

After adding an url, the script creates a copy of website's content in your hard drive. When run again, it will compare the website against that stored content line by line, if there are any differences a new tab will be open. Note: Script don't need to open browser when running, you'll only see the terminal.

A lot of websites have some kind of calendar, that means, every day there will be changes in those websites. To avoid this, you can add a unique css selector to each stored url. With this unique identification, the script targets only specific parts of the website, and avoid unnecessary calls to browser.

If there is a change, a new back up file will be created at storage/url_data/backup.

All urls are stored in a JSON file with all the needed information, including encoding.

How to get the unique css selector

Go to the website, right click in the zone you want the script to check. Go to inspect mode. Hover your mouse until you see (usually in blue) everything you want. Right click and copy selector. Paste that in the css field in add url, or modify url.

Set up

Install python from https://www.python.org/ (built under python 3.9)
Download repository web_check.git
Create a new virtual environment following the instructions in https://docs.python.org/3/library/venv.html

python3 -m venv /path/to/new/virtual/environment
Activate said venv
Install requirements.txt

pip install -r /path/to/requirements.txt
Script is ready!

Running the script

Once everything is installed, launch the script with web_check/main.pyw. There are four tabs.

Home: it's the main tab. From here you can launch checker.py with the button Run!. Checker.py it's in charge of all the logic. It will access your stored url and compare it with the actual website.
Add url: From this tab, you can add a new url for checking, and its unique css selector.

Important: url have to start with http:// or https://. Hit Submit new url and the script will make all necessary checks.

There is a second option, Import file. Import file will let you select a .txt file with several urls, and all of them will be stored.

The txt file needs to follow the structure: url(white space)css selector. Url only means script will download whole website. Only one url per line.

https://github.com/

https://www.reddit.com/ #SHORTCUT_FOCUSABLE_DIV

https://postal.fsc.ccoo.es/Inicio #divMainContent
Modify url: If you need to change an url css selector, you can do it from here. Enter a new css selector, or leave it empty for capturing the whole site, and hit submit.
Delete url: Two options for deleting. Check one, or several, urls and hit delete. Delete all will delete all urls stored.

At the Options' menu, it's possible to reset the url_list.txt if, for some reason, the file can't be read with 'reset url'. 'Create batch file' will let automate the script, for faster use.

Automate the script

There is no need to run web_check/main.py every time you want to check your websites, for that, only checker.py is required.

You can run checker.py manually whenever you want, but that's tedious and forgettable, first you would have to activate a virtual environment, and then, run checker.py. With 'Create batch file' you only have to point where python.exe is (the virtual environment one) and a directory where the file will be created.

After all, it's easier to run directly web_check.bat or add the batch file to windows' task scheduler.

Create shortcut

Create shortcut at Options' menu will create a batch file with all information about the script itself and the virtual environment.

Now you don't need to activate each time a venv, web_check.bat will take care of it.

Log file

Every time the script is run, script will output a log file. It clears its content automatically for easier reading. Any error, or info, will be written here.

Log is located in storage/logs/log.txt.

Copyright (C) 2021 Jaime Álvarez Fernández

Comments

Update some classes to functions.
Refactor:

CreateFolder = create_folder

DeleteUrlGUI = delete_url

ModifyCssGUI = modify_css_selector

get_charset move to class NewUrl

Add comments for easier reading

enhancement
opened by Jaime-alv 0
New frames for modify url and delete url

Create new frames in both, modify url and delete url. When calling def refresh, tabs don't get destroy, instead, only frames with the different url gets destroy and create again, getting a better refresh
enhancement

opened by Jaime-alv 0
Clean main.py

main.py needs to be cleaned with a class, it should be easier to pass arguments and whatever I need into the diferent functions. It will improve its readability
enhancement Must have

opened by Jaime-alv 0
Modify url
Closes #13, closes #4, closes #5

Add, modify and delete are all in the same file for easier access.

There is a new backup folder for those website with changes, you can see the differences manually.

Backup files have now _backup added.
opened by Jaime-alv 0
Some nice improvements
Closes #10, closes #7

Add encoding as a variable.

Upgraded script for using classes.

Change regex for domain_name(), now it should work better.

Setup.py works with if not exists() for all files, including log
opened by Jaime-alv 0

Releases(v.1.2.0)

v.1.2.0(Nov 9, 2021)
What's Changed

Upgrade file paths by @Jaime-alv in https://github.com/Jaime-alv/web_check/pull/38

Full Changelog: https://github.com/Jaime-alv/web_check/compare/v1.1.0...v.1.2.0
Source code(tar.gz)
Source code(zip)
v1.1.0(Oct 13, 2021)
New whats_new.txt file.

File clean each time

File starts with today's date for easier use.

Only write down differences.

temp.txt move to url_data.

Update README.

Source code(tar.gz)
Source code(zip)
v1.0.0(Oct 6, 2021)

It's ready!
Source code(tar.gz)
Source code(zip)
v0.5.1(Oct 5, 2021)
Several fixes

Better GUI

Add logo

Add info about the script

New menu in cascade

Source code(tar.gz)
Source code(zip)
v0.5.0(Oct 2, 2021)
Add GUI

Reformat files for different needs

New classes in add_url.py

Source code(tar.gz)
Source code(zip)
v.0.4.0(Sep 28, 2021)

The script works, it's only command line and you have to run it manually, but it works as intended. setup.py will create necessary folders and files required. add_url.py is the module in charge of adding, deleting and modifying the files and fields needed for the script. main.py is the one that compares websites with the ones already stored, if there is any difference it will open the website in your default browser. After adding a website, you only need to run main.py
Source code(tar.gz)
Source code(zip)
v0.3.1(Sep 27, 2021)

Now it's possible to delete url!
Source code(tar.gz)
Source code(zip)
v0.3.0(Sep 21, 2021)

Scripts works on a basic level. You can add a new url through add_url.py followed by a css selector and the script will store it. When run from main.py, script will compare to the stored version an open it if necessary.
Source code(tar.gz)
Source code(zip)

Owner

Jaime Álvarez

Passionate about tinkering and knowledge.

GitHub

Python script for changing the SSH banner content with other content

Banner-changer-py Python script for changing the SSH banner content with other content. The Script will take the content of a specified file range and

2 Nov 23, 2021

An implementation to rank your favourite songs from World of Walker

World-Of-Walker-Elo An implementation to rank your favourite songs from Alan Walker's 2021 album World of Walker. Uses the Elo rating system, which is

1 Nov 26, 2021

Dot Browser is a privacy-conscious web browser with smarts built-in for protection against trackers and advertisments online.

?? Take back your privacy with Dot Browser, the privacy-conscious web browser that protects you from being tracked and monitored online.

1k Jan 7, 2023

Hexa is an advanced browser.It can carry out all the functions present in a browser.

Hexa is an advanced browser.It can carry out all the functions present in a browser.It is coded in the language Python using the modules PyQt5 and sys mainly.It is gonna get developed more in the future.It is made specially for the students.Only 1 tab can be used while using it so that the students cant missuse the pandemic situation :)

1 Dec 10, 2021

A framework that let's you compose websites in Python with ease!

Perry Perry <= A framework that let's you compose websites in Python with ease! Perry works similar to Qt and Flutter, allowing you to create componen

13 Oct 9, 2022

Islam - This is a simple python script.In this script I have written all the suras of Al Quran. As a result, by using this script, you can know the number of any sura at the moment.

Introduction: If you want to know sura number of al quran by just typing the name of sura than you can use this script. Usage in termux: $ pkg install

1 Jan 2, 2022

A script where you execute a script that generates a base project for your gdextension

GDExtension Project Creator this is a script (currently only for linux) where you execute a script that generates a base project for your gdextension,

11 Nov 17, 2022

Simple Python tool to check if there is an Office 365 instance linked to a domain.

o365chk.py Simple Python script to check if there is an Office365 instance linked to a particular domain.

37 Jan 2, 2023

flake8 plugin which checks that there is no use of sleep in the code.

flake8-sleep flake8 plugin which checks for use of sleep function. installation Using Pypi: pip install flake8-sleep flake8 codes Code Description SLP

1 Nov 26, 2021

Traditionally, there is considerable friction for developers when setting up development environments

This self-led, half-day training will teach participants the patterns and best practices for working with GitHub Codespaces

12 Dec 2, 2022

switching computer? changing your setup? You need to automate the download of your current setup? This is the right tool for you :incoming_envelope:

?? setup_shift(SS.py) switching computer? changing your setup? You need to automate the download of your current setup? This is the right tool for you

15 Aug 26, 2022

Demo content - Automate your automation!

Automate-AAP2 Demo Content - Automate your automation! A fully automated Ansible Automation Platform. Context Installing and configuring Ansible Autom

0 Oct 27, 2022

A one place destination to check whatever is trending on the top social and news websites at present.

UpTrend A one place destination to check whatever is trending on the top social and news websites at present. Explore the docs » View Demo · Report Bu

10 Oct 3, 2021

A light library to build tiny websites

1 Dec 23, 2021

PORTSCANNING-IN-PYTHON - A python threaded portscanner to scan websites and ipaddresses

PORTSCANNING-IN-PYTHON This is a python threaded portscanner to scan websites an

1 Feb 16, 2022

This python application let you check for new announcements from MMLS, take attendance while your lecturer is sharing QR Code on the screen.

5 Jul 17, 2022

Python script to commit to your github for a perfect commit streak. This is purely for education purposes, please don't use this script to do bad stuff.

Daily-Git-Commit Commit to repo every day for the perfect commit streak Requirments pip install -r requirements.txt Setup Download this repository. Cr

34 Dec 14, 2022

This is a vscode extension with a Virtual Assistant that you can play with when you are bored or you need help..

VS Code Virtual Assistant This is a vscode extension with a Virtual Assistant that you can play with when you are bored or you need help. Its currentl

6 Aug 22, 2021

You can easily send campaigns, e-marketing have actually account using cash will thank you for using our tools, and you can support our Vodafone Cash +201090788026

*** Welcome User Sorry I Mean Hello Brother ✓ Devolper and Design : Mokhtar Abdelkreem ========================================== You Can Follow Us O

1 Nov 3, 2021

A script that will warn you, by opening a new browser tab, when there are new content in your favourite websites.

Related tags

Overview

web check

What it does

How does it work?

How to get the unique css selector

Set up

Running the script

Automate the script

Create shortcut

Log file

Copyright (C) 2021 Jaime Álvarez Fernández

Comments

Update some classes to functions.

New frames for modify url and delete url

Clean main.py

Modify url

Some nice improvements

Releases(v.1.2.0)

v.1.2.0(Nov 9, 2021)

What's Changed

v1.1.0(Oct 13, 2021)

v1.0.0(Oct 6, 2021)

v0.5.1(Oct 5, 2021)

v0.5.0(Oct 2, 2021)

v.0.4.0(Sep 28, 2021)

v0.3.1(Sep 27, 2021)

v0.3.0(Sep 21, 2021)

Owner

Jaime Álvarez

Python script for changing the SSH banner content with other content

An implementation to rank your favourite songs from World of Walker

Dot Browser is a privacy-conscious web browser with smarts built-in for protection against trackers and advertisments online.

Hexa is an advanced browser.It can carry out all the functions present in a browser.

A framework that let's you compose websites in Python with ease!

Islam - This is a simple python script.In this script I have written all the suras of Al Quran. As a result, by using this script, you can know the number of any sura at the moment.

A script where you execute a script that generates a base project for your gdextension

Simple Python tool to check if there is an Office 365 instance linked to a domain.

flake8 plugin which checks that there is no use of sleep in the code.

Traditionally, there is considerable friction for developers when setting up development environments

switching computer? changing your setup? You need to automate the download of your current setup? This is the right tool for you :incoming_envelope:

Demo content - Automate your automation!

A one place destination to check whatever is trending on the top social and news websites at present.

A light library to build tiny websites

PORTSCANNING-IN-PYTHON - A python threaded portscanner to scan websites and ipaddresses

This python application let you check for new announcements from MMLS, take attendance while your lecturer is sharing QR Code on the screen.

Python script to commit to your github for a perfect commit streak. This is purely for education purposes, please don't use this script to do bad stuff.

This is a vscode extension with a Virtual Assistant that you can play with when you are bored or you need help..

You can easily send campaigns, e-marketing have actually account using cash will thank you for using our tools, and you can support our Vodafone Cash +201090788026