Baua Biocides Scraper
Scrapping the data from each page of biocides listed on the BAUA website (https://www.baua.de/DE/Biozid-Meldeverordnung/Offen/offen.html) into a csv file.
A windows standalone client is avalaible in the dist folder
About the project
What's the problem?
Baua website contains many usefull data for biocides domain, but the website only allows you to search product by product and it is not easy to find and get some informations with over 80,000 products listed
The idea
Facilitate the data manipulation with providing a csv file with all data scraped from Baua website.
How does it work ?
- The user start the program.
- The program extract data from Baua website.
- A csv file containing data are created.
Roadmap
This project was created after a request and is not intended to evolve. Nevertheless you can fork the project to improve it by yourself and propose them via the project pull requests. or make a suggestion via the project issues.
Build with
- Programming language : Python 3.10.0
- Scraping Framework : Scrapy 2.5.1
- HTTP library : Requests 2.26.0
- Standalone Builder : PyInstaller 4.7
Demo
You can use the windows standalone client in the dist folder
Version management
We use a semantic version management, that is a version number MAJOR.MINOR.CORRECTIVE :
- the MAJOR version number when there are non backward compatible changes,
- the MINOR version number when there are backward compatible feature additions,
- the FIX version number when there are backwards compatible bug fixes.
See SignMail tags For more info: semver.org
Authors
- Eric De Maria - Numio - Initial work
License
This project is licensed under the GNU GPL 3 license - See the LICENSE file for more details.