A webmining CLI tool & library for python.

médialab Sciences Po

Last update: Dec 17, 2022

Overview

minet is a webmining command line tool & library for python (>= 3.6) that can be used to collect and extract data from a large variety of web sources such as raw webpages, Facebook, CrowdTangle, YouTube, Twitter, Media Cloud etc.

It adopts a very simple approach to various webmining problems by letting you perform a variety of actions from the comfort of the command line. No database needed: raw CSV files should be sufficient to do most of the work.

In addition, minet also exposes its high-level programmatic interface as a python library so you can tweak its behavior at will.

Shortcuts: Command line documentation, Python library documentation.

What it does

Minet can single-handedly:

Extract URLs from a text file (or a table)
Parse URLs (get useful information, with Facebook- and YouTube-specific stuff)
Join two CSV files by matching the columns containing URLs
From a list of URLs, resolve their redirections
- ...and check their HTTP status
- ...and download the HTML
- ...and extract hyperlinks
- ...and extract the text content and other metadata (title...)
- ...and scrape structured data (using a declarative language to define your heuristics)
Crawl (using a declarative language to define a browsing behavior, and what to harvest)
Mine or search:
- CrowdTangle (requires API access)
- Media Cloud (requires free API access)
- Twitter (requires free API access)
- YouTube (requires free API access)
Scrape (without requiring special access):
- Facebook
- Twitter
- Google Drive (spreadsheets etc.)
Grab & dump cookies from your browser
Dump Hyphe data

Documented use cases

Fetching a large number of URLs
Joining two CSV files by URLs
Using minet from a Jupyter notebook (very useful to experiment with the tool or teach students)
Downloading images associated with a given hashtag on Twitter
Scraping DSL Tutorial
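
As a rough illustration of what joining two CSV files by URL involves, the sketch below normalizes the URLs in each file and performs an inner join on the normalized form. It relies only on the standard library and is not minet's actual matching logic: `normalize_url`, `join_by_url` and the sample column names are hypothetical.

```python
import csv
import io
from urllib.parse import urlsplit


def normalize_url(url):
    """Crude normalization: drop scheme, 'www.' prefix, fragment and trailing slash."""
    parts = urlsplit(url.strip())
    host = (parts.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    path = parts.path.rstrip("/")
    query = "?" + parts.query if parts.query else ""
    return host + path + query


def join_by_url(left_rows, right_rows, left_column, right_column):
    """Inner-join two lists of dict rows on their normalized URL columns."""
    index = {}
    for row in right_rows:
        index.setdefault(normalize_url(row[right_column]), []).append(row)

    joined = []
    for row in left_rows:
        for match in index.get(normalize_url(row[left_column]), []):
            # Left columns win when both files share a column name.
            joined.append({**match, **row})
    return joined


if __name__ == "__main__":
    # Hypothetical in-memory CSV files standing in for files on disk.
    articles = io.StringIO(
        "url,title\nhttps://www.example.com/a/,First\nhttp://example.com/b,Second\n"
    )
    shares = io.StringIO("link,count\nhttp://example.com/a,12\n")

    rows = join_by_url(
        list(csv.DictReader(articles)), list(csv.DictReader(shares)), "url", "link"
    )
    print(rows)
```

Normalizing before matching is what lets `https://www.example.com/a/` and `http://example.com/a` be treated as the same page.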

Features (from a technical standpoint)

Multithreaded, memory-efficient fetching from the web.
Multithreaded, scalable crawling using a comfy DSL.
Multiprocessed raw text content extraction from HTML pages.
Multiprocessed scraping from HTML pages using a comfy DSL.
URL-related heuristics utilities such as extraction, normalization and matching.
Data collection from various APIs such as CrowdTangle.
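
The general idea behind multithreaded fetching can be sketched with the standard library's thread pool. This is an illustration under stated assumptions, not minet's implementation: `fetch_status` and `fetch_all` are hypothetical names, and concerns such as per-domain throttling, retries and redirection handling are left out.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from urllib.request import urlopen


def fetch_status(url, timeout=10):
    """Fetch a URL and return its HTTP status code (network access required)."""
    with urlopen(url, timeout=timeout) as response:
        return response.status


def fetch_all(urls, fetch=fetch_status, threads=8):
    """Call `fetch` on every URL using a thread pool, yielding (url, result, error)."""
    with ThreadPoolExecutor(max_workers=threads) as pool:
        futures = {pool.submit(fetch, url): url for url in urls}
        # Results are yielded as soon as each call completes, not in input order.
        for future in as_completed(futures):
            url = futures[future]
            try:
                yield url, future.result(), None
            except Exception as error:  # report the failure instead of aborting
                yield url, None, error
```

Because `fetch` is a parameter, the same loop can be tried offline by passing a function that returns canned values instead of hitting the network.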

Installation

minet can be installed as a standalone CLI tool (currently only on mac >= 10.14, ubuntu & similar) by running the following command in your terminal:

curl -sSL https://raw.githubusercontent.com/medialab/minet/master/scripts/install.sh | bash

Don't trust us enough to pipe the result of an HTTP request into bash? We wouldn't either, so feel free to read the installation script here and run it on your end if you prefer.

On ubuntu & similar you might need to install curl and unzip before running the installation script if you don't already have them:

sudo apt-get install curl unzip

Otherwise, minet can be installed directly as a python CLI tool and library using pip:

pip install minet
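
After a pip installation, one way to confirm that the package is visible to your interpreter is to query its installed version. The snippet below is a small sketch that uses `importlib.metadata` from the standard library, which requires Python 3.8 or later; the `installed_version` helper is a hypothetical name and not part of minet.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Prints a version string if minet is installed in this environment, else None.
    print(installed_version("minet"))
```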

If you need more help to install and use minet from scratch, you can check these installation documents.

Finally, if you want to install the standalone binaries by yourself (even for windows), you can find them in each release here.

Upgrading

To upgrade the standalone version, simply run the install script once again:

curl -sSL https://raw.githubusercontent.com/medialab/minet/master/scripts/install.sh | bash

To upgrade the python version you can use pip thusly:

pip install -U minet

Uninstallation

To uninstall the standalone version:

curl -sSL https://raw.githubusercontent.com/medialab/minet/master/scripts/uninstall.sh | bash

To uninstall the python version:

pip uninstall minet

Documentation

Contributing

To contribute to minet you can check out this documentation.

How to cite

minet is published on Zenodo.

You can cite it thusly:

Guillaume Plique, Pauline Breteau, Jules Farjas, Héloïse Théro, Jean Descamps, & Amélie Pellé. (2019, October 14). Minet, a webmining CLI tool & library for python. Zenodo. http://doi.org/10.5281/zenodo.4564399

Comments

casanova.exceptions.EmptyFileError

I am trying to run minet in a github action. It fails with the following message:

  minet tw scrape tweets -o tweets.csv "from:@taniki #tutotal2022"
  shell: /usr/bin/bash -e {0}
  env:
    pythonLocation: /opt/hostedtoolcache/Python/3.9.5/x64
    LD_LIBRARY_PATH: /opt/hostedtoolcache/Python/3.9.5/x64/lib

Collecting tweets: 0 tweets [00:00, ? tweets/s]Traceback (most recent call last):
  File "/opt/hostedtoolcache/Python/3.9.5/x64/lib/python3.9/site-packages/casanova/reader.py", line 151, in __init__
    fieldnames = next(self.reader)
StopIteration

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/opt/hostedtoolcache/Python/3.9.5/x64/bin/minet", line 8, in <module>
    sys.exit(main())
  File "/opt/hostedtoolcache/Python/3.9.5/x64/lib/python3.9/site-packages/minet/cli/__main__.py", line 218, in main
    fn(cli_args)
  File "/opt/hostedtoolcache/Python/3.9.5/x64/lib/python3.9/site-packages/minet/cli/twitter/__init__.py", line 33, in twitter_action
    twitter_scrape_action(cli_args)
  File "/opt/hostedtoolcache/Python/3.9.5/x64/lib/python3.9/site-packages/minet/cli/twitter/scrape.py", line 45, in twitter_scrape_action
    enricher = casanova.enricher(
  File "/opt/hostedtoolcache/Python/3.9.5/x64/lib/python3.9/site-packages/casanova/enricher.py", line 31, in __init__
    super().__init__(input_file, no_headers=no_headers, **kwargs)
  File "/opt/hostedtoolcache/Python/3.9.5/x64/lib/python3.9/site-packages/casanova/reader.py", line 157, in __init__
    raise EmptyFileError
casanova.exceptions.EmptyFileError

Collecting tweets: 0 tweets [00:00, ? tweets/s]
Error: Process completed with exit code 1.

opened by taniki 16
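
The traceback above shows the header read `next(self.reader)` raising `StopIteration`, which is then re-raised as `EmptyFileError`. The pattern can be reproduced with the standard `csv` module. This is a sketch only: the `EmptyFileError` class and `read_header` function below are stand-ins defined for illustration, not casanova's code.

```python
import csv
import io


class EmptyFileError(Exception):
    """Stand-in for casanova.exceptions.EmptyFileError (defined here for illustration)."""


def read_header(file):
    """Return the header row of a CSV file, raising EmptyFileError if there is none."""
    reader = csv.reader(file)
    try:
        return next(reader)
    except StopIteration:
        # No first row at all: the input is empty.
        raise EmptyFileError from None


if __name__ == "__main__":
    print(read_header(io.StringIO("id,text\n1,hello\n")))
    try:
        read_header(io.StringIO(""))
    except EmptyFileError:
        print("empty input")
```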

Get Retweeters

Hi, thanks for the last release. I'm glad to see there is a Retweeters tool, but I have been running into issues with it for a few days. I may not have understood how it should be used. I run it and I get this error: Could someone who has managed to use it help me?

Thank you

opened by jlbreeeez 15

Twitter API scraper: acquire guest_token by API

New method to acquire the guest_token through the activate API. Relates to #384 and #382.

Method taken from @JustAnotherArchivist in snscrape see: https://github.com/JustAnotherArchivist/snscrape/commit/0336ce13edbd195b3e91487061a0e7a2857f0c68 Thanks for sharing the solution.

For now this edit is simply a new method to acquire the token. The token is used as a cookie as before but it's not preserved on disk in case of multiple calls.

opened by paulgirard 11

tw scrape fails on some queries due to Over capacity error

minet tw scrape tweets '#5gcovid' > tweets.csv

<class 'minet.twitter.exceptions.TwitterPublicAPIInvalidResponseError'>

{'errors': [{'message': 'Over capacity', 'code': 130}]} 503
bug

opened by Yomguithereal 10

[retweeters] KeyError: 'url'

Hi, when I try to retrieve the retweeters list from a file containing tweets previously extracted from Twitter using the minet scraper, I get this error after scanning a few tweets from my list (after 7, 10, or 30 tweets scanned; it depends on the database). Has anyone encountered this error before? Thanks for helping :-)

opened by tloops329384 8

Unable to extract all the tweets matching a query

When I run a query with a keyword plus a user as criteria, the result is very erratic: sometimes 0 tweets, sometimes 1 tweet, sometimes 20 tweets, sometimes 80 tweets, etc., without ever reaching a complete extraction (which is only about 200 tweets). I have rerun this query many times without ever extracting all of the tweets in question.

What should I do to achieve this? Thank you.

opened by parisGH 8

[twitter] unable to get user tweets

Hello,

Thanks for sharing the lib with the community. I am not able to get user tweets; I get this error:

Traceback (most recent call last):
  File "/home/bafou/.local/bin/minet", line 8, in <module>
    sys.exit(main())
  File "/home/bafou/.local/pipx/venvs/minet/lib/python3.8/site-packages/minet/cli/__main__.py", line 198, in main
    to_close = resolve_arg_dependencies(cli_args, config)
  File "/home/bafou/.local/pipx/venvs/minet/lib/python3.8/site-packages/minet/cli/argparse.py", line 290, in resolve_arg_dependencies
    setattr(cli_args, name, value.resolve(config))
  File "/home/bafou/.local/pipx/venvs/minet/lib/python3.8/site-packages/minet/cli/argparse.py", line 253, in resolve
    return getpath(config, self.key, self.default)
  File "/home/bafou/.local/pipx/venvs/minet/lib/python3.8/site-packages/ebbe/utils.py", line 72, in getpath
    target = target[step]
TypeError: string indices must be integers

when executing minet tw user-tweets screen_name users.csv > tweets.csv with users.csv

Regards.

bug

opened by billmetangmo 6

GH Actions + minet Twitter scrape fails

Hi,

I have this GH action to generate a Twitter scrape CSV (written by @taniki):

name: scrape bfm

on:
  workflow_dispatch:
  schedule:
    - cron:  '0 9 * * *'

jobs:
  scrape_bfm:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v2
      - uses: actions/setup-python@v2
        with:
          python-version: '3.x'
      - name: install minet
        run: |
          python -m pip install --upgrade pip
          pip install minet==0.56.2
      - name: scrape @BFMTV tweets
        shell: bash
        run: |
          minet tw scrape tweets "from:@BFMTV since:2021-09-01" > bfmtv-tweets.csv
      - name: commit
        uses: ./.github/actions/commit
        with:
          message: lol @bfmtv

Sometimes there is no problem. Sometimes GH returns this error log:

Run minet tw scrape tweets "from:@CNEWS since:2021-09-01" > cnews-tweets.csv
Collecting tweets: 0 tweets [00:00, ? tweets/s]                            
Collecting tweets: 0 tweets [00:00, ? tweets/s]                   
Searching for "from:@CNEWS since:2021-09-01"

Collecting tweets: 0 tweets [00:00, ? tweets/s]
Collecting tweets: 0 tweets [00:00, ? tweets/s, queries=1, tokens=1]Traceback (most recent call last):
  File "/opt/hostedtoolcache/Python/3.10.1/x64/bin/minet", line 8, in <module>
    sys.exit(main())
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/minet/cli/__main__.py", line 218, in main
    fn(cli_args)
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/minet/cli/twitter/__init__.py", line 31, in twitter_action
    twitter_scrape_action(cli_args)
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/minet/cli/twitter/scrape.py", line 69, in twitter_scrape_action
    for tweet, meta in iterator:
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/minet/twitter/api_scraper.py", line 370, in search
    new_cursor, tweets = retryer(self.request_search, query, cursor, refs=refs)
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/tenacity/__init__.py", line 404, in __call__
    do = self.iter(retry_state=retry_state)
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/tenacity/__init__.py", line 349, in iter
    return fut.result()
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/concurrent/futures/_base.py", line 438, in result
    return self.__get_result()
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/concurrent/futures/_base.py", line 390, in __get_result
    raise self._exception
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/tenacity/__init__.py", line 407, in __call__
    result = fn(*args, **kwargs)
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/minet/twitter/api_scraper.py", line 72, in wrapped
    self.acquire_guest_token()
  File "/opt/hostedtoolcache/Python/3.10.1/x64/lib/python3.10/site-packages/minet/twitter/api_scraper.py", line 261, in acquire_guest_token
    raise TwitterGuestTokenError
minet.twitter.exceptions.TwitterGuestTokenError

Collecting tweets: 0 tweets [00:00, ? tweets/s, queries=1, tokens=1]
Error: Process completed with exit code 1.

I don't understand. Has anyone had the same problem? Does Twitter sometimes ban GH?

Thanks for minet, great tool!

opened by stefw 6

Access denied

Foreword: sorry, I'm new on GitHub and I'm not sure this is the appropriate place to post my question. Is it?

Hi, first, thank you for the tool, which will help me a lot in my research! I have a problem, which I think is not that complicated: when I run minet to get the "friends" of the twitter_users contained in the data_users.csv file, I can't access the file: "Permission Denied". I tried opening CMD as an Administrator, but it didn't solve the problem. Can you help me?

opened by jlbreeeez 6

Error installing with pip install mineit

Installing mineit via pip does not work. It says: "Collecting mineit Could not install packages due to an EnvironmentError: 404 Client Error: Not Found for url: https://pypi.org/simple/mineit/"

Is this issue already solved?

opened by moonisali 6

Twitter scrape: systematic TwitterGuestTokenError with v0.56.2 or v0.56.1

As in #382, I experience systematic TwitterGuestTokenError exceptions. This was not the case a few weeks ago. I didn't test versions other than 0.56.1 and 0.56.2.

Looks like we need to review the twitter scrape heuristic. I will try to have a look later today or tomorrow.
bug

opened by paulgirard 5

instagram

[ ] get comments from a post id: https://www.instagram.com/api/v1/media/POST_ID/comments/?can_support_threading=true&permalink_enabled=false

[x] get user info from username: https://i.instagram.com/api/v1/users/web_profile_info/?username=USERNAME

[ ] other route for posts associated with hashtag (more info but don't know how to change page): https://www.instagram.com/api/v1/tags/web_info/?tag_name=HASHTAG

[ ] get post info from post id: https://www.instagram.com/api/v1/media/POST_ID/info/

[ ] get post likers from post id (it seems that we can only have access to a limited number of them): https://www.instagram.com/api/v1/media/POST_ID/likers/

Need 'cookie' and 'x-ig-app-id'
enhancement
opened by MiguelLaura 0

Releases(0.66.1)

0.66.1(Dec 13, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.08 MB)
ubuntu_18.zip(43.44 MB)
ubuntu_20.zip(44.94 MB)
ubuntu_22.zip(44.28 MB)
windows.zip(29.70 MB)
0.66.0(Dec 7, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.08 MB)
ubuntu_18.zip(43.44 MB)
ubuntu_20.zip(44.95 MB)
ubuntu_22.zip(44.28 MB)
windows.zip(29.70 MB)
0.65.0(Nov 9, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.05 MB)
ubuntu_18.zip(43.43 MB)
ubuntu_20.zip(44.93 MB)
ubuntu_22.zip(44.27 MB)
windows.zip(29.90 MB)
0.64.0(Nov 8, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.04 MB)
ubuntu_18.zip(43.41 MB)
ubuntu_20.zip(44.92 MB)
ubuntu_22.zip(44.25 MB)
windows.zip(29.88 MB)
0.63.1(Oct 14, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.05 MB)
ubuntu_18.zip(43.44 MB)
ubuntu_20.zip(44.94 MB)
ubuntu_22.zip(44.27 MB)
windows.zip(29.89 MB)
0.63.0(Oct 14, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.05 MB)
ubuntu_18.zip(43.44 MB)
ubuntu_20.zip(44.94 MB)
ubuntu_22.zip(44.27 MB)
windows.zip(29.89 MB)
0.62.1(Sep 26, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.04 MB)
ubuntu_18.zip(43.42 MB)
ubuntu_20.zip(44.93 MB)
ubuntu_22.zip(44.27 MB)
windows.zip(29.89 MB)
0.62.0(Sep 21, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.04 MB)
ubuntu_18.zip(43.42 MB)
ubuntu_20.zip(44.93 MB)
ubuntu_22.zip(44.27 MB)
windows.zip(29.89 MB)
0.61.6(Sep 14, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.04 MB)
ubuntu_18.zip(43.42 MB)
ubuntu_20.zip(44.92 MB)
windows.zip(29.88 MB)
0.61.5(Aug 10, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.05 MB)
ubuntu_18.zip(43.42 MB)
ubuntu_20.zip(44.81 MB)
windows.zip(29.87 MB)
v0.61.4(Jul 29, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.05 MB)
ubuntu_18.zip(43.42 MB)
ubuntu_20.zip(44.81 MB)
windows.zip(29.87 MB)
v0.61.3(Jul 27, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.04 MB)
ubuntu_18.zip(43.42 MB)
ubuntu_20.zip(44.81 MB)
windows.zip(29.86 MB)
v0.61.2(Jul 27, 2022)

Source code(tar.gz)
Source code(zip)
ubuntu_18.zip(43.42 MB)
ubuntu_20.zip(44.81 MB)
windows.zip(29.86 MB)
0.61.1(Jul 26, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.05 MB)
ubuntu_18.zip(43.42 MB)
ubuntu_20.zip(44.81 MB)
windows.zip(29.86 MB)
0.61.0(Jul 25, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(30.05 MB)
ubuntu_18.zip(43.43 MB)
ubuntu_20.zip(44.81 MB)
windows.zip(29.86 MB)
0.60.4(May 19, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.95 MB)
ubuntu_18.zip(43.09 MB)
ubuntu_20.zip(44.46 MB)
windows.zip(29.77 MB)
0.60.3(May 5, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.85 MB)
ubuntu_18.zip(42.98 MB)
ubuntu_20.zip(44.39 MB)
windows.zip(29.67 MB)
0.60.2(Apr 27, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.85 MB)
ubuntu_18.zip(42.98 MB)
ubuntu_20.zip(44.39 MB)
windows.zip(29.67 MB)
0.60.1(Apr 11, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.71 MB)
ubuntu_18.zip(42.60 MB)
ubuntu_20.zip(44.01 MB)
windows.zip(29.46 MB)
0.60.0(Apr 6, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.83 MB)
ubuntu_18.zip(42.72 MB)
ubuntu_20.zip(44.13 MB)
windows.zip(29.58 MB)
0.59.0(Apr 6, 2022)

Source code(tar.gz)
Source code(zip)
ubuntu_18.zip(42.81 MB)
ubuntu_20.zip(44.23 MB)
0.58.1(Mar 2, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.39 MB)
ubuntu_18.zip(42.79 MB)
ubuntu_20.zip(44.21 MB)
windows.zip(29.43 MB)
0.58.0(Feb 23, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.42 MB)
ubuntu_18.zip(42.82 MB)
ubuntu_20.zip(44.24 MB)
windows.zip(29.46 MB)
0.57.0(Feb 14, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.39 MB)
ubuntu_18.zip(42.79 MB)
ubuntu_20.zip(44.21 MB)
windows.zip(29.43 MB)
0.56.4(Jan 12, 2022)

Source code(tar.gz)
Source code(zip)
macos.zip(29.22 MB)
ubuntu_18.zip(42.62 MB)
ubuntu_20.zip(44.04 MB)
windows.zip(29.22 MB)
0.56.2(Dec 17, 2021)

Source code(tar.gz)
Source code(zip)
macos.zip(29.22 MB)
ubuntu_18.zip(42.62 MB)
ubuntu_20.zip(44.04 MB)
windows.zip(29.24 MB)
0.56.1(Dec 8, 2021)

Source code(tar.gz)
Source code(zip)
macos.zip(29.21 MB)
ubuntu_18.zip(42.62 MB)
ubuntu_20.zip(44.04 MB)
windows.zip(29.24 MB)
0.56.0(Dec 6, 2021)

Source code(tar.gz)
Source code(zip)
macos.zip(29.20 MB)
ubuntu_18.zip(42.60 MB)
ubuntu_20.zip(44.03 MB)
windows.zip(29.23 MB)
0.55.9(Nov 19, 2021)

Source code(tar.gz)
Source code(zip)
macos.zip(29.14 MB)
ubuntu_18.zip(42.52 MB)
ubuntu_20.zip(43.94 MB)
windows.zip(28.84 MB)
0.55.8(Nov 19, 2021)

Source code(tar.gz)
Source code(zip)
macos.zip(29.14 MB)
ubuntu_18.zip(42.52 MB)
ubuntu_20.zip(43.94 MB)
windows.zip(28.84 MB)

Owner

médialab Sciences Po

SciencesPo's médialab is an interdisciplinary research laboratory gathering engineers, designers & social science researchers.

GitHub

Sink is a CLI tool that allows users to synchronize their local folders to their Google Drives. It is similar to the Git CLI and allows fast and reliable syncs with the drive.

Sink is a CLI synchronisation tool that enables a user to synchronise local system files and folders with their Google Drives. It follows a git C

16 May 29, 2022

[WIP]An ani-cli like cli tool for movies and webseries

mov-cli A cli to browse and watch movies. Installation This project is a work in progress. However, you can try it out python git clone https://github

166 Dec 30, 2022

Python-Stock-Info-CLI: Get stock info through CLI by passing stock ticker.

Python-Stock-Info-CLI Get stock info through CLI by passing stock ticker. Installation Use the following command to install the required modules at on

1 Nov 5, 2021

Yts-cli-streamer - A CLI movie streaming client which works on yts.mx API written in python

YTSP It is a CLI movie streaming client which works on yts.mx API written in pyt

1 Feb 5, 2022

flora-dev-cli (fd-cli) is command line interface software to interact with flora blockchain.

Install git clone https://github.com/Flora-Network/fd-cli.git cd fd-cli python3 -m venv venv source venv/bin/activate pip install -e . --extra-index-u

14 Sep 11, 2022

AWS Interactive CLI - Allows you to execute a complex AWS commands by chaining one or more other AWS CLI dependency

2 Dec 10, 2021

Keybase-cli - Keybase docker container that exposes the keybase CLI and some common commands such as getting files or loading github action secrets

keybase-cli Keybase docker container that exposes the keybase CLI and some commo

4 Aug 4, 2022

CLI tool and python library that converts the output of popular command-line tools and file-types to JSON or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts.

jc JSONifies the output of many CLI tools and file-types for easier parsing in scripts

5.8k Jan 3, 2023

Python package with library and CLI tool for analyzing SeaFlow data

Seaflowpy A Python package for SeaFlow flow cytometer data. Table of Contents Install Read EVT/OPP/VCT Files Command-line Interface Configuration Inte

3 Nov 3, 2021

cli simple python script to interact with iphone afc api based on python library( tidevice )

afcclient cli simple python script to interact with iphone afc api based on python library( tidevice ) installation pip3 install -U tidevice cp afccli

2 Jul 15, 2022

Dead simple CLI tool to try Python packages - It's never been easier! :package:

try - It's never been easier to try Python packages try is an easy-to-use cli tool to try out Python packages. Features Install specific package versi

659 Dec 28, 2022

Unofficial Open Corporates CLI: OpenCorporates is a website that shares data on corporations under the copyleft Open Database License. This is an unofficial open corporates python command line tool.

Unofficial Open Corporates CLI OpenCorporates is a website that shares data on corporations under the copyleft Open Database License. This is an unoff

30 Sep 8, 2022

🐍 Python CLI tool to get public information from a GitHub account

A simple Python CLI tool that draws routes/paths on a given map.

A Simple Python CLI Lockpicking Tool

This tool is a free and unlimited python CLI for google translate. based on google_trans_new.

Python CLI utility and library for manipulating SQLite databases

Python Library and CLI for exporting MySQL databases

CmdTube is a Python CLI library for searching, downloading, and watching YouTube tutorials