GithubLicenseCrawler
This script is intended to crawl license information of repositories through the GitHub API. Taking a csv file with requirements.txt format the script will return a csv with the associated license information.
Input
Input file is expected to be a requirements.txt Expected format looks like this, for two exemplary repositories:
HeartSeg-Dataset==0.0.1
DeepDive==0.0.1
Output
Output file will be generated on the fly, named licenses.csv and the columns depict:
| Repo name | Repo Url | License name | License Url |
Running the script should look like this:
Contact and Contribute
[email protected] Obviously the Github API is way more powerful than what has been done here. Feel free to extend this code or preferably directly contribute here.