24 Python Pdfs Libraries

Convert scans of handwritten notes to beautiful, compact PDFs

4.8k Jan 1, 2023

Awesome-AI-books - Some awesome AI related books and pdfs for learning and downloading

Awesome AI books Some awesome AI related books and pdfs for downloading and learning. Preface This repo only used for learning, do not use in business

1k Jan 1, 2023

A bulk pdf generator. This application can generate PDFs in bulk by using just one click.

A bulk html pdf generator. This application can generate PDFs in bulk by using just one click. Screenshots Requirements 🧱 Your system must have the f

3 Apr 23, 2022

Scans pdfs for links written in plaintext and checks if they are active or returns an error code.

Scans pdfs for links written in plaintext and checks if they are active or returns an error code. It then generates a report of its findings. Extract references (pdf, url, doi, arxiv) and metadata from a PDF.

22 Nov 21, 2022

Excalibur: A web interface to extract tabular data from PDFs

Excalibur: A web interface to extract tabular data from PDFs Excalibur is a web interface to extract tabular data from PDFs, written in Python 3! It i

1.2k Jan 4, 2023

A spider for Universal Online Judge(UOJ) system, converting problem pages to PDFs.

Universal Online Judge Spider Introduction This is a spider for Universal Online Judge (UOJ) system (https://uoj.ac/). It also works for all other Onl

1 Dec 7, 2021

pdf_sprinkles: sprinkles text in your PDFs

pdf_sprinkles: sprinkles text in your PDFs pdf_sprinkles remotely OCRs a PDF with Google Cloud Document AI, and returns the result as a PDF with searc

2 Dec 17, 2021

Reads Data from given Excel File and exports Single PDFs and a complete PDF grouped by Gateway

E-Shelter Excel2QR Reads Data from given Excel File and exports Single PDFs and a complete PDF grouped by Gateway Features Reads Excel 2021 Export Sin

1 Nov 13, 2021

pikepdf is a Python library for reading and writing PDF files.

A Python library for reading and writing PDF, powered by qpdf

1.6k Jan 3, 2023

Fully automated download and parsing for Texas A&M University's Registrar's grade distribution PDFs for years 2014+.

Fully automated download and parsing for Texas A&M University's Registrar's grade distribution PDFs for years 2014+. Adds the parsing results to a mySQL database.

1 Sep 28, 2022

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

pdf-scraper-with-ocr With this tool I am aiming to facilitate the work of those who need to scrape PDFs either by hand or using tools that doesn't imp

75 Oct 21, 2022

Auto Convert PDFs to png files in python

This python tool, which is an application of PyMuPDF module, could auto convert PDFs to png files

4 Dec 5, 2021

This is the fuzzer I made to fuzz Preview on macOS and iOS like 8years back when I just started fuzzing things.

Fuzzing PDFs like its 1990s This is the fuzzer I made to fuzz Preview on macOS and iOS like 8years back when I just started fuzzing things. Some discl

14 Sep 30, 2022

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

pdf-scraper-with-ocr With this tool I am aiming to facilitate the work of those who need to scrape PDFs either by hand or using tools that doesn't imp

75 Oct 21, 2022

Pdfencrypt is a tool to encrypt/lock PDFs

Pdfencrypt Pdfencrypt is a tool to encrypt/lock PDFs Installation $ apt update $ apt upgrade $ apt install git $ apt install python $ git clone https:

5 Nov 28, 2021

A toolkit to automatically crawl the paper list and download paper pdfs of ACL Ahthology.

ACL-Anthology-Crawler A toolkit to automatically crawl the paper list and download paper pdfs of ACL Anthology

9 Oct 9, 2022

Camelot is a Python library that can help you extract tables from PDFs!

A Python library to extract tabular data from PDFs

1.8k Jan 3, 2023

A python library for extracting text from PDFs without losing the formatting of the PDF content.

Multilingual PDF to Text Install Package from Pypi Install it using pip. pip install multilingual-pdf2text The library uses Tesseract which can be ins

49 Nov 7, 2022

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

doc2text doc2text extracts higher quality text by fixing common scan errors Developing text corpora can be a massive pain in the butt. Much of the tex

1.3k Jan 4, 2023

Python library to extract tabular data from images and scanned PDFs

Overview ExtractTable - API to extract tabular data from images and scanned PDFs The motivation is to make it easy for developers to extract tabular d

165 Dec 31, 2022

Extract tables from scanned image PDFs using Optical Character Recognition.

ocr-table This project aims to extract tables from scanned image PDFs using Optical Character Recognition. Install Requirements Tesseract OCR sudo apt

209 Dec 6, 2022

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

14.8k Jan 5, 2023

MkDocs Plugin allowing your visitors to File Print Save as PDF the entire site.

mkdocs-print-site-plugin MkDocs plugin that adds a page to your site combining all pages, allowing your site visitors to File Print Save as PDF th

67 Jan 4, 2023

A library for converting HTML into PDFs using ReportLab

XHTML2PDF The current release of xhtml2pdf is xhtml2pdf 0.2.5. Release Notes can be found here: Release Notes As with all open-source software, its us

2k Dec 27, 2022

Python Pdfs Resources

Python pdfs Libraries

Convert scans of handwritten notes to beautiful, compact PDFs

Awesome-AI-books - Some awesome AI related books and pdfs for learning and downloading

A bulk pdf generator. This application can generate PDFs in bulk by using just one click.

Scans pdfs for links written in plaintext and checks if they are active or returns an error code.

Excalibur: A web interface to extract tabular data from PDFs

A spider for Universal Online Judge(UOJ) system, converting problem pages to PDFs.

pdf_sprinkles: sprinkles text in your PDFs

Reads Data from given Excel File and exports Single PDFs and a complete PDF grouped by Gateway

pikepdf is a Python library for reading and writing PDF files.

Fully automated download and parsing for Texas A&M University's Registrar's grade distribution PDFs for years 2014+.

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Auto Convert PDFs to png files in python

This is the fuzzer I made to fuzz Preview on macOS and iOS like 8years back when I just started fuzzing things.

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Pdfencrypt is a tool to encrypt/lock PDFs

A toolkit to automatically crawl the paper list and download paper pdfs of ACL Ahthology.

Camelot is a Python library that can help you extract tables from PDFs!

A python library for extracting text from PDFs without losing the formatting of the PDF content.

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

Python library to extract tabular data from images and scanned PDFs

Extract tables from scanned image PDFs using Optical Character Recognition.

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

MkDocs Plugin allowing your visitors to File Print Save as PDF the entire site.

A library for converting HTML into PDFs using ReportLab

Python Pdfs Resources

Related tags

Python pdfs Libraries

Convert scans of handwritten notes to beautiful, compact PDFs

Awesome-AI-books - Some awesome AI related books and pdfs for learning and downloading

A bulk pdf generator. This application can generate PDFs in bulk by using just one click.

Scans pdfs for links written in plaintext and checks if they are active or returns an error code.

Excalibur: A web interface to extract tabular data from PDFs

A spider for Universal Online Judge(UOJ) system, converting problem pages to PDFs.

pdf_sprinkles: sprinkles text in your PDFs

Reads Data from given Excel File and exports Single PDFs and a complete PDF grouped by Gateway

pikepdf is a Python library for reading and writing PDF files.

Fully automated download and parsing for Texas A&M University's Registrar's grade distribution PDFs for years 2014+.

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Auto Convert PDFs to png files in python

This is the fuzzer I made to fuzz Preview on macOS and iOS like 8years back when I just started fuzzing things.

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Pdfencrypt is a tool to encrypt/lock PDFs

A toolkit to automatically crawl the paper list and download paper pdfs of ACL Ahthology.

Camelot is a Python library that can help you extract tables from PDFs!

A python library for extracting text from PDFs without losing the formatting of the PDF content.

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

Python library to extract tabular data from images and scanned PDFs

Extract tables from scanned image PDFs using Optical Character Recognition.

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

MkDocs Plugin allowing your visitors to *File Print Save as PDF* the entire site.

A library for converting HTML into PDFs using ReportLab

MkDocs Plugin allowing your visitors to File Print Save as PDF the entire site.