Screenshot OCR Tool
Extracting data from screen time screenshots in iOS and Android. We are exploring 3 options:
- Simple OCR with no text position using
pytesseract
andOpenCV
. We can then try and extract info withregex
- Extract text and its position from each screenshot, classify data according to its position in the screenshot
- Use YOLOv4 to extract some features from the screenshot and then use those features to train a ML model.
Instructions
So far there is not much to do really:
- Add your screenshots in each folder
- Run the script and wait for the
tkinter
window to show up - The panel on the right lets you explore the text extracted by
pytesseract
- Clicking on each top-level text in the tree view will highlight the text on the screenshot in red