178 words
1 minutes
Image to Text
2019-02-01
2025-01-31

I enjoy the transition to a paperless world, as I find it much easier to organize documents digitally. However, since many documents are still required as hard copies, one is forced to archive them. A tough question then is: How many and which categories of organizers will one need for all documents?

For me, I decided that I do not want any categories and each organizer should be used until it is full.

To be able to find documents, I wrote a Python program that extracts text from scanned documents and stores it in a computer-processable way.

Additionally, each document is stamped with a numbering stamp (which automatically increases its number with each stamping) and the computer-processable data is stored in folders named by the number of the document. So whenever I am looking for a certain document, I type in keywords from the document and the computer provides me with the number of the document and a preview of it. Then I can grab the organizer, which has the number range written on it, and take the document from it.

Jerey
/
image-to-pdf-and-txt
Waiting for api.github.com...
00K
0K
0K
Waiting...

Enjoyed the post? Have questions or feedback? I'd love to hear from you! Feel free to drop me an email at blog@jerey.at.

Image to Text
https://jerey.at/posts/image-2-text/
Author
Anton A. Jerey
Published at
2019-02-01