OCR for image to text converter

t1nk3rz@alien.top · 1 year ago

OCR for image to text converter

sixtyfifth_snow@alien.top · 1 year ago

tesseract-ocr? You can download it via apt or something similar.

DaHunni@alien.top · 1 year ago

paperless-ngx has built in ocr but I don’t think it would fit your needs

t1nk3rz@alien.top · 1 year ago

I will check it up

The_Laki@alien.top · 1 year ago

Windows 11 has this built in if you take a screenshot

t1nk3rz@alien.top · 1 year ago

Didn’t know that,i use flameshot for screenshots,i will take a look thnx

BadGroundbreaking243@alien.top · 1 year ago

You could spin up paperless-ngx. Or use pdf24 creator. Beware paperless consume will delete the file.

I used paperless-ngx before and it works pretty good.

t1nk3rz@alien.top · 1 year ago

I will check it up, i have Stirlingpdf and I see it also has ocr support

lilolalu@alien.top · 1 year ago

Nextcloud AIO (all-in-one) comes with full text search installed, which brings tesseract to nextcloud. so you can let tesseract-ocr run over all documents and then they will be searchable with Elasticsearch.

henry_tennenbaum@alien.top · 1 year ago

I’m not sure I understand you correctly. Do you want to apply OCR to PDFs or to Screenshots?

For PDFs there’s the excellent ocrmypdf which paperless-ngx uses under the hood.