You could also try adjusting the contrast a bit. I use an app called Genius Scan, which increases the contrast of the scanned image to reduce the number of bits needed per pixel. This reduces the size of the file quite a bit, although it obviously isn’t a true representation of the scanned document. The TextCleaner imagemagick plugin looks like it’s doing something similar.
- 2 Posts
- 9 Comments
Ah, I only use the OpenAI api. I haven’t really explored the rest of the providers out there yet. Claude looks interesting though!
I’ve never used paperless but just checked it out and it looks pretty neat. My first thought would be to scan documents in a higher resolution, let the OCR happen, then convert the file to a JPEG or something smaller after you’ve extracted the text.
I spent a few minutes looking at their wiki and it looks like it might be possible.
Like I said though, no experience with this software so I’m not sure that’d actually work.
I was having issues with it all day yesterday. GPT 3.5 worked fine though.
kyleto Asklemmy@lemmy.ml•Is your e-mail address private on Lemmy or are other users able to see it somehow?55·2 years agoYour instance admin can see it. It’s not public though.
kyleOPto Showerthoughts@lemmy.world•In the future bots will have CAPTCHAs to keep humans out of their communities20·2 years agoYou have 50 milliseconds…
Thanks for linking this!
I listened to your most recent podcast and enjoyed it! Excited for the next episode.
I’m getting errors with the link you posted.
Here’s one from the Wayback Machine
https://web.archive.org/web/20190905152042/http://truegamer.net/SA_911/911 SATHREAD/index.html