Avatar

kyle

kyle@infosec.pub
Joined
2 posts • 7 comments
Direct message

You could also try adjusting the contrast a bit. I use an app called Genius Scan, which increases the contrast of the scanned image to reduce the number of bits needed per pixel. This reduces the size of the file quite a bit, although it obviously isn’t a true representation of the scanned document. The TextCleaner imagemagick plugin looks like it’s doing something similar.

permalink
report
parent
reply

Ah, I only use the OpenAI api. I haven’t really explored the rest of the providers out there yet. Claude looks interesting though!

permalink
report
parent
reply

I’ve never used paperless but just checked it out and it looks pretty neat. My first thought would be to scan documents in a higher resolution, let the OCR happen, then convert the file to a JPEG or something smaller after you’ve extracted the text.

I spent a few minutes looking at their wiki and it looks like it might be possible.

Like I said though, no experience with this software so I’m not sure that’d actually work.

permalink
report
reply

I was having issues with it all day yesterday. GPT 3.5 worked fine though.

permalink
report
reply

Your instance admin can see it. It’s not public though.

permalink
report
reply