- Don’t have ChatGPT
- OCR needed
- Preferably Android
Thanks.
Android won’t be easy, but you can slap together a python script that runs tesseract or easyOCR and runs it through a pretrained LLM like T5. Those are well-known and well-documented, so chatGPT can probably write the script for you without too many hiccups.
Good luck trying to install tesseract and a deep learning framework in termux.
chatGPT can probably write the script for you
From OP:
- Don’t have ChatGPT
Well, you give open AI a lot of personal data, so it’s not free from a certain point of view. That may be the reason why OP don’t want to use it.
I read that as either “I don’t have premium” or “I can’t run this data through chatgpt for whatever reason”.
Free chatGPT is viable for writing scripts in any case.
- Download any OCR software from f-droid, or preferred store.
- Copy text.
- Run llama-gpt¹ if you want something self-hosted or any LLM² on huggingface chat if you want ready solution
- Paste text and write something like “summary:” below.
¹Theoretically possible on mobile, but for better performance, run it on PC.
²Default one should do the job.
Disclaimer: I think that it should work, but I haven’t done anything like that before
I have actually tried it, but from doc files on a PC and running python.
My main issue is that the model doing it well need a commercial licence. I have the paygrade to experiment by myself on my work time, but not the one to spend company’s money for it. And IT just signed a contract to get GPT4 has part of bing chat pro
It will be a great deal quicker just to read the damn thing.