1. Don’t have ChatGPT
2. OCR needed
3. Preferably Android

Thanks.

7 points

Android won’t be easy, but you can slap together a Python script that runs Tesseract or EasyOCR and feeds the output through a pretrained LLM like T5. Those are well-known and well-documented, so ChatGPT can probably write the script for you without too many hiccups.
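For what it's worth, here is a rough sketch of what such a script might look like. It assumes the Tesseract binary is installed along with the `pytesseract`, `Pillow`, `transformers`, `torch` and `sentencepiece` packages; the `t5-small` checkpoint, the 350-word chunk size and the function names are just illustrative choices, not something tested on Android.

```python
"""Sketch: OCR an image with Tesseract, then summarize the text with a pretrained T5 model."""


def chunk_words(text, max_words=350):
    """Split text into pieces of at most max_words words (a rough guard for T5's input limit)."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def ocr_image(path):
    """Extract text from an image file. Imports are local so the module loads without them."""
    from PIL import Image
    import pytesseract
    return pytesseract.image_to_string(Image.open(path))


def summarize(text, model_name="t5-small"):
    """Summarize each chunk with a T5 checkpoint and join the partial summaries."""
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    parts = []
    for chunk in chunk_words(text):
        # T5 expects a task prefix; "summarize: " selects the summarization task.
        inputs = tokenizer("summarize: " + chunk, return_tensors="pt",
                           truncation=True, max_length=512)
        output_ids = model.generate(**inputs, max_new_tokens=80)
        parts.append(tokenizer.decode(output_ids[0], skip_special_tokens=True))
    return " ".join(parts)
```

You would call it as `print(summarize(ocr_image("page.jpg")))`, where `page.jpg` is a placeholder file name. Summarizing chunk by chunk is the simplest way around the input limit, though the joined result can read a bit choppy.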

0 points

And you can run that in Termux, so you can use it on Android.

2 points

Good luck trying to install Tesseract and a deep learning framework in Termux.

2 points

You can’t tell me what to do! Just watch me

6 points

chatGPT can probably write the script for you

From OP:

  1. Don’t have ChatGPT
1 point

I’m guessing they meant they don’t want to use ChatGPT, considering it’s free.

1 point

Well, you give OpenAI a lot of personal data, so it’s not free from a certain point of view. That may be the reason why OP doesn’t want to use it.

5 points

I read that as either “I don’t have premium” or “I can’t run this data through ChatGPT for whatever reason”.

Free ChatGPT is viable for writing scripts in any case.

2 points

Yeah, maybe they don’t have API access. I didn’t think about it that way.

12 points
  1. Download any OCR app from F-Droid or your preferred store.
  2. Copy the text.
  3. Run llama-gpt¹ if you want something self-hosted, or any LLM² on Hugging Face chat if you want a ready-made solution.
  4. Paste the text and write something like “summary:” below it.

¹Theoretically possible on mobile, but for better performance, run it on PC.

²Default one should do the job.

Disclaimer: I think it should work, but I haven’t done anything like that before.
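If you end up scripting step 4 rather than pasting by hand, a small helper along these lines could tidy the OCR output and append the “summary:” cue. This is only a sketch: the function names and the 6000-character cap are made up for illustration, and the right cap depends on the context window of whichever model you use.

```python
import re


def clean_ocr_text(raw):
    """Undo common OCR line-break damage: rejoin split words and collapse whitespace."""
    # "docu-\nment" -> "document" (note: this also merges real hyphenated
    # compounds that happen to break at a line end)
    text = re.sub(r"-\s*\n\s*", "", raw)
    # newlines and runs of spaces -> a single space
    text = re.sub(r"\s+", " ", text)
    return text.strip()


def build_summary_prompt(raw, max_chars=6000):
    """Return the cleaned text, cut to max_chars, followed by a 'summary:' cue."""
    text = clean_ocr_text(raw)[:max_chars]
    return f"{text}\n\nsummary:"
```

The returned string is what you would paste (or send) to the model in place of the raw OCR text.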

2 points

I have actually tried it, but with doc files on a PC, running Python.

My main issue is that the model that does it well needs a commercial licence. I have the pay grade to experiment by myself on work time, but not the one to spend the company’s money on it. And IT just signed a contract to get GPT-4 as part of Bing Chat Pro.

3 points

It would probably just be easier to read it and write a summary? Or try this

24 points

It will be a great deal quicker just to read the damn thing.


Asklemmy

!asklemmy@lemmy.ml

A loosely moderated place to ask open-ended questions
