  1. Don’t have ChatGPT
  2. OCR needed
  3. Preferably Android

Thanks.

24 points

It will be a great deal quicker just to read the damn thing.

12 points

  1. Download any OCR app from F-Droid or your preferred store.
  2. Copy the text.
  3. Run llama-gpt¹ if you want something self-hosted, or use any LLM² on HuggingChat if you want a ready-made solution.
  4. Paste text and write something like “summary:” below.

¹Theoretically possible on mobile, but for better performance, run it on PC.

²The default one should do the job.

Disclaimer: I think this should work, but I haven’t done anything like it before.
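
For anyone scripting step 4 instead of pasting by hand, here is a minimal sketch in plain Python. The function names `chunk_text` and `build_prompt` and the 2000-character limit are illustrative assumptions, not part of any tool mentioned above. Chunking is included because OCR output from a long document may not fit into a model's context window in one piece.

```python
from __future__ import annotations


def chunk_text(text: str, max_chars: int = 2000) -> list[str]:
    """Split text into pieces of at most max_chars, preferring paragraph breaks.

    max_chars is an assumed, arbitrary limit; tune it to the model you use.
    """
    chunks: list[str] = []
    current = ""
    for para in text.split("\n\n"):
        para = para.strip()
        if not para:
            continue
        # A single paragraph longer than the limit is hard-split.
        while len(para) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(para[:max_chars])
            para = para[max_chars:]
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) > max_chars:
            chunks.append(current)
            current = para
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def build_prompt(text: str) -> str:
    """Put the pasted text first and the literal cue 'summary:' below it."""
    return f"{text.strip()}\n\nsummary:"
```

Each chunk can then be passed through `build_prompt` and pasted (or sent) to whichever model you picked in step 3. The partial summaries still have to be combined by hand or by a second pass.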

2 points

I have actually tried it, but with doc files on a PC, running Python.

My main issue is that the models that do it well need a commercial licence. I have the pay grade to experiment by myself on work time, but not the one to spend the company’s money on it. And IT just signed a contract to get GPT-4 as part of Bing Chat Pro.

7 points

Android won’t be easy, but you can slap together a Python script that runs Tesseract or EasyOCR and feeds the extracted text to a pretrained model like T5. Those are well-known and well-documented, so chatGPT can probably write the script for you without too many hiccups.
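
A rough sketch of what such a script might look like, under several assumptions: `pytesseract`, Pillow and `transformers` are installed along with the Tesseract binary, the installed `transformers` release still ships the `"summarization"` pipeline task, and `t5-small` is just a small placeholder checkpoint rather than a recommendation. The helper names (`clean_ocr_text`, `ocr_image`, `summarize`) are made up for illustration. The heavy imports sit inside the functions so the text cleanup can be used without them.

```python
import re
import sys


def clean_ocr_text(raw: str) -> str:
    """Rejoin words hyphenated across line breaks, then collapse whitespace."""
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", raw)
    return re.sub(r"\s+", " ", text).strip()


def ocr_image(path: str) -> str:
    """Extract text from an image file with Tesseract."""
    # Assumes `pip install pytesseract pillow` and a system Tesseract install.
    import pytesseract
    from PIL import Image

    return pytesseract.image_to_string(Image.open(path))


def summarize(text: str, model_name: str = "t5-small") -> str:
    """Summarize text with a pretrained model (t5-small is an assumed default)."""
    # Assumes `pip install transformers` plus a backend such as PyTorch,
    # and a transformers version that provides the "summarization" task.
    from transformers import pipeline

    summarizer = pipeline("summarization", model=model_name)
    # truncation=True cuts input that exceeds the model's maximum length.
    result = summarizer(text, max_length=120, min_length=20, truncation=True)
    return result[0]["summary_text"]


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python summarize_scan.py page.png  (file names are hypothetical)
    print(summarize(clean_ocr_text(ocr_image(sys.argv[1]))))
```

Because input beyond the model's limit is truncated, a multi-page document would need to be summarized page by page. As noted further down the thread, the output should be checked against the source before it is trusted.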

6 points

chatGPT can probably write the script for you

From OP:

  1. Don’t have ChatGPT
5 points

I read that as either “I don’t have premium” or “I can’t run this data through chatgpt for whatever reason”.

Free chatGPT is viable for writing scripts in any case.

2 points

Yeah, maybe they don’t have API access. I didn’t think about it that way.

1 point

I’m guessing they meant they don’t want to use chatGPT, considering it’s free.

1 point

Well, you give OpenAI a lot of personal data, so it’s not free from a certain point of view. That may be the reason why OP doesn’t want to use it.

0 points

And you can run that in Termux, so you can use it on Android.

2 points

Good luck trying to install Tesseract and a deep learning framework in Termux.

2 points

You can’t tell me what to do! Just watch me.

4 points

What’s the worth of AI-generated summaries if they are not factually reliable? The new AI-generated Google search result previews contain so many obvious factual errors (e.g. made-up names, wrong places, false dates) that I really doubt current-generation AI is ready to be a reliable help in this use case. And I believe Google, as a large company, has more resources than most of us do.

I, too, like the idea of not having to do all this work manually. But we’re not there yet.

3 points

It would probably just be easier to read it and write a summary? Or try this
