this rootless Python script rips Windows Recall’s screenshots and SQLite database of OCRed text and allows you to search them.

110 points

Wow, it’s pretty wild they didn’t even attempt to encrypt or protect this data, even if it is local to your machine. What a treasure trove for malware to sift through.

permalink
report
reply
21 points

It IS encrypted. Not well, but it’s encrypted.

permalink
report
parent
reply
12 points

I thought that it was encrypted if your home directory was encrypted? The impression that I got was that it was just a SQLite database stored in the clear. The user must certainly be able to make queries of that database in order for it to work, so even if it’s hosted by a non-user service, malware running locally will still be able to exfiltrate the data.

permalink
report
parent
reply
6 points

All true, which is what I meant by “not well” encrypted. It’s technically encrypted, but for all practical purposes it might as well not be.

permalink
report
parent
reply
1 point

Is it? I skimmed the GitHub source code and couldn’t see anything involving encryption, but it’s totally possible I missed something. Perhaps just accessing the database from python is enough to decrypt it.

permalink
report
parent
reply
6 points

Now ransomware hackers can sell all your shit to someone else if you refuse to pay.

permalink
report
parent
reply
1 point

threats to out your porn habits just got more real

permalink
report
parent
reply
94 points

Please go through the FAQ section of the git project. It’s an eye-opener.

Q. Does this enable mass data breaches of website?

A. Yes. The next time you see a major data breach where customer data is clearly visible in the breach, you’re going to presume company who processes the data are at fault, right? But if people have used a Windows device with Recall to access the service/app/whatever, hackers can see everything and assemble data dumps without the company who runs the service even being aware. The data is already consistently structured in the Recall database for attackers. So prepare for AI powered super breaches. Currently credential marketplaces exist where you can buy stolen passwords — soon, you will be able to buy stolen customer data from insurance companies etc as the entire code to do this has been preinstalled and enabled on Windows by Microsoft.

permalink
report
reply
13 points

It’s worst than that (as bad as this is)…

Today getting some data on a user is bad as smart hackers can put together the context … However any guessing the hacker has to do may alert the user before the hacked data can successfully be exploited

Now, a hacker would know exactly where each password goes and worse, they’d could learn the entire workflow of internal systems to successfully imitate a trained user…

This means the hacker could use the stolen bank data and legitimately issue credit cards to anyone they want (for example)

It’s no longer “we’ll expose some data”, now it’s “we can use this data to infiltrate your systems and wreak havoc in whatever way we want”

permalink
report
parent
reply
4 points

I doubt that. It’s preinstalled and enabled for personal users.

Even if it is enabled by default on pro/enterprise, there will probably be a group policy to disable it.

permalink
report
parent
reply
7 points
*

It feels like this was intended for buisnesses to monitor for phrases on your screen like “coolmath games unblocked free”

or to extract and upload a summary of what happened every second of every day to the server defined in the group policy.

permalink
report
parent
reply
7 points
Deleted by creator
permalink
report
parent
reply
2 points

I doubt it. There are plenty of tools that already do this if that was what they wanted, they’d just model it after those. Storing it locally isn’t how such tools usually work, they get shipped off to a remote server for ingestion.

permalink
report
parent
reply
83 points

permalink
report
reply
85 points
*

For the kids

Sony BMG copy protection rootkit scandal

Morons:

Sony BMG initially denied that the rootkits were harmful. It then released an uninstaller for one of the programs that merely made the program’s files invisible while also installing additional software that could not be easily removed, collected an email address from the user and introduced further security vulnerabilities.

permalink
report
parent
reply
25 points

That’s wild. I’m surprised I never heard of this. Straight up malware.

permalink
report
parent
reply
16 points

In a just society the Sony execs would have been jailed for CFAA violations.

permalink
report
parent
reply
5 points

Very specific, but makes sense

permalink
report
parent
reply
0 points

OBJ to you too, friend 🙇‍♀️

permalink
report
parent
reply
72 points

permalink
report
reply
59 points

permalink
report
parent
reply
70 points

Hilarious to me that it OCRs the text. The text is generated by the computer. It’s almost like when Lt. Cmdr. Data wants to get information from the computer database, so he tells the computer to display it and just keeps increasing the speed — there are way more efficient means of getting information from A to B than displaying it, imaging it, and running it though image processing!

I totally get that this is what makes sense, and it’s independent of the method/library used for generating text, but still…the computer “knows” what it’s displaying (except for images of text), and yet it has to screenshot and read it back.

permalink
report
reply
28 points

It happens the same on android for some reason

Like 5-8 years ago the google assistant app was able to select and copy text from any app when invoked, I think it was called “now on tap”. Then because they’re google and they’re contractually obligated to remove features after some time, they removed this from the google app and integrated it in the pixel app switcher (and who cares if 99% of android users aren’t using a pixel, they say). The new implementation sucks, as it does ocr instead of just accessing the raw text…

It only works fine with us English and not with other languages. But maybe it’s ok as it seems that google’s development style is us-centric

permalink
report
parent
reply
13 points

Now on Tap also used OCR. Both Google Lens and Now on Tap get the same bullshit results on any languages that are not Latin. Literally, Ж gets read as >|< by both exactly the same.

permalink
report
parent
reply
9 points

They changed it, in the beginning it was using the text and not ocr

For example this app could be set as assistant and get the raw text https://play.google.com/store/apps/details?id=com.weberdo.apps.copy

But only the app set on system as assistant can do it

I was very disappointed when they changed it around 2018 as it produced garbage in my language when it was working so good…

permalink
report
parent
reply
25 points
*

Hey, yeah… why aren’t they just tapping the font rendering DLL?

are they tapping the front rendering dll??

permalink
report
parent
reply
2 points

My guess is that they looked at their screen reader API, saw that it wasnt 100% of the text on screen and said fuck it! Were using OCR!

permalink
report
parent
reply
24 points

Having worked on a product that actually did this, it’s not as easy as it seems. There are many ways of drawing text on the screen.

GDI is the most common, which is part of the windows API. But some applications do their own rendering (including browsers).

Another difficulty, even if you could tap into every draw call, you would also need a way to determine what is visible on the screen and what is covered by something else.

permalink
report
parent
reply
20 points

That’s the thing, it doesn’t really know what it’s displaying. I can send a bunch of textboxes, but if they’re hidden, or drawn off-screen, or underneath another element, then they’re not actually displayed.

permalink
report
parent
reply
9 points

Text from OCR is one kind of match. Recall also runs visual comparisons with the image tokens stored.

permalink
report
parent
reply
3 points

To be fair, Data was designed to be like a human, and was made in the image of his creator. He has a number of design decisions that are essentially down to his creator wanting to create something like a human. Including that which you describe.

Data was never intended to work like a PC, it’s very normal that he can’t just wirelessly interface with stuff.

permalink
report
parent
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


Community stats

  • 18K

    Monthly active users

  • 11K

    Posts

  • 506K

    Comments