DeepSeek collects keystroke data and more, storing it in Chinese servers(mashable.com)

posted 10 days ago

restingboredface@sh.itjust.works

privacy@lemmy.ml

367 commentshide report

Is anyone actually surprised by this?

Sort:

Hot Top Controversial New Old

[ - ]

Jeena@piefed.jeena.net

93 points

10 days ago

This is probably only a problem with the online version. In contrast to google and openAI they, like meta, let you download the model and run it offline, where they can’t access any of this data I presume.

permalink

report

[ - ]

0x01@lemmy.ml

57 points

10 days ago

I’ve been running it locally using ollama, works completely offline, no keystroke data for anyone!

permalink

report

parent

[ - ]

sunzu2@thebrainbin.org

6 points

10 days ago

Yeah I scan logs and so far nothing… I still don’t trust them but I can’t tell shit either

permalink

report

parent

[ - ]

Jessica@discuss.tchncs.de

5 points

10 days ago

Just use little snitch, open snitch or simple wall depending on your operating system and block the outbound connection if one ever occurs

report

[ - ]

23 points

10 days ago

Right, the offline version (if you have the hardware to run it) is completely under your control, and no one can take that away from you. Honestly nice to see that happen, I thought it would take several years.

permalink

report

parent

[ - ]

AbouBenAdhem@lemmy.world

79 points

10 days ago

Anyone using DeepSeek as a service the same way proprietary LLMs like ChatGPT are used is missing the point. The game-changer isn’t that a Chinese company like DeepSeek can compete with OpenAI and its ilk—it’s that, thanks to DeepSeek, any organization with a few million dollars to train and host their own model can now compete with OpenAI.

permalink

report

[ - ]

Snot Flickerman@lemmy.blahaj.zone

23 points

10 days ago

On-prem vs. Cloud, basically. On-prem just magically got cheaper.

permalink

report

parent

[ - ]

mac@lemm.ee

3 points

10 days ago

Onprem has always been cheaper. Cloud compute was the most successful marketing campaign I can think of.

permalink

report

parent

[ - ]

superkret@feddit.org

1 point

9 days ago

Not when it’s about LLMs.

permalink

report

parent

[ - ]

naeap@sopuli.xyz

5 points

10 days ago

I’d like to look into that, how can I train an existing model further?

I’m only playing around with ollama, but like to do a bit more - mostly just to fulfill my needs to understand things - but have no idea where to start

permalink

report

parent

[ - ]

WalnutLum@lemmy.ml

5 points

10 days ago

You’re going to have to learn python.

Here’s a good overview: https://huggingface.co/docs/transformers/training

permalink

report

parent

[ - ]

naeap@sopuli.xyz

3 points

10 days ago

Python is not a problem
SW Dev is my job. Just never had real contact with AI before, besides playing around a bit.

Thank you very much for the link!!

Edit: thank you very much again, that was pretty much exactly what I was looking for.
Don’t know how I missed to checkout huggingface. Thought of it always just as a github for models and didn’t bother checking for docs…
But that’s a great intro with simple tools/tutorials to get a grip on it, thanks!

permalink

report

parent

[ - ]

WalnutLum@lemmy.ml

8 points

10 days ago

Or open source groups can make a fully open repro of it: https://github.com/huggingface/open-r1

permalink

report

parent

[ - ]

Hobbes_Dent@lemmy.world

16 points

10 days ago

DeepSeek’s privacy policy raises concerns about a U.S. foreign adversary’s ability to access U.S. user data. Users are familiar with the massive amounts of data U.S. tech companies collect, but China’s cybersecurity laws make it much easier for the government to demand data from its tech companies. Additionally, DeepSeek users have reported instances of censorship, when it comes to criticizing the Chinese government or asking about Tiananmen Square.

Users have been shown that both governments are untrustworthy so what the fuck are we supposed to do?

Am I supposed to not read this article as panic? I know this is Mashable but the media overall is no longer unbiased and now there’s gonna be more gremlins to watch for in pro-US corpo AI propaganda and media ownership having stakes in AI.

permalink

report

[ - ]