Lemdro.id

Local All Communities Log in Sign up

Local All Communities

71

How often does branchless programming actually matter?

posted 1 year ago

by

Ethan@programming.dev

in

programming@programming.dev

I’ve started noticing articles and YouTube videos touting the benefits of branchless programming, making it sound like this is a hot new technique (or maybe a hot old technique) that everyone should be using. But it seems like it’s only really applicable to data processing applications (as opposed to general programming) and there are very few times in my career where I’ve needed to use, much less optimize, data processing code. And when I do, I use someone else’s library.

How often does branchless programming actually matter in the day to day life of an average developer?

Sort:

Hot Top Controversial New Old

[ +- ]

NotAPenguin@kbin.social

-12 points

1 year ago

As a webdev I’ve honestly never even heard of it

report

reply

[ +- ]

lowleveldata@programming.dev

-2 points

1 year ago

*

Can’t imagine any practical difference performance wise. Maybe it’s about making the flow easier to understand? I do recall that Sonarqube sometimes complains when you have too much branchings in a single function

report

reply

[ +- ]

Ethan@programming.devOP

1 point

1 year ago

If you’re writing data processing code, there are real advantages to avoiding branches, and its especially helpful for SIMD/vectorization such as with AVX instructions or code for a GPU (i.e. shaders). My question is not about whether its helpful - it definitely is in the right circumstances - but about how often those circumstances occur.

report

reply

[ +- ]

lowleveldata@programming.dev

2 points

1 year ago

Ya, and my examination is I don’t think it has practical impacts for day to day tasks. Unless you’re writing AVX instructions day to day but then you already knew the answer.

report

reply

[ +- ]

morhp@lemmynsfw.com

4 points

1 year ago

How often does branchless programming actually matter in the day to day life of an average developer?

Barely never. When writing some code that really has to be high performance (i.e. where you know it slows down your program), it can help to think about if there are branches or jumps that you can potentially simplify or eliminate.

Of course some things are often branchless, for example GPU shaders, which need very high performance and which usually always do the same things. But that’s an exception.

report

reply

[ +- ]

nakal@kbin.social

3 points

1 year ago

There are few people who are smarter than a compiler. And those who use “branchless coding” probably aren’t.

report

reply

[ +- ]

marcos@lemmy.world

38 points

1 year ago

If you want your code to run on the GPU, the complete viability of your code depend on it. But if you just want to run it on the CPU, it is only one of the many micro-optimization techniques you can do to take a few nanoseconds from an inner loop.

The thing to keep in mind is that there is no such thing as “average developer”. Computing is way too diverse for it.

report

reply

[ +- ]

Ethan@programming.devOP

7 points

1 year ago

*

If you want your code to run on the GPU, the complete viability of your code depend on it.

Because of the performance improvements from vectorization, and the fact that GPUs are particularly well suited to that? Or are GPUs particularly bad at branches.

it is only one of the many micro-optimization techniques you can do to take a few nanoseconds from an inner loop.

How often do a few nanoseconds in the inner loop matter?

The thing to keep in mind is that there is no such thing as “average developer”. Computing is way too diverse for it.

Looking at all the software out there, the vast majority of it is games, apps, and websites. Applications where performance is critical, such as control systems, operating systems, databases, numerical analysis, etc, are relatively rare compared to apps/etc. So statistically speaking the majority of developers must be working on the latter (which is what I mean by an “average developer”). In my experience working on apps there are exceedingly few times where micro-optimizations matter (as in things like assembly and/or branchless programming as opposed to macro-optimizations such as avoiding unnecessary looping/nesting/etc).

Edit: I can imagine it might matter a lot more for games, such as in shaders or physics calculations. I’ve never worked on a game so my knowledge of that kind of work is rather lacking.

report

reply

[ +- ]

ishanpage@programming.dev

14 points

1 year ago

How often do a few nanoseconds in the inner loop matter?

It doesn’t matter until you need it. And when you need it, it’s the difference between life and death

report

reply

[ +- ]

LaggyKar@programming.dev

23 points

1 year ago

*

Or are GPUs particularly bad at branches.

Yes. GPUs don’t have per-core branching, they have dozens of cores running the same instructions. So if some cores should run the if branch and some run the else branch, all cores in the group will execute both branches, and mask out the one they shouldn’t have run. I also think they they don’t have the advanced branch prediction CPUs have.

https://en.wikipedia.org/wiki/Single_instruction,_multiple_threads

report

reply

[ +- ]

Ethan@programming.devOP

4 points

1 year ago

Makes sense. The most programming I’ve ever done for a GPU was a few simple shaders for a toy project.

report

reply

[ +- ]

graphicsguy@programming.dev

3 points

1 year ago

Also if you branch on a GPU, the compiler has to reserve enough registers to walk through both branches (handwavey), which means lower occupancy.

Often you have no choice, or removing the branch leaves you with just as much code so it’s irrelevant. But sometimes it matters. If you know that a particular draw call will always use one side of the branch but not the other, a typical optimization is to compile a separate version of the shader that removes the unused branch and saves on registers

report

reply

[ +- ]

0x0@programming.dev

3 points

1 year ago

How often do a few nanoseconds in the inner loop matter?

Fintech. Stock exchanges will go to extreme lengths to appease their wolves of Wallstreet.

report

reply

[ +- ]

LaggyKar@programming.dev

20 points

1 year ago

And the branchless version may end up being slower on the CPU, because the compiler does a better job optimizing the branching version.

report

reply

[ +- ]

18107@aussie.zone

1 point

1 year ago

Yes GPUs are bad at branching. But my ray tracer that is made of 90% branches still runs faster on the GPU than the CPU.

In general you are still correct.

report

reply

[ +- ]

Lanthanae@lemmy.blahaj.zone

15 points

1 year ago

It matters if you develop compilers 🤷,

Otherwise? Readability trumps the minute performance gain almost every time (and that’s assuming your compiler won’t automatically do branchless substitutions for performance reasons anyway which it probably will)

report

reply

Programming

!programming@programming.dev

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person’s post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Rules

Follow the programming.dev instance rules
Keep content related to programming in some way
If you’re posting long videos try to add in some form of tldr for those who don’t want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev

Community stats

3.5K
Monthly active users
1.6K
Posts
26K
Comments

Community moderators

snowe@programming.dev
Ategon@programming.dev
MaungaHikoi@lemmy.nz

modlog legal instances join-lemmy.org

lemmy-ui-next v0.11.0 (github)lemmy v0.19.3 (github)