The problem with AI alignment is that humans aren't aligned

posted 1 year ago

I’m sure there are some AI peeps here. Neural networks get better as they get bigger because the number of parameter combinations that work for a given task grows exponentially (or, even better, factorially) with the network size. How can such a network be properly aligned when even humans, the most advanced natural neural nets, are not aligned with each other? What can we realistically hope for?
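One way to read the "factorially" part is permutation symmetry: shuffling the hidden units of a layer (along with their incoming and outgoing weights) gives a different parameter vector that computes exactly the same function, so a layer with n hidden units has n! equivalent settings. Here is a minimal sketch of that, assuming a tiny one-hidden-layer tanh network. The weights and the `mlp` helper are made up for illustration, and this only counts relabelings of a single solution; it is not a proof of why scaling works.

```python
import itertools
import math


def mlp(x, w1, b1, w2):
    """One-hidden-layer network: scalar input, tanh hidden units, scalar output."""
    hidden = [math.tanh(w * x + b) for w, b in zip(w1, b1)]
    return sum(v * h for v, h in zip(w2, hidden))


# Hypothetical weights for a network with 4 hidden units.
w1 = [0.5, -1.2, 0.8, 2.0]   # input -> hidden weights
b1 = [0.1, 0.0, -0.3, 0.7]   # hidden biases
w2 = [1.5, -0.4, 0.9, 0.2]   # hidden -> output weights

inputs = [-1.0, 0.0, 0.5, 2.0]
reference = [mlp(x, w1, b1, w2) for x in inputs]

# Try every reordering of the hidden units and count how many of them
# reproduce the reference outputs (up to floating-point rounding).
equivalent = 0
for perm in itertools.permutations(range(len(w1))):
    pw1 = [w1[i] for i in perm]
    pb1 = [b1[i] for i in perm]
    pw2 = [w2[i] for i in perm]
    outputs = [mlp(x, pw1, pb1, pw2) for x in inputs]
    if all(math.isclose(a, b, rel_tol=1e-9, abs_tol=1e-12)
           for a, b in zip(reference, outputs)):
        equivalent += 1

print(equivalent, math.factorial(len(w1)))  # prints: 24 24
```

With 4 hidden units that is already 24 distinct parameter settings for one and the same function, and the count grows as n! with the layer width.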

Here’s what I mean by alignment:

Ability to specify a loss function that humanity wants
Some strict or statistical guarantees that bound the deviation from that loss function, as well as any potentially unaccounted-for side effects


milicent_bystandr@lemmy.ml

3 points

1 year ago

Some of the human-alignment projects

And some look like “I flip shit bigger, align with me or I will flip your shit”


Showerthoughts

!showerthoughts@lemmy.world
