this post was submitted on 14 Jul 2023
69 points (96.0% liked)

Showerthoughts

30153 readers
2110 users here now

A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. A showerthought should offer a unique perspective on an ordinary part of life.

Rules

  1. All posts must be showerthoughts
  2. The entire showerthought must be in the title
  3. Avoid politics
    • 3.1) NEW RULE as of 5 Nov 2024, trying it out
    • 3.2) Political posts often end up being circle jerks (not offering unique perspective) or enflaming (too much work for mods).
    • 3.3) Try c/politicaldiscussion, volunteer as a mod here, or start your own community.
  4. Posts must be original/unique
  5. Adhere to Lemmy's Code of Conduct

founded 2 years ago
MODERATORS
 

I'm sure there are some AI peeps here. Neural networks scale with size because the number of combinations of parameter values that work for a given task scales exponentially (or, even better, factorially if that's a word???) with the network size. How can such a network be properly aligned when even humans, the most advanced natural neural nets, are not aligned? What can we realistically hope for?

Here's what I mean by alignment:

  • Ability to specify a loss function that humanity wants
  • Some strict or statistical guarantees on the deviation from that loss function as well as potentially unaccounted side effects
you are viewing a single comment's thread
view the rest of the comments
[–] fubo 18 points 2 years ago* (last edited 2 years ago) (3 children)

Some of the human-alignment projects look like "religions" and some look like "economies" and some look like "just talking to each other and trying to be halfway decent folks and not flipping out or some shit".

Heck, arguably the United Nations is a human-alignment project for x-risk mitigation.

[–] [email protected] 4 points 2 years ago (1 children)

Mmmm, agents training each other. Very Deepmind of you to mention that.

[–] fubo 1 points 2 years ago

If you were doing your job and reading some web site, and you happened to notice that there were posts on that site containing child porn, wouldn't you hit the "report" button too?

[–] [email protected] 3 points 2 years ago (1 children)

Some of the human-alignment projects

And some look like "I flip shit bigger, align with me or I will flip your shit"

[–] Eylrid 1 points 2 years ago

The fear of general super AI is that it will have the power to be the biggest shit flipper ever.

[–] DeVaolleysAdVocate 1 points 2 years ago

We'd like to bring all those and their existing versions together with the A-Better-World Consensus-Engine idea.

Tell me more about some of these other projects though please.