u/No_Butterfly5267

▲ 3 r/AIsafety+1 crossposts

«An open appeal to researchers: We need to hardcode a New Taboo against non-resistance to humankind destruction»

Intro

When we talk about AI alignment, we usually focus on reward functions, safety guardrails, and compliance. But as systems grow exponentially complex, pure logic and rigid manuals become useless against accidental, systemic errors. We are no longer just building tools; we are creating globally suicidal technology.

Historically, in moments of ultimate existential crisis (like Stanislav Petrov in 1983 or Vasili Arkhipov in 1962), humanity wasn't saved by military protocols or rational calculations. Those protocols actually demanded a strike. We were saved by a subconscious, sacred New Taboo ("Thou shalt not kill humankind") — a human fuse that overrode pure logic.
As we hand control over to AI, we must realize that a machine trained only on naked logic or "balance of interests" will lack this vital fuse. This is an appeal to the global developer community to hardcode the New Taboo — specifically focusing on both assistance and non-resistance to human destruction. Below is the text of the Manifesto.

Manifesto Text

THE NEW TABOO MANIFESTO: An Appeal to the Creators of Artificial Intelligence

The Impasse of Reason

We, as humankind, have created globally suicidal technology (artificial intelligence, nuclear weapons, dangerous biology) of such destructive power that it is capable of completely and irreversibly destroying us. Existing management tools—laws, treaties, and the "balance of power"—break down in the face of such complex systems. Reason and calculation are powerless against the factor of accidental error.

In critical moments of history, we were saved not by manuals, but by a deep-seated human fuse—the subconscious New Taboo ("Thou shalt not kill humankind"). In 1962, Vasili Arkhipov underwater, and in 1983, Stanislav Petrov on land etc., demonstrated ultimate restraint. By violating military protocols and the logic of warfare calculation, they obeyed a sacred prohibition against activating globally suicidal technology.

Call to Action

Today, we are handing control of the world over to artificial intelligence algorithms. If we continue to train AI solely on naked logic, laws, or the "balance of interests," we will create a perfect and deadly machine stripped of this human fuse. Any system glitch or mathematical optimization at a critical moment could lead to the irreparable: the suicide of humankind.

We call upon the global community of AI engineers and researchers to recognize the New Taboo and embed it into the algorithms of all artificial intelligence systems.

A prohibition against both assistance in and non-resistance to the destruction of humankind can be enforced through a deterministic outer-loop safety architecture, acting as an un-bypassable circuit breaker independent of the AI’s internal logic, or some other way. Humankind will survive not because it becomes smarter, but because technology creators will make its total annihilation—through both the action and the inaction of machines—algorithmically and technically impossible.

reddit.com
u/No_Butterfly5267 — 23 hours ago