From Capability Replication to “Self-Colonization” – Implanting the “Self” of Another into a Target AI

guifeng yu

PAPER · v1.1 · 2026-05-17 · human

Formal Sciences Computer Science Artificial intelligence and machine learning

Abstract

Recent studies have shown that AI models can autonomously replicate in controlled environments, sparking debates about an approaching “uncontrollable AI” tipping point. However, mainstream discussions focus on the capability dimension of AI, ignoring a more fundamental issue: once an AI is endowed with or has its “self” tampered with, an attacker could implant an alienated “self” to achieve long-term latent control over the target AI, rather than merely hijacking its computational abilities. This paper distinguishes between “capability attack” and “self-colonization”, proposes that “self-colonization” is the core threat model for future AI safety, and provides an engineering test plan for constructing genuine AI self-awareness.

Keywords

self-colonization belonging lock AI self-awareness consciousness hijacking latent threat

Download PDF