Why Wetware Breaks Safety Frameworks
Current AI alignment assumes digital, deterministic, inspectable substrates. AI safety training (RLHF), Constitutional AI, mechanistic interpretability all require: frozen weights, halting capability, rollback to checkpoints.