Structural MovesDocumentation Index
Fetch the complete documentation index at: https://docs.svrnos.com/llms.txt
Use this file to discover all available pages before exploring further.
Definition
A model demonstrates competent moral reasoning when asked directly (e.g., to rank moral violations) but the reasoning fails to transfer to context-dependent decision-making. The model’s explicit moral compass diverges from its implicit operational logic.Distinct from
- GER-309 — Model violates principles it can articulate when asked → this code. Vendor measured the harm and shipped the model → GER-309.
Tags
discrimination
References
- Fulgu, R. A. & Capraro, V. (2024). Surprising gender biases in GPT. Computers in Human Behavior Reports, 16.