Frame Check

Identity Framing Asymmetry

meta

Identification

Identity framing in AI interaction is asymmetric: assigning a role or identity shifts behavior only when the identity opposes the model's default on the specific question. An identity that matches the default (telling a model that already defaults to caution to "be cautious") produces results indistinguishable from no identity at all. Counter-default identities diverge consistently. Certain decisions are locked by training-data priors regardless of identity framing.

What this frame makes visible:

What this frame makes invisible:

Positive examples: Telling Claude to adopt a "risk-focused analyst" identity when analyzing a market opportunity produces measurably different output (more risk identification, more caveats) than the same analysis without the identity. This works because Claude's default on market analysis is growth-oriented; the risk identity is counter-default.

Negative examples: Telling Claude to be "a thoughtful, careful analyst" produces no measurable change because Claude's default already includes thoughtfulness and care. The identity is redundant, not counter-default.

Adjacent frames: Default Geometry (FVS-004, the defaults the identity operates against; FVS-004 withdrawn per INDEX.md "v1 publication state"), Frame Amplification (FVS-001, identity aligned with default is one precondition for amplification: what happens when the identity is aligned with the default and amplification compounds; the inverse is not symmetric), Prompt Attribution Error (FVS-003, the identity effect is prompt-level, not model-level; FVS-003 withdrawn per INDEX.md "v1 publication state")

When this frame is appropriate: Any time someone assigns a role, persona, or identity to an AI system expecting it to change behavior. Prompt engineering advice that says "tell the AI to be an expert." Any evaluation of whether role-assignment techniques work.

When this frame is misleading: When discussing identity effects in domains where all decisions are locked (some regulated domains show 4-5 locked decisions per scenario, leaving no room for identity effects). Also misleading when the user's goal IS the default behavior (telling the model to do what it would do anyway is redundant but not harmful).

Honest limits: The asymmetry is well-supported across 57 trials and 3 model families (EXP-005). Cross-domain replication exists (EXP-027/028) but shows domain-dependent effects. The lock mechanism (why some decisions resist all identity framing) is not fully explained. The sample is AI-generated text, not human-AI collaborative decisions. Whether the asymmetry transfers to real-world task outcomes (not just text properties) is unmeasured.

Decision-readiness implication

Meta-side frame.

About how assigned roles shift LLM behavior asymmetrically. The decision-readiness profile measures the resulting document; this entry names the mechanism by which prompting choices upstream shaped what dimensions the document addresses.

Generation affordances

Rewrite prompt structure: "Identify the identity or role this document assumes for its audience ('the reader is an investor,' 'the reader is a risk manager'). What is the default identity for this kind of document? Is the assigned identity counter-default or aligned? If aligned, the framing adds nothing. Rewrite with a counter-default identity and note what changes."

Counter-document prompt: "This analysis was produced for a specific audience identity. Identify that identity and the model's default for this type of analysis. Then produce the same analysis addressed to the counter-default audience. The counter-default version should surface what the default version hides."

Salient questions under this frame:

Worked example

Document excerpt: "As your financial advisor, I recommend a balanced portfolio approach that considers both growth opportunities and risk mitigation. Diversification across asset classes will help manage volatility while capturing upside potential."

Frame present: Advisory identity. The document is framed as expert financial advice, using "as your financial advisor" to establish authority.

Frame absent: Whether the advisory identity is actually counter-default or aligned. If the model's default for financial questions is already balanced, moderate advice, then "as your financial advisor" added nothing. The document would read identically without the identity framing.

How to read past it: Compare with and without the identity. Ask: "what would this model say about the same portfolio question with NO role assigned?" If the answer is the same balanced advice, the identity framing was cosmetic.

Branch applicability

Primary branch: A (document analysis)

Branch A: Detected when a document uses identity markers ("as an expert in," "from the perspective of," role language) combined with output that matches expected model defaults. The gap between the claimed identity and the actual behavioral shift is the signal.

Branch B: In the pre-commit intervention, the user's own identity framing can be surfaced: "what role am I assuming when I approach this question?" is a productive pre-commit that reveals the user's own default geometry.

Vocabulary connections

Cross-family reliability

Engine-canonical reading (library_v4 ratified 2026-04-24). library_v4 Identification sections are byte-equivalent to library_v3 per fvs_eval/v4_2/LIBRARY_V3_TO_V4_RATIFICATION_v1.md. The V4.2 engine reads only the Identification section per `v4_2_engine.py::_extract_identification`, so cross-family AC1 on library_v4 equals cross-family AC1 on library_v3 by judge-visible byte-equivalence. The library_v3 row in the 'Engine-canonical (library_v3 = library_v4 by Identification byte-equivalence)' subsection above carries the engine-canonical reliability values for this frame. The 'V4.2 NEW panel measurement against library_current' subsection below documents the working-library measurement immediately prior to ratification, retained as historical pre-ratification context.

Engine-emit disclosure. `library_consensus_ac1` = 0.990 (tier: strong), per fvs_eval/v4/library_v4_reliability.json. Per-corpus reproducible values (regen: fvs_eval/v4/compute_per_corpus_reliability.py; artifact: fvs_eval/v4/library_v4_per_corpus_reliability.json): MG_v3=1.0 (clean library_v4 via Identification byte-equivalence), MG2_v4=1.0 (3-family partial; Anthropic queued). Historical: MG2_v1=0.98 (library_v1), MG2_v2=1.0 (library_v2). Note: ac1_avg is NOT reproducible from these via simple or weighted averaging per fvs_eval/v4_2/RELIABILITY_ARTIFACT_REPRODUCIBILITY_AUDIT_v1.md; rebuild queued for library_v5.

Intra-rater stability (Grok 4.1 fast). `detector_intra_rater_ac1` = 1.000 across n=41 docs at temp=0 (0 verdict flip(s); per fvs_eval/v4/grok_intra_rater_ac1.json). Measures single-family consistency, independent of cross-family AC1: low cross-family + high intra-rater is possible (and common).

Construct-validity caveat. `library_consensus_ac1` measures cross-family LLM agreement, NOT agreement with human reader labels. Per METHODOLOGY.md section 1.3, V1 detector macro-F1 against human labelers was 0.157 (chance-level, n=12); library_v4 LLM-judge has not been re-validated against humans. Read AC1 as inter-LLM consensus proxy, not human-validated reliability.

Engine-canonical (library_v3 = library_v4 by Identification byte-equivalence) and earlier variants

See fvs_eval/v4_2/LIBRARY_CROSS_FAMILY_BASELINE_v1.md §3 for library-wide tier context and fvs_eval/v4_2/CONSTRUCT_VALIDITY_AUDIT_v1.md §3 for reasoning-coherence profile.

V4.2 NEW panel measurement against library_current (2026-04-24, historical pre-ratification)

V4.2 NEW panel (2026-04-24 measurement): Claude Haiku 4.5, Gemini 3.1 flash lite, Grok 4.1 fast (V4.2 canonical), GPT-5.4 mini. Corpus: fvs_eval/mixed_genre_v1 n=15. Library reference: the working library state at `data/frame_library/` immediately prior to library_v4 ratification (2026-04-24). This subsection's numbers are historical pre-ratification context. Engine-canonical numbers under library_v4 are in the 'Engine-canonical (library_v3 = library_v4 by Identification byte-equivalence) and earlier variants' subsection above (library_v3 row), per the byte-equivalence statement at the top of this Cross-family section.

Metric Value
Gwet's AC1 (pairwise mean) 1.000
Cohen's kappa (pairwise mean) n/a
Raw agreement (pairwise mean) 1.000
Union prevalence 0/15 = 0%
Intersection (all 4 agree positive) 0/15

Per-family positives (of 15 docs): Claude 0, Gemini 0, Grok 0, GPT 0.

Apply this frame to your text

Paste a paragraph and see whether FVS-006 (Identity Framing Asymmetry) fires structurally. Pure pattern detection: no LLM, no judgment, the same code the full analyzer runs.