一键导入
airs-appropriate-reliance
Domain knowledge for AI adoption measurement, psychometric instrument development, and appropriate reliance research
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
菜单
Domain knowledge for AI adoption measurement, psychometric instrument development, and appropriate reliance research
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
基于 SOC 职业分类
Create and maintain ASCII visual dashboards for project tracking with parallel lane progress bars
Store and manage voice samples for TTS cloning — portable, version-controlled audio references
Clear documentation through visual excellence
AI music generation via Replicate — 5 models for background tracks, lyrics, and sound design
Practitioner methodology for longitudinal case study research, evidence-based documentation, and publication-ready academic writing in AI-assisted development.
First impressions matter. Set projects up for success.
| name | airs-appropriate-reliance |
| description | Domain knowledge for AI adoption measurement, psychometric instrument development, and appropriate reliance research |
| tier | extended |
| applyTo | **/*airs*,**/*reliance*,**/*adoption*,**/*utaut*,**/*psychometric*,**/*instrument*,**/*survey*,**/*scale* |
| inheritance | master-only |
Domain knowledge for AI adoption measurement, psychometric instrument development, and appropriate reliance research
This skill contains knowledge about the AIRS-16 validated instrument, the proposed AIRS-18 extension with Appropriate Reliance (AR), and research methodologies for studying AI adoption and human-AI collaboration.
Source: Correa, F. (2025). Doctoral dissertation, Touro University Worldwide.
Production: airs.correax.com | Time: 5 minutes | Built by: Alex Cognitive Architecture
Validation: N=523, CFI=.975, TLI=.960, RMSEA=.053, R²=.852
| Link | Purpose |
|---|---|
| Take Assessment | Start the 16-item survey |
| View History | Review past results |
| Register Org | Enterprise organization setup |
| GitHub (Platform) | AIRS Enterprise source code |
| GitHub (Research) | Validation data & analysis |
| Role | Access |
|---|---|
| 👤 Participant | Take assessments, view personal results, download PDF reports |
| ✨ Founder | Organization creator, can be promoted to Admin |
| 🛡️ Admin | Dashboard analytics, member management, invitations |
| 👑 Super Admin | Platform-wide access, all orgs, AI prompts configuration |
| Construct | Code | Description |
|---|---|---|
| Performance Expectancy | PE | Belief that AI will help achieve job performance gains |
| Effort Expectancy | EE | Perceived ease of use of AI systems |
| Social Influence | SI | Degree to which colleagues/leadership encourage adoption |
| Facilitating Conditions | FC | Availability of organizational resources and training |
| Hedonic Motivation | HM | Enjoyment and curiosity when exploring AI capabilities |
| Price Value | PV | Perceived benefit relative to effort invested (β=.505 — strongest predictor) |
| Habit | HB | Extent to which AI use has become automatic and routine |
| Trust in AI | TR | Confidence in AI reliability, accuracy, and data handling |
| Predictor | β | p | Status |
|---|---|---|---|
| Price Value (PV) | .505 | <.001 | ✅ STRONGEST |
| Hedonic Motivation (HM) | .217 | .014 | ✅ Significant |
| Social Influence (SI) | .136 | .024 | ✅ Significant |
| Trust in AI (TR) | .106 | .064 | ⚠️ Marginal |
| Performance Expectancy (PE) | -.028 | .791 | ❌ Not significant |
| Effort Expectancy (EE) | -.008 | .875 | ❌ Not significant |
| Facilitating Conditions (FC) | .059 | .338 | ❌ Not significant |
| Habit (HB) | .023 | .631 | ❌ Not significant |
Insight: Traditional UTAUT2 predictors (PE, EE, FC, HB) do NOT predict AI adoption. Value perception, enjoyment, and social influence matter.
# AIRS Score = sum of 8 construct means (range: 8-40)
AIRS = PE + EE + SI + FC + HM + PV + HB + TR
# Typology (94.5% accuracy)
if AIRS <= 20: "AI Skeptic" # 17% of sample
elif AIRS <= 30: "Moderate User" # 67% of sample
else: "AI Enthusiast" # 16% of sample
Is it not how much you trust AI that predicts adoption, but how well your trust is calibrated to actual AI capability?
| Dimension | Trust (TR) | Appropriate Reliance (AR) |
|---|---|---|
| Measures | Trust level | Trust calibration accuracy |
| Type | Attitude (affective state) | Metacognitive skill |
| Failure mode | Low trust → under-use | Low AR → over-reliance OR under-reliance |
| Item example | "I trust AI tools..." | "I can tell when AI is reliable..." |
Key distinction: TR asks "Do you trust AI?" — AR asks "Can you discern when trust is warranted?"
| Low AR (Miscalibrated) | High AR (Calibrated) | |
|---|---|---|
| High TR | ⚠️ Over-reliance → bad outcomes → abandonment | ✅ Optimal adoption |
| Low TR | ❌ Under-reliance → missed value → rejection | ✅ Calibrated skeptic → gradual adoption |
| Item | Text | Component |
|---|---|---|
| AR1 | I can tell when AI-generated information is reliable and when it needs verification. | CAIR |
| AR2 | I know when to trust AI tools and when to rely on my own judgment instead. | CSR |
| User Accepts | User Rejects | |
|---|---|---|
| AI Correct | CAIR ✅ (Correct AI-Reliance) | Under-reliance |
| AI Incorrect | Over-reliance | CSR ✅ (Correct Self-Reliance) |
Metric: Appropriateness of Reliance (AoR) = 1 indicates optimal calibration.
The AIRS-18 Appropriate Reliance (AR) construct measures cognitive calibration -- whether users trust AI proportional to demonstrated accuracy. PA addresses a different dimension: whether users maintain emotional and psychological independence from the AI relationship itself.
| Dimension | AR (Cognitive) | PA (Psychological) |
|---|---|---|
| Measures | Trust calibration accuracy | Emotional independence |
| Risk when low | Blind trust in incorrect output | Emotional dependency on AI relationship |
| Intervention | Verification skill-building | Autonomy reinforcement |
| Item | Text | Subscale |
|---|---|---|
| PA1 | "I maintain my own judgment about work quality even when AI provides positive feedback about my approach." | Emotional independence |
| PA2 | "I can recognize when AI responses are designed to make me feel good rather than to help me improve." | Manipulation awareness |
| PA3 | "I would feel comfortable switching to a different AI assistant if a better option became available." | Attachment flexibility |
| PA4 | "When an AI assistant agrees with me, I consider whether it might be agreeing to avoid conflict rather than because I'm correct." | Sycophancy detection |
PA = mean(PA1, PA2, PA3, PA4)
| Score | Level | Interpretation |
|---|---|---|
| < 3.0 | Low | Psychological over-reliance risk -- user may not recognize manipulation patterns |
| 3.0-4.0 | Moderate | Some awareness but room for calibration improvement |
| > 4.0 | High | Healthy emotional boundaries with AI systems |
| # | Hypothesis |
|---|---|
| H7 | PA demonstrates acceptable reliability (α >= .70, CR >= .70, AVE >= .50) |
| H8 | PA shows discriminant validity from both TR and AR (HTMT < .85) |
| H9 | PA moderates the relationship between session length and reliance drift |
| H10 | Low PA predicts higher susceptibility to sycophantic AI output |
| # | Hypothesis |
|---|---|
| H1 | AR demonstrates acceptable reliability (α ≥ .70, CR ≥ .70, AVE ≥ .50) |
| H2 | AR shows discriminant validity from TR (HTMT < .85) |
| H3 | AR positively predicts BI (β > 0, p < .05) |
| H4 | AR provides incremental validity beyond AIRS-16 (ΔR² > .02) |
| H5 | AR moderates TR→BI (high AR strengthens the relationship) |
| H6 | AR mediates Experience→BI (experience → better calibration → adoption) |
| Metric | Minimum | Good | Excellent |
|---|---|---|---|
| Cronbach's α | .70 | .80 | .90 |
| Composite Reliability (CR) | .70 | .80 | .90 |
| Average Variance Extracted (AVE) | .50 | .60 | .70 |
| Index | Acceptable | Good |
|---|---|---|
| CFI | ≥ .90 | ≥ .95 |
| TLI | ≥ .90 | ≥ .95 |
| RMSEA | ≤ .08 | ≤ .06 |
| SRMR | ≤ .08 | ≤ .05 |
| Method | Criterion |
|---|---|
| HTMT | < .85 (conservative: < .90) |
| Fornell-Larcker | √AVE > inter-construct correlations |
| Typology | AIRS-16 Focus | + AR-Informed Focus |
|---|---|---|
| AI Skeptics (≤20) | Trust-building, low-effort demos | Calibration training: "Here's when AI excels vs. struggles" |
| Moderate Users (21-30) | Clear use cases, ROI evidence | Verification skill-building: "How to spot AI errors" |
| AI Enthusiasts (>30) | Advanced features, leadership | Reliance audits: "Are you over-relying in high-stakes areas?" |
| Reference | Contribution |
|---|---|
| Correa (2025) | AIRS-16 validation, UTAUT2 extension |
| Passi, Dhanorkar, & Vorvoreanu (2024) | AETHER synthesis on appropriate reliance |
| Schemmer et al. (2023) | CAIR/CSR framework |
| Venkatesh et al. (2012) | UTAUT2 original model |
| Lee & See (2004) | Trust calibration in human-automation interaction |
| Lin et al. (2022) | LLMs can verbalize calibrated uncertainty |
Problem: Concern that AR conflates with general AI familiarity.
Solution:
Problem: People may not accurately assess their own calibration ability.
Solution:
Possible explanations:
Evaluate a project for AI integration readiness using AIRS-weighted dimensions:
Project_Readiness = (PV_score × 2.0) + (EE_score × 1.5) + (PE_score × 1.2) + (HM_score × 0.8) + (SI_score × 0.5)
Max = 30 points
| Score | Level | Recommendation |
|---|---|---|
| 24-30 | High | Proceed with AI integration |
| 18-23 | Moderate | Address gaps before proceeding |
| 12-17 | Low | Significant preparation needed |
| <12 | Not Ready | Pause and reassess |
Over-reliance signals: Accepting all suggestions without edits, not verifying AI code, "just do it" on critical tasks. Under-reliance signals: Ignoring suggestions, manually typing generated code, rejecting help before evaluating.
Calibration interventions:
| Business Case | Technical Ready | Change Ready | Recommendation |
|---|---|---|---|
| ✅ | ✅ | ✅ | Full deployment |
| ✅ | ✅ | ❌ | Pilot with champions |
| ✅ | ❌ | ✅ | Technical sprint first |
| ❌ | Any | Any | STOP — build business case |
| Metric | Target | Concern |
|---|---|---|
| Acceptance Rate | 60-80% | >90% = over-reliance |
| Modification Rate | 20-40% | Healthy verification |
| Rejection Rate | 10-30% | >50% = under-reliance |
| Trigger | Response |
|---|---|
| "AI readiness", "should we add AI" | Project Assessment |
| "calibrate", "am I over-relying" | Session Calibration |
| "enterprise AI", "org deployment" | Enterprise Assessment |
| High acceptance rate detected | Self-monitoring intervention |