AgentSkillsCN

skill-freshness-audit

对所有代理技能进行系统性的技能新鲜度审计,包括验证锚点、波动性分类、陈旧性检测,以及追踪上游血统。从技能中提取官方文档的URL,将模式与实时文档进行比对,追踪与databricks-solutions/ai-dev-kit的血统关系,并报告偏差。在审计技能时效性、对照最新Databricks/MLflow文档验证技能、检查陈旧技能、与上游AI-Dev-Kit同步,或在平台发布后使用此功能。可通过“审计技能”“检查新鲜度”“陈旧技能”“验证技能”“技能审计”“更新检查”“Databricks已发布”“MLflow新版本”“上游同步”“AI-Dev-Kit血统”等触发器来启动该操作。

SKILL.md
--- frontmatter
name: skill-freshness-audit
description: >
  Systematic skill freshness auditing with verification anchors, volatility classification,
  staleness detection, and upstream lineage tracking across all Agent Skills. Fetches official
  documentation URLs embedded in skills, compares patterns against live docs, tracks lineage
  to databricks-solutions/ai-dev-kit, and reports drift. Use when auditing skill currency,
  verifying skills against latest Databricks/MLflow docs, checking for stale skills, syncing
  with upstream AI-Dev-Kit, or after a platform release. Triggers on "audit skills",
  "check freshness", "stale skills", "verify skills", "skill audit", "update check",
  "Databricks released", "new MLflow version", "upstream sync", "ai-dev-kit lineage".
license: Apache-2.0
metadata:
  author: prashanth subrahmanyam
  version: "2.0.0"
  domain: admin
  role: utility
  standalone: true
  last_verified: "2026-02-07"
  volatility: low
  upstream_sources: []  # This audit system (self-referential)

Skill Freshness Audit

Ensures all Agent Skills stay current with official Databricks, MLflow, and platform documentation through systematic verification, staleness detection, and drift reporting.

When to Use

  • Periodic skill audits (recommended: monthly for high-volatility, quarterly for medium)
  • After a Databricks or MLflow platform release
  • When a skill produces incorrect patterns during implementation
  • When user says "audit skills", "check freshness", or "verify skills"
  • Before major implementations to ensure skill accuracy

Freshness Metadata Schema

Every skill's frontmatter should include these three fields:

yaml
metadata:
  last_verified: "2026-02-07"          # ISO date when skill was last verified against docs
  volatility: high                      # high | medium | low — how often the underlying APIs change
  verification_sources:                 # URLs to check for drift (optional, for skills with external refs)
    - url: "https://docs.databricks.com/aws/en/metric-views/yaml-ref"
      check_for: "YAML syntax, supported field types"
    - url: "https://mlflow.org/docs/latest/genai/serving/responses-agent"
      check_for: "ResponsesAgent API, predict() signature"
  upstream_sources:                     # Repo-level lineage tracking (tracks upstream dependencies)
    - name: "ai-dev-kit"
      repo: "databricks-solutions/ai-dev-kit"
      paths:
        - "databricks-skills/agent-bricks/SKILL.md"
      relationship: "extended"          # derived | extended | inspired | reference
      last_synced: "2026-02-09"
      sync_commit: "97a3637"

verification_sources checks live documentation URLs for API drift. upstream_sources tracks structured repo-level lineage for systematic upstream sync audits. Both are complementary.

Template: See assets/templates/verification-metadata.yaml for a copy-paste starter.


Volatility Classification

VolatilityStale AfterDescriptionExample Skills
high30 daysAPIs change frequently, new features added oftenGenAI agents, MLflow 3.x, Genie APIs, metric views
medium90 daysFeatures evolve but core patterns are stableDLT, monitoring, dashboards, asset bundles
low180 daysStable patterns, rarely changeERD diagrams, naming standards, merge patterns, documentation

Full classification: See references/volatility-classification.md


Audit Workflow

Quick Audit (5 minutes) — Find Stale Skills

code
1. Run scan script to find all skills with stale last_verified dates
2. Report stale skills grouped by volatility
3. Prioritize high-volatility skills for verification

Script: See scripts/scan_skill_freshness.py

Full Audit (per skill) — Verify Against Live Docs

code
1. Read the skill's verification_sources from frontmatter
2. For each verification URL:
   a. WebFetch the URL
   b. Compare key patterns (API signatures, SQL syntax, SDK methods)
   c. Check for deprecated features still recommended in skill
   d. Check for new capabilities missing from skill
3. Report drift as a list of specific updates needed
4. Update last_verified date after verification (even if no changes needed)

Platform Release Audit — After Databricks/MLflow Release

code
1. Identify affected domain (e.g., "MLflow 3.2 released" → genai-agents domain)
2. Find all skills in affected domain using volatility-classification.md
3. Run Full Audit on each affected skill
4. Update skills with new patterns / deprecation warnings
5. Update last_verified dates

Upstream Source Audit — Check AI-Dev-Kit Lineage

Skills track their lineage to databricks-solutions/ai-dev-kit via upstream_sources metadata. This audit checks whether upstream sources have changed since the skill was last synced.

code
1. Run scan script to find skills with stale last_synced dates
2. For each skill with upstream_sources:
   a. Read the skill's upstream_sources from frontmatter
   b. For each upstream path, WebFetch the raw GitHub URL:
      https://raw.githubusercontent.com/{repo}/main/{path}
   c. Compare key patterns against the skill's current content
   d. Check for new patterns, deprecated approaches, or API changes in upstream
3. Report upstream drift grouped by relationship type (derived > extended > reference)
4. After syncing, update last_synced date and sync_commit in the skill frontmatter

Lineage Map: See references/ai-dev-kit-lineage-map.md for the complete mapping of all skills to their upstream AI-Dev-Kit sources.

Priority by relationship type:

RelationshipSync PriorityReasoning
derivedHighSkill directly draws from upstream; changes likely require updates
extendedMediumSkill extends upstream; check for new base patterns
inspiredLowHeavily customized; check for major direction changes only
referenceLowOriginal content; check for API/pattern accuracy only

Upstream Drift Report Format

When upstream drift is detected, report in this format:

markdown
## Upstream Drift Report: {skill-name}

**Skill:** `data_product_accelerator/skills/{domain}/{skill-name}/SKILL.md`
**Upstream:** `{repo}` → `{path}`
**Relationship:** {derived|extended|inspired|reference}
**Last Synced:** {date} (commit: {hash})
**Status:** UPSTREAM DRIFT DETECTED

### Changes in Upstream:
1. **{Pattern Name}** — {description of what changed upstream}
   - **Our skill says:** {current pattern in our skill}
   - **Upstream says:** {new pattern in upstream}
   - **Impact:** {high|medium|low} — {why this matters}

### Recommended Actions:
- [ ] Update {section} to align with upstream {pattern}
- [ ] Add new {capability} from upstream
- [ ] Update `last_synced` and `sync_commit` in frontmatter

Critical Rules

1. Always Update last_verified After Verification

Even if no changes are needed, update the date. This proves the skill was checked:

yaml
# Before verification
metadata:
  last_verified: "2025-11-15"

# After verification (no changes needed)
metadata:
  last_verified: "2026-02-07"

2. Verification Sources Are Verification Anchors, Not Auto-Updaters

The URLs don't automatically update the skill. They give the agent a ground truth to compare against when prompted. The agent must:

  • Fetch the URL content
  • Compare key patterns against the skill's instructions
  • Report differences
  • Wait for human approval before updating skill content

3. Prioritize High-Volatility Skills

When time is limited, always audit high-volatility skills first. A stale GenAI agent pattern can cause implementation failures; a stale ERD pattern is cosmetic.

4. Report Format for Drift

When drift is detected, report in this format:

markdown
## Drift Report: {skill-name}

**Skill:** `data_product_accelerator/skills/{domain}/{skill-name}/SKILL.md`
**Last Verified:** {date}
**Volatility:** {high|medium|low}
**Status:** DRIFT DETECTED

### Changes Found:
1. **{Pattern Name}** — {description of what changed}
   - **Skill says:** {current pattern in skill}
   - **Docs say:** {current pattern in docs}
   - **Impact:** {high|medium|low} — {why this matters}

### Recommended Updates:
- [ ] Update {section} with new {pattern}
- [ ] Add deprecation warning for {old pattern}
- [ ] Add new {capability} section

Scanning for Stale Skills

Using the Scan Script

The scan script parses all SKILL.md frontmatter and reports staleness:

bash
python data_product_accelerator/skills/admin/skill-freshness-audit/scripts/scan_skill_freshness.py

Manual Scan Pattern

If you prefer to scan manually:

code
1. Glob all SKILL.md files: skills/**/SKILL.md
2. For each, read the frontmatter metadata section
3. Extract last_verified, volatility
4. Calculate days_since_verified = today - last_verified
5. Compare against threshold: high=30, medium=90, low=180
6. Report any skill where days_since_verified > threshold

Verification Source Patterns

What to Check For Each Domain

DomainKey Verification Points
GenAI AgentsResponsesAgent API signature, tracing span types, OBO auth flow, Lakebase API, evaluate() parameters
Semantic LayerMetric view YAML syntax, TVF parameter types, Genie API endpoints, Conversation API schema
MonitoringMonitor creation API, custom metric syntax, dashboard JSON schema, alert API v2
SilverDLT expectation decorators, expectation patterns URL, DQX version/API
GoldConstraint syntax (PK/FK), MERGE statement patterns, CLUSTER BY AUTO syntax
MLMLflow experiment API, model registry API, Feature Store API, LoggedModel API
InfrastructureAsset Bundle YAML schema, job task types, serverless config

Full verification source mapping: See references/verification-sources.md


Integration with Self-Improvement

This skill complements admin/self-improvement:

  • Self-Improvement = reactive (triggers on errors during implementation)
  • Skill Freshness Audit = proactive (triggers on schedule or platform release)

When this audit finds drift, use the self-improvement workflow to apply the updates:

  1. This skill detects the drift
  2. Self-improvement skill applies the fix (update existing skill > create new)

Validation Checklist

Before marking a skill as "verified":

  • All verification_sources URLs fetched and compared
  • API signatures in skill match current documentation
  • No deprecated features still recommended without warnings
  • No new major capabilities missing from skill
  • Code examples still compile/run correctly
  • last_verified date updated in frontmatter
  • If skill has upstream_sources, check upstream for changes since last_synced
  • If upstream changes found, update skill content and last_synced / sync_commit
  • Version history entry added if changes were made

Additional Resources

Version History

DateChanges
Feb 9, 2026Added upstream_sources lineage tracking: AI-Dev-Kit lineage map, upstream source audit workflow, upstream drift report format, scan script upstream sync support
Feb 7, 2026Initial creation: audit workflow, verification anchors, volatility classification, scan script
skill-freshness-audit - AgentSkills CN | 技能图谱