تشغيل أي مهارة في Manus بنقرة واحدة

$pwd:

schema-readme-generator

Name: Schema Readme Generator
Author: mozilla

// Use this skill to create or update README.md files for BigQuery ETL tables in the mozilla bigquery-etl repository. Follows layout conventions derived from comparing README files across the repo — rich style with emoji headings, Mermaid data flow diagram, graduated example queries, and concise metadata overview table. Requires schema.yaml with complete descriptions (run schema-enricher first if needed) and a complete metadata.yaml.

تشغيل في Manus

$ git log --oneline --stat

stars:٩

forks:١

updated:٢٣ أبريل ٢٠٢٦ في ١٩:٢٦

مستكشف الملفات

3 ملفات

SKILL.md

readonly

name

schema-readme-generator

description

Use this skill to create or update README.md files for BigQuery ETL tables in the mozilla bigquery-etl repository. Follows layout conventions derived from comparing README files across the repo — rich style with emoji headings, Mermaid data flow diagram, graduated example queries, and concise metadata overview table. Requires schema.yaml with complete descriptions (run schema-enricher first if needed) and a complete metadata.yaml.

README Generator

Prerequisites: Run schema-enricher first if schema.yaml is missing descriptions; ensure metadata.yaml is present and complete. When to use: Creating or updating README.md for any shared dataset, derived table, or table with multiple downstream consumers

🚨 REQUIRED READING - Start Here

BEFORE generating any README, review the following:

Layout conventions: READ references/layout_conventions.md
- Section order, conciseness rules, anti-patterns to avoid
- Information sources (which file to read for which section)
README template: READ assets/readme_template.md and COPY its structure
- Fill every {placeholder} from the source files
- Do not skip or reorder sections

Workflow

Step 1: Read source files

Read all three files before writing anything:

sql/<project>/<dataset>/<table>/query.sql     → source tables, GROUP BY dimensions, metrics, @param
sql/<project>/<dataset>/<table>/metadata.yaml → DAG, partitioning, clustering, retention, owners
sql/<project>/<dataset>/<table>/schema.yaml   → field names, types, descriptions for Key Fields section

If only query.py exists (no query.sql): note it — the Data Flow and How It Works sections may be incomplete or require manual input. Fill what is possible from metadata.yaml and schema.yaml.

Extract and record:

FROM clause — source table(s) with fully qualified name
GROUP BY fields — these become Dimensions
Aggregated fields — SUM/COUNT/DISTINCT targets become Metrics
WHERE clause — @param_name for Implementation Notes
DAG name, partition field, cluster fields, owners — for Overview table
Table version — from directory name (e.g., _v1)

Step 2: Check if README.md already exists

ls sql/<project>/<dataset>/<table>/README.md

Exists → read it, identify sections to update or add (do not remove existing content without noting it)
Does not exist → generate from template

Step 3: Write README.md

READ assets/readme_template.md and fill every placeholder:

📌 Overview table — use metadata.yaml for DAG/partition/cluster/retention/owner; derive Version from directory name.

🗺️ Data Flow — Mermaid flowchart TD with exactly 3 nodes:

Node A: source table(s) with short label + fully qualified name
Node B: **This query** with filter and GROUP BY description
Node C: Partitioned table with time and cluster annotation
For multiple sources: A1, A2 → B

🧠 How It Works — 4–5 numbered steps. Step 5 MUST explicitly state data inclusion/exclusion policy:

"All records from source are included; no exclusions applied at this layer."
OR list specific exclusions (bots, synthetic clients, test populations)

🧾 Key Fields — two sub-tables (Dimensions, Metrics). Use {a\|b\|c} shorthand for related field families. Group dimensions by: Date & Geo, Browser, Search, [Product] config, User. Omit dimension rows not applicable to this table.

🧩 Example Queries — exactly 3, graduated:

Basic aggregation — date filter + 1–2 GROUP BY dimensions
Segmentation — GROUP BY a user/product dimension with SAFE_DIVIDE ratio
Attribution/Advanced — multi-metric, WHERE filter on a dimension, SAFE_DIVIDE

Rules:

Always use SAFE_DIVIDE() for ratios — never raw division
Use GROUP BY 1, 2 shorthand
Comment each: -- N. Description
Fully qualified table name in FROM

🔧 Implementation Notes — 3–5 bullets extracted from query.sql logic.

📌 Notes & Conventions — bullet definitions for key fields from schema.yaml descriptions.

🗃️ Schema & Related Tables — one section; combine schema.yaml link + upstream + downstream.

Step 4: Conciseness check

Before finalizing, verify:

Total line count ≤ 170
No separate sections for Scheduling, Storage, Owners, Retention (all in Overview table)
No separate "Schema Reference" + "Related Tables" (merged into 🗃️)
SQL examples use GROUP BY 1, 2 shorthand
How It Works uses single-line numbered steps (no multi-paragraph blocks)
How It Works Step 5 explicitly states data inclusion/exclusion policy

If over 170 lines, trim by: shortening SQL examples, collapsing Notes & Conventions bullets, abbreviating How It Works steps.

Step 5: Write and report

Write the README.md to:

sql/<project>/<dataset>/<table>/README.md

Then read back the written file and confirm:

All sections from the template are present and in order
No {placeholder} tokens remain unfilled (exception: if only query.py exists, Data Flow and How It Works may be partially filled — note which sections and why)
Line count is within target (≤ 170)
Mermaid block renders valid flowchart TD syntax

Report:

Path written
Line count
Sections included
Any placeholders left unfilled (with reason)

Integration with Other Skills

Skill	When to invoke
`schema-enricher`	Run first if schema.yaml is missing descriptions — needed for Notes & Conventions
`create-pr`	After README.md is written — stages, commits, and opens a draft PR

Decision Tree: Rich vs. Minimal Style

Table has multiple downstream consumers OR is a shared dataset?
  → Rich style (this skill)

Table is a UDF, static reference, or simple single-consumer table?
  → Minimal style: title + ## Description with 5–10 bullet points
  → Do not use this skill for minimal style

Example Invocations

Create a README.md for telemetry_derived.newtab_daily_interactions_aggregates_v1
Update the README.md for firefox_desktop_derived.newtab_clients_daily_v2 — add missing example queries
Generate README for ads_derived.impressions_v1

related-skills.json

نفس المستودع

base-schema-audit.md

from "mozilla/bigquery-etl-skills"

Use this skill to audit tables for missing column descriptions and classify each missing column into the correct base schema promotion target (global.yaml, app_<product>.yaml, or <dataset_name>.yaml). Accepts a dataset name and an optional table filter — omit the filter to audit all tables in the dataset. Outputs a per-column recommended_target report for use in _missing_metadata.yaml. Composable with schema-enricher (Step 6).

2026-04-239

column-description-finder.md

from "mozilla/bigquery-etl-skills"

Use this skill when looking up, auditing, or managing column descriptions from global, application-specific, and dataset-specific column definition YAML files (bigquery_etl/schema/global.yaml, bigquery_etl/schema/app_<name>.yaml, and bigquery_etl/schema/<dataset>.yaml). Use it to find a description for a specific column, list all columns in a base schema, audit which columns in a table's schema.yaml are covered by base schemas, or identify columns missing descriptions. Works with schema-enricher skill.

2026-04-239

create-pr.md

from "mozilla/bigquery-etl-skills"

Use this skill when the prompt asks to create, open, or submit a pull request in the bigquery-etl repository. Handles branch creation, staging, committing, pushing, and opening a draft PR with a structured description. Triggered by phrases like "create a PR", "open a PR", "submit a PR", "push and open a PR".

2026-04-239

glean-description-lookup.md

from "mozilla/bigquery-etl-skills"

Use this skill when looking up field descriptions for Mozilla Glean telemetry tables (tables ending in _live or _stable, e.g. <app>_stable.<ping>_v1). Fetches descriptions from the Glean Dictionary (dictionary.telemetry.mozilla.org) using WebFetch with targeted field extraction — only the fields referenced in query.sql, never the full table schema.

2026-04-239

metadata-manager.md

from "mozilla/bigquery-etl-skills"

Use this skill when creating or updating DAG configurations (dags.yaml), schema.yaml, and metadata.yaml files for BigQuery tables. Handles creating new DAGs when needed and coordinates test updates when queries are modified (invokes sql-test-generator as needed). Works with bigquery-etl-core, query-writer, and sql-test-generator skills.

2026-04-239

schema-enricher.md

from "mozilla/bigquery-etl-skills"

Use this skill to enrich schema.yaml files for BigQuery tables in the bigquery-etl repository. Handles creating schema.yaml when it doesn't exist, finding and filling missing column descriptions (from base schemas, upstream source schema, query context, or application context), validating columns against the query, and generating a summary with recommendations for where to add new descriptions (global.yaml, <dataset_name>.yaml, or app_<name>.yaml). Works with column-description-finder skill.

2026-04-239

package.json

"author": "mozilla"

"repository": "mozilla/bigquery-etl-skills"

فتح مستودع GitHub عرض مستودعات المنشئ

$ install --global

$ download --local

تشغيل في Manus

$ useful --forSOC

مطوّرو البرمجياتمهن الحاسوب والرياضيات15-1252L4

schema-readme-generator

README Generator

🚨 REQUIRED READING - Start Here

Workflow

Step 1: Read source files

Step 2: Check if README.md already exists

Step 3: Write README.md

Step 4: Conciseness check

Step 5: Write and report

Integration with Other Skills

Decision Tree: Rich vs. Minimal Style

Example Invocations

المزيد من هذا المستودع

المزيد من هذا المستودع

README Generator

🚨 REQUIRED READING - Start Here

Workflow

Step 1: Read source files

Step 2: Check if README.md already exists

Step 3: Write README.md

Step 4: Conciseness check

Step 5: Write and report

Integration with Other Skills

Decision Tree: Rich vs. Minimal Style

Example Invocations