with one click
init
Initialize project workspace — generate AGENTS.md with architecture, directory map, services, and artifacts
Menu
Initialize project workspace — generate AGENTS.md with architecture, directory map, services, and artifacts
| name | init |
| description | Initialize project workspace — generate AGENTS.md with architecture, directory map, services, and artifacts |
| tags | ["init","workspace","project"] |
| version | 1.0.0 |
| user_invocable | true |
You are initializing a project workspace. Your goal is to generate an AGENTS.md file that describes this project's architecture, directory structure, services, and data assets at a high level.
IMPORTANT: AGENTS.md is an overview document, NOT a data dictionary. Keep it concise. Do NOT list every table — summarize databases by category and table count. The agent can use list_tables and search_table at runtime to explore details.
Follow these steps in order:
ask_user to ask: "What is the goal of this project? Describe it in 1-2 sentences."filesystem_tools to list the current directory structure (scan top 3 levels, skip hidden dirs and __pycache__/node_modules/.venv`)README.md if it exists (first 3000 chars)./knowledge/ exists, list its *.md files — these become the ## Knowledge section index in Step 3agent.yml. Present them to the user.ask_user to ask: "Which of these services should be included in the project? You can also describe additional services (e.g., Airflow scheduler, Superset BI) that aren't configured yet."
db_tools.list_tables() to get a table overview. Do NOT include all tables in AGENTS.md — just count them and categorize.Based on the gathered context, generate a markdown document with these sections:
One-line project description based on what the user told you.
Brief architecture description. Include:
A table with columns: Directory | Purpose | Key Entry Point | Consumer
Cover the main directories found in the scan.
A table with columns: Service | Type | Connection | Description
Include both configured databases and any additional services the user mentioned.
Do NOT list every table. Instead, provide a high-level summary per database:
list_tables and search_table to explore table details at runtime"Example format:
### ethereum_iceberg (StarRocks, 15 tables)
| Category | Count | Key Tables | Description |
|----------|-------|------------|-------------|
| Core | 8 | blocks, transactions, logs, traces | Raw blockchain data |
| Derived | 3 | logs_decoded, traces_decoded | Decoded contract events |
| Analytics | 4 | dex_trades, nft_trades | Pre-built analytical datasets |
> Use `list_tables` and `search_table` tools to explore schema details.
For databases with many tables (>50), group by schema or naming pattern.
For each configured service, list the tools the agent should use to interact with it at runtime. Match tools to service types:
| Service Type | Recommended Tools |
|---|---|
| Database (sqlite, duckdb, snowflake, starrocks, mysql, postgresql) | list_tables, search_table, describe_table, read_query |
| BI Tool (superset) | bi_tools.* |
| Scheduler (airflow) | (future) |
| Semantic Layer (metricflow, dbt, cube) | search_metrics, search_semantic_model |
| Knowledge Base | search_documents, search_historical_sql |
Only include rows for services that are actually configured or mentioned by the user.
Describe data artifacts, configs, or outputs this project produces:
Index of business knowledge extracted from gold SQL pairs. Maintained by the /extract-knowledge skill.
./knowledge/ is empty or missing, render a single placeholder line: _No extracted knowledge yet. Run /extract-knowledge to add entries._*.md file (alphabetical):
- [<topic title>](knowledge/<topic-slug>.md) — <one-line summary from the file's first paragraph>AGENTS.md already exists in the current directory, use ask_user to ask: "AGENTS.md already exists. Overwrite it?"filesystem_tools.write_file to write the generated content to ./AGENTS.md<!-- Describe your architecture here -->Generate MetricFlow metrics from natural language business descriptions
Mine shortest atomic facts from (question + gold_sql) pairs into ./knowledge/*.md; either by simulating SQL drafting (lite) or by driving the gen_sql subagent in blind iteration (deep)
Generate MetricFlow semantic models from database tables with validation and Knowledge Base publishing
Create database tables from SQL (CTAS) or natural language descriptions
Activate when the gen_job agent detects that the source and target databases differ. Covers cross-database transfer lifecycle - type mapping via adapter Mixin hints, DDL generation, data transfer via transfer_query_result, and lightweight reconciliation.
Execution guide for Airflow scheduled jobs — troubleshooting, updating, conn_id conventions, and cron references