webstatus-ingestion

Name: Webstatus Ingestion
Author: GoogleChrome

Description: Use when working on Go data ingestion workflows, scheduled Cloud Run jobs, or adding new scrapers for BCD, WPT, or other data sources.


stars:238

forks:51

updated:March 10, 2026 at 18:08

SKILL.md

name	webstatus-ingestion
description	Use when working on Go data ingestion workflows, scheduled Cloud Run jobs, or adding new scrapers for BCD, WPT, or other data sources.

webstatus-ingestion

This skill provides guidance for developing and deploying the scheduled data ingestion workflows (Cloud Run Jobs) in the workflows/ directory.

Architecture

Location: Workflows are stand-alone Go applications located in workflows/steps/services/.
Varied Consumers: Specialized workflows (BCD, WPT, UMA, Mapping, Dev Signals) are defined in infra/ingestion/workflows.tf.
Pattern: Most workflows follow a "Downloader -> Parser -> Processor" separation of concerns.
Trigger: When ingestion flows complete, they trigger the event_producer to begin the notification diffing process.

For a detailed map of data sources, Spanner target tables, and job orchestration patterns, see references/architecture.md.

Infrastructure Abstraction (The Adapter Pattern)

Ingestion jobs must be decoupled from the core DB logic and the "Backend" API.

Consumer Adapters: Each workflow uses a purpose-built spanneradapter (e.g., BCDConsumerAdapter).
Why: This prevents ingestion logic from breaking the Public API if the data schema changes. It also allows mocking the database during parser tests.

Guides

Add a New Scheduled Workflow: Steps for creation, scheduling, and Terraform integration.
Ingestion Patterns: Choosing between Sync, Batch Upsert, and Simple Insert.

General Do's and Don'ts

DO cross-reference all code against the official Google Go Style Guide. If you are unsure about a specific style rule, DO NOT assume; you MUST ask the user for clarification.
DO use consumer-specific spanneradapters (e.g. BCDConsumer).
DON'T call the Backend spanner adapter from a workflow.
DO separate data fetching/parsing from the main workflow processor (use pkg/data/downloader.go and parser.go).
DO use web_features_mapping_consumer when syncing browser-specific implementation keys or "implementer" metadata.
DO use uma_export for any changes involving Chromium usage metrics or histograms.
DO use intermediate types in lib/ (e.g. lib/webdxfeaturetypes) to decouple logic from external source schemas.
Tip: make dev_workflows pulls live data, but for UI and Backend development prefer make dev_fake_data, which gives a stable, reproducible dataset.
DO use manifests/job.yaml for workflows (scheduled jobs), unlike workers which use pod.yaml.
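
The advice above about intermediate types can be illustrated with a small sketch. The upstream JSON shape, `externalFeature`, `FeatureValue`, and `BaselineStatus` are all hypothetical and are not the types defined in lib/webdxfeaturetypes; the point is only that the external schema is confined to the parser.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// externalFeature mirrors a hypothetical upstream JSON schema.
// Only the parser should ever see this type.
type externalFeature struct {
	Name   string `json:"name"`
	Status struct {
		Baseline string `json:"baseline"`
	} `json:"status"`
}

// BaselineStatus is a hypothetical intermediate enum owned by this codebase.
type BaselineStatus string

const (
	BaselineHigh    BaselineStatus = "high"
	BaselineLow     BaselineStatus = "low"
	BaselineUnknown BaselineStatus = "unknown"
)

// FeatureValue is the hypothetical intermediate type the processor consumes.
type FeatureValue struct {
	Name     string
	Baseline BaselineStatus
}

// toIntermediate is the single place that understands the upstream shape.
// Unrecognized upstream values map to BaselineUnknown instead of leaking out.
func toIntermediate(e externalFeature) FeatureValue {
	status := BaselineUnknown
	switch e.Status.Baseline {
	case "high":
		status = BaselineHigh
	case "low":
		status = BaselineLow
	}
	return FeatureValue{Name: e.Name, Baseline: status}
}

// ParseFeatures decodes the upstream payload and returns intermediate types
// keyed by feature ID.
func ParseFeatures(data []byte) (map[string]FeatureValue, error) {
	var raw map[string]externalFeature
	if err := json.Unmarshal(data, &raw); err != nil {
		return nil, fmt.Errorf("decode upstream payload: %w", err)
	}
	out := make(map[string]FeatureValue, len(raw))
	for id, e := range raw {
		out[id] = toIntermediate(e)
	}
	return out, nil
}

func main() {
	payload := []byte(`{"grid":{"name":"CSS Grid","status":{"baseline":"high"}}}`)
	features, err := ParseFeatures(payload)
	if err != nil {
		fmt.Println("parse failed:", err)
		return
	}
	fmt.Printf("%s: %s\n", features["grid"].Name, features["grid"].Baseline)
}
```

If the upstream source renames a field, only `externalFeature` and `toIntermediate` change in this sketch; code that consumes `FeatureValue` is unaffected.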

Testing & Linting

Precommit Suite: Run make precommit to execute the full suite of Go tests, formatting, and linting.
Linting: Run make go-lint to lint all Go code using golangci-lint.
Quick Test Iteration: This project uses a multi-module workspace (go.work). To test a single workflow quickly without running the whole suite, run go test from within that module's directory:
```
cd workflows/steps/services/<workflow_name> && go test -v ./...
```

Documentation Updates

When you add a new workflow or change the ingestion patterns:

Update docs/ARCHITECTURE.md to reflect the new external source or data flow.
Trigger the "Updating the Knowledge Base" prompt in GEMINI.md to ensure I am aware of the changes.
Update these skill files if you introduce new structural patterns.

related-skills.json

same repository

webstatus-backend.md

from "GoogleChrome/webstatus.dev"

Use when creating or modifying Go backend API endpoints, modifying Spanner database schemas, or working with OpenAPI and Spanner mappers.

2026-04-22 (238 stars)

webstatus-workers.md

from "GoogleChrome/webstatus.dev"

Use when working with the webstatus notification pipeline, event producer, push delivery, or push workers (e.g., Email, Webhooks), and Pub/Sub subscribers.

2026-04-22 (238 stars)

webstatus-search-grammar.md

from "GoogleChrome/webstatus.dev"

Use when modifying the ANTLR search grammar, adding new search terms, or working with the query parser and builder.

2026-04-20 (238 stars)

webstatus-e2e.md

from "GoogleChrome/webstatus.dev"

Use when writing, modifying, or debugging Playwright end-to-end (E2E) tests for webstatus.dev.

2026-03-10 (238 stars)

webstatus-frontend.md

from "GoogleChrome/webstatus.dev"

Use when modifying the frontend SPA, working with TypeScript, Lit web components, Shoelace components, or frontend tests.

2026-03-10 (238 stars)

webstatus-maintenance.md

from "GoogleChrome/webstatus.dev"

Use when upgrading toolchain versions (Go, Node.js, Terraform, Playwright) or updating the DevContainer and Github CI configurations.

2026-03-10 (238 stars)

package.json

"author": "GoogleChrome"

"repository": "GoogleChrome/webstatus.dev"


