Run any Skill in Manus with one click

Get Started

$pwd:

develop-web-translator

Name: Develop Web Translator
Author: zotero

// Develop a web translator that scrapes bibliographic data from a website. This is the most common translator type.

Run Skill in Manus

$ git log --oneline --stat

stars:1,623

forks:884

updated:April 1, 2026 at 20:38

SKILL.md

readonly

name	develop-web-translator
description	Develop a web translator that scrapes bibliographic data from a website. This is the most common translator type.

Prerequisites

Fetch and read the Zotero translator documentation:

Also read index.d.ts in the repo root for type definitions. Give more weight to recently created translators when looking for examples.

Step 1: Gather information

Collect from the user:

Label: The translator name (usually the site name)
Creator: The author's name
Target URL(s): One or more example URLs from the target site

From the URLs, derive the target regex.

Step 2: Analyze the site

DO NOT fetch site pages with WebFetch, curl, or any HTTP tool. Use the tools instead:

node .bin/capture-har.mjs "<example url>"

Read the generated YAML file. It contains full API schemas. This is your source of truth.

node .bin/inspect-page.mjs "<example url>"

This gives you meta tags, accessibility tree, and screenshot.

Step 3: Choose an approach

Check the inspect-page meta tags first:

Embedded Metadata (EM) — if the page has Highwire Press tags (citation_title, citation_author, citation_doi, etc.), Dublin Core (DC.title, etc.), or good JSON-LD with bibliographic data, use EM. This is the most common approach (~180 translators use it):

async function scrape(doc, url = doc.location.href) {
    let translator = Zotero.loadTranslator('web');
    translator.setTranslator('951c027d-74ac-47d4-a107-9c3069ab7b48'); // EM
    translator.setDocument(doc);
    translator.setHandler('itemDone', (_obj, item) => {
        // fix up fields EM gets wrong
        item.complete();
    });
    await translator.translate();
}

Call await translator.getTranslatorObject() only if you need to customize EM before translation (e.g. setting itemType).

DOI search — if the page doesn't have rich metadata but you can extract a DOI, use a search translator to look it up via DOI Content Negotiation:

async function scrape(doc, url = doc.location.href) {
    let doi = doc.querySelector('a[href*="/doi/"]')?.href.match(/10\.\d{4,}\/[^\s]+/)?.[0];
    if (!doi) return;
    let translate = Zotero.loadTranslator('search');
    translate.setSearch({ DOI: doi });
    translate.setHandler('error', () => {});
    translate.setHandler('itemDone', (_obj, item) => {
        item.complete();
    });
    await translate.translate();
}

API-based — the site has a clean JSON API visible in the YAML. Call it with requestJSON().
HTML scraping — no useful APIs or metadata. Parse the DOM directly. Last resort.
Hybrid — combine any of the above.

Step 4: Initialize and write code

node .bin/init-translator.mjs --label "<Label>" --creator "<Creator>" --target "<regex>" --type web

Implement detectWeb(doc, url), getSearchResults(doc, checkOnly), doWeb(doc, url), and scrape(doc, url).

Step 5: Create tests

node .bin/create-test.mjs "<Label>.js" --url "<example url>"

Include at least one single-item test and one multiple-item test (if supported).

Step 6: Verify and submit

Update lastUpdated every time you modify translator code. Zotero uses it to determine when to push updates to users.

node .bin/update-metadata.mjs "<Label>.js"
npm run lint -- "<Label>.js"
node .bin/run-tests.mjs "<Label>.js"

All tests must pass. Then create a branch and PR.

related-skills.json

same repository

capture-api.md

from "zotero/translators"

Analyze a website's API by capturing network traffic (HAR) and generating an OpenAPI spec via mitmproxy2swagger.

2026-04-011.6k

create-test.md

from "zotero/translators"

Create or update test cases for a Zotero translator by running it against live URLs and capturing the output.

2026-04-011.6k

develop-export-translator.md

from "zotero/translators"

Develop an export translator that converts Zotero items into a file format (JSON, XML, CSV, etc.).

2026-04-011.6k

develop-import-translator.md

from "zotero/translators"

Develop an import translator that parses a file format (JSON, XML, RIS, BibTeX, CSV, etc.) into Zotero items.

2026-04-011.6k

develop-search-translator.md

from "zotero/translators"

Develop a search translator that looks up items by identifier (DOI, ISBN, PMID, arXiv ID, etc.) via an external API. NOT for websites with search pages — use develop-web-translator for those.

2026-04-011.6k

inspect-page.md

from "zotero/translators"

Inspect a live web page using headless Chrome. Gets screenshots, meta tags, accessibility tree, and runs CSS selectors or JS expressions against the rendered DOM.

2026-04-011.6k

package.json

"author": "zotero"

"repository": "zotero/translators"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

async function scrape(doc, url = doc.location.href) { let translator = Zotero.loadTranslator('web'); translator.setTranslator('951c027d-74ac-47d4-a107-9c3069ab7b48'); // EM translator.setDocument(doc); translator.setHandler('itemDone', (_obj, item) => { // fix up fields EM gets wrong item.complete(); }); await translator.translate(); }

async function scrape(doc, url = doc.location.href) { let doi = doc.querySelector('a[href*="/doi/"]')?.href.match(/10\.\d{4,}\/[^\s]+/)?.[0]; if (!doi) return; let translate = Zotero.loadTranslator('search'); translate.setSearch({ DOI: doi }); translate.setHandler('error', () => {}); translate.setHandler('itemDone', (_obj, item) => { item.complete(); }); await translate.translate(); }

develop-web-translator

Prerequisites

Step 1: Gather information

Step 2: Analyze the site

Step 3: Choose an approach

Step 4: Initialize and write code

Step 5: Create tests

Step 6: Verify and submit

More from this repository

More from this repository

Prerequisites

Step 1: Gather information

Step 2: Analyze the site

Step 3: Choose an approach

Step 4: Initialize and write code

Step 5: Create tests

Step 6: Verify and submit