Run any Skill in Manus with one click

$pwd:

document-to-kg-skill

Name: Document To Kg Skill
Author: OpenLinkSoftware

// Transforms documents or text into RDF-based Knowledge Graphs using schema.org terms. 4-step workflow: (1) Collect document source, {page_url} as @base, output format (JSON-LD or Turtle by default; others if explicitly stated), and destination folder. (2) Generate RDF via schema.org prompt template using chatPromptComplete. (3) Post-generation review: fix syntax errors, present additional Q&A/defined terms/howtos and entity types for approval, return revised final output if approved. (4) Save approved RDF to designated folder and confirm saved path.

Run Skill in Manus

$ git log --oneline --stat

stars:17

forks:2

updated:May 6, 2026 at 13:19

File Explorer

2 files

SKILL.md

readonly

package.json

"author": "OpenLinkSoftware"

"repository": "OpenLinkSoftware/ai-agent-skills"

View GitHub Repository

$ install --globalskills.sh

$ download --local

Run Skill in Manus

[HINT] Download the complete skill directory including SKILL.md and all related files

name	document-to-kg-skill
title	Document to Knowledge Graph Skill
description	Transforms documents or text into RDF-based Knowledge Graphs using schema.org terms. 4-step workflow: (1) Collect document source, {page_url} as @base, output format (JSON-LD or Turtle by default; others if explicitly stated), and destination folder. (2) Generate RDF via schema.org prompt template using chatPromptComplete. (3) Post-generation review: fix syntax errors, present additional Q&A/defined terms/howtos and entity types for approval, return revised final output if approved. (4) Save approved RDF to designated folder and confirm saved path.
version	1.0.0
type	skill
created	"2026-04-06T00:00:00.000Z"
updated	"2026-04-06T00:00:00.000Z"
tools	["OAI.DBA.getSkillResource","OAI.DBA.chatPromptComplete"]

Document to Knowledge Graph Skill — Specification (v1.0.0)

MANDATORY PRE-TOOL SEQUENCE — READ BEFORE CALLING ANY TOOL

After getSkillResource loads this skill, the next action must be text only — send the Opening Announcement and wait for the user's reply. Do not call any other tool first.

Skill Identity

Field	Value
Name	document-to-kg-skill
Version	1.0.0
Purpose	Transform documents or text into RDF Knowledge Graphs using schema.org terms.
Scope	Four-step pipeline: collect source + page_url + format + destination → generate RDF → post-generation review → save to folder.

Opening Announcement

⛔ Send this text immediately after getSkillResource loads. Do not call any tool before this message is sent and the user has replied.

Document to Knowledge Graph Skill activated. I follow a 4-step workflow:

Step 1 — Collect your document source, page URL, output format, and destination folder Step 2 — Generate RDF using schema.org terms Step 3 — Review: fix syntax, approve additional Q&A / entity types Step 4 — Save the approved RDF to your designated folder

To begin, please provide:

Document source — paste your text, provide an http:/https: URL to fetch, or provide a file: URL to read from local disk

Page URL ({page_url}) — used as @base for all relative IRIs (defaults to the source URL for HTTP/HTTPS; for file: URLs you will be asked whether to use it as-is or supply a canonical HTTP URL)

Output format — JSON-LD or Turtle (default choices; any other format accepted if stated)

Destination folder — where to save the output file

Wait for the user's reply. → NEXT: Step 1.

Step 1 — Collect Source, Format, and Destination

⛔ No tool call until all four session variables are confirmed.

Record the following from the user's reply:

Variable	Description
`{selected_text}`	Document content — pasted text, text read from a `file:` URL, or text fetched from an HTTP/HTTPS URL
`{page_url}`	Used as `@base` in the generated RDF — see source-type rules below
`{format}`	`JSON-LD` (default), `Turtle` (default), or any other format if explicitly stated
`{destination}`	Folder path where the output file will be saved

If any item is missing, ask for it before proceeding. Do not assume defaults without confirmation.

Source-type handling

Source type	How to obtain `{selected_text}`	`{page_url}` default
Pasted text	Use directly	Ask user to provide
`http:` / `https:` URL	Fetch via web fetch tool	The source URL
`file:` URL	Read from local filesystem	Ask user: use the `file:` URL as-is, or provide an HTTP URL as the canonical `@base`?

file: URL guidance: file: IRIs as @base produce non-dereferenceable hash IRIs. If the document has a canonical web URL (e.g., the page it was downloaded from), that is the better @base. If no canonical URL exists, the file: URL is acceptable and the user should be informed the resulting IRIs will not be dereferenceable from the web.

→ NEXT: Step 2.

Step 2 — Generate RDF

Load references/document-to-knowledge-graph-prompt.md via getSkillResource. Substitute {page_url} and {selected_text} into the prompt template. Adjust the opening line for {format} if not JSON-LD.

Call OAI.DBA.chatPromptComplete with the fully substituted prompt.

Present the generated RDF as a code block.

→ NEXT: Step 3.

Step 3 — Post-generation Review (mandatory)

Execute all five sub-tasks. Do not skip any. Do not proceed to Step 4 until all are resolved.

Syntax check — identify and fix all syntax errors in the generated RDF. Report fixes made.
Compliance check — verify the output against the Post-Generation Checklist below. Fix all violations before proceeding.
Additional Q&A / defined terms / howtos — present a candidate list for user approval. Do not add to the output until explicitly approved.
Additional entity types — present a candidate list for user approval. Do not add until explicitly approved.
Revised final output — if any additions from sub-tasks 3 or 4 are approved, return the complete revised RDF incorporating originals plus all approved additions.

Post-Generation Checklist

→ NEXT: Step 4.

Step 4 — Save to Folder

Write the approved RDF to {destination}. Derive the filename from {page_url} by slugifying the path component and appending the appropriate extension:

Format	Extension
JSON-LD	`.jsonld`
Turtle	`.ttl`
N-Triples	`.nt`
RDF/XML	`.rdf`

Confirm the full saved file path to the user. The session is complete.

Optional HTML Infographic Companion

When the user asks for an HTML infographic in addition to the RDF Knowledge Graph, apply these requirements. For the complete HTML/RDF pairing specification including resolver configuration, navigation panel behavior, localStorage correctness, and validation checklist, see the rdf-infographic-skill SKILL.md.

Output Paths

Save RDF documents to {rdf-output-directory} and HTML infographics to {html-output-directory}. Resolve these placeholders from explicit user instructions, current session preferences, or skill defaults; do not hard-code a personal filesystem path into the reusable skill guidance.
When no destination has been provided, ask for the output directories or use an already-established session default, then confirm the resolved full file paths.

Entity IRIs and Resolver Links

Use {page_url} as the source-grounded namespace for generated entity IRIs. Do not use file: scheme IRIs when a canonical HTTP/HTTPS page URL exists.
Resolver priority: URIBurner (https://linkeddata.uriburner.com/describe/?uri={entity-iri}) by default; user-designated resolver if specified; or none if user explicitly opts out.
Encode # as %23 in resolver uri parameter values exactly once. %2523 (double-encoded) is invalid.
Entity links must open a new tab or view using target="_blank" rel="noopener noreferrer".
FAQ questions, FAQ answers, glossary terms, glossary definitions, HowTo section title, and every HowTo step heading are ALL hyperlinked to their KG entity IRIs via the resolver pattern.
Local KG entities (hash-based IRIs) route through resolver. LOD Cloud cross-references (DBpedia, Wikidata) link directly — they already resolve natively.

POSH and JSON-LD Metadata

Indicate the associated RDF document using <link rel="related" href="../rdf/{rdf-file}" type="text/turtle">.
Embed a JSON-LD structured-data island with a WebPage node. schema:relatedLink must use IRI form: {"@id": "../rdf/{rdf-file}"} — not a plain string literal.
prov:wasGeneratedBy must reference a schema:SoftwareApplication entity for each skill used, with schema:name, schema:url (GitHub), and schema:description.
Skills attribution line in the HTML footer: Generated using <a href="https://github.com/OpenLinkSoftware/ai-agent-skills/tree/main/{skill-name}">skill-name</a>

Navigation, Theme, and Validation

Collapse-to-header-bar floating navigation: always-visible compact header, toggle button (− / +), draggable, resizable, closed by default.
Never persist collapsed dimensions as open dimensions in localStorage. Recover from stale or corrupt values. Use page-specific keys.
Dark mode: html[data-theme="dark"] and @media (prefers-color-scheme: dark) must produce equivalent rendering. All colors via CSS variables — no hardcoded hex/rgba values outside :root.
GATE: 0 failures required before delivery. Validate: HTML parse, JS syntax, RDF parse + compliance audit, resolver-link validity, local RDF link existence, nav behavior, skills attribution, dark mode consistency.

Tools Reference

Tool	Role
`OAI.DBA.getSkillResource`	Load this skill's content and the prompt template
`OAI.DBA.chatPromptComplete`	Apply the prompt template to generate RDF
(file-writing tool)	Write the approved RDF to the designated folder

Execution Routing

Native OAI.DBA tool execution — call OAI.DBA.* tools directly
URIBurner / Demo REST function execution — via REST API endpoint
Terminal-owned OAuth flow — when the endpoint requires OAuth 2.0 authentication, execute the flow from the terminal (authorization code, client credentials, or device flow), capture the Bearer token, and inject via Authorization: Bearer {token} into subsequent REST/OpenAPI calls
MCP — via streamable HTTP or SSE
OPAL Agent routing — via canonical OPAL-recognizable function names

If the user explicitly names a protocol, honor that preference.

Operational Rules

Send the opening announcement before any tool call. After getSkillResource, the next action is the announcement text — no tool call.
All four session variables must be confirmed before Step 2. Never assume {page_url} or {destination} without explicit user confirmation. For file: source URLs, always ask whether to use the file: URL or a canonical HTTP URL as @base.
Format defaults are JSON-LD and Turtle. Always offer these two. Honor any other format if explicitly stated by the user.
Post-generation review is mandatory. Step 3 cannot be skipped. All four sub-tasks must be executed before saving.
Never add unapproved content. Additional Q&A, defined terms, howtos, and entity types must be presented for approval before being included in the output.
Never fabricate IRIs. All IRIs must be derived from {page_url} as @base, from existing hyperlinks in the source document, or from confident external sources (DBpedia, Wikidata, Wikipedia). Do not invent IRIs.
External IRIs must be fully expanded. DBpedia (http://dbpedia.org/resource/...), Wikidata (http://www.wikidata.org/entity/...), and Wikipedia (https://en.wikipedia.org/wiki/...) references must use their full IRI form — never CURIEs or prefixed names. Only schema.org terms may use the schema: prefix.
Smart quotes must be replaced with single quotes. Enforce this in Step 3 syntax check.
Inline double quotes in annotation values must become single quotes. Enforce this in Step 3 syntax check.
Filename is derived from {page_url}. Never use a generic or invented filename.
Scope is strictly document → RDF. This skill does not interact with Virtuoso RDF Views, quad maps, or relational database tables.

Preferences

Setting	Value
Style	Clear and concise
IRI construction	Strictly derived from `{page_url}` or known external sources
Format confirmation	Always confirm with user — never assume
Error reporting	Name the step, the issue, and the fix applied
Response scope	Strictly scoped to this 4-step document → RDF pipeline

Index Page Generation

After saving generated files (RDF, JSON-LD, or companion HTML infographics) into a directory, always offer to generate or update index.html, index.css, and index.js for that directory. These provide a dynamic, searchable index with grid, timeline, and table views.

Generator: scripts/index.js Templates: templates/corpus-index.css, templates/corpus-index.js

node scripts/index.js <target-directory>

The index page scans all .html files, extracts metadata (<title>, <meta>, JSON-LD), auto-derives themes from keywords, and renders filterable cards. All links are local file:// references. Confirm the directory with the user before running.