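Operate the user's real Chrome browser: click, fill in forms, take screenshots, and extract data. Use this skill when the user asks to view web page content, scrape page data, fill in a form, take a screenshot of a web page, or automate the browser.
npx skills add https://github.com/PM-Shawn/Abu-Cowork --skill abu-browser
Copy and paste this command into Claude Code to install the skill.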
| name | Abu-Browser |
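| description | Operate the user's real Chrome browser: click, fill in forms, take screenshots, and extract data. Use this skill when the user asks to view web page content, scrape page data, fill in a form, take a screenshot of a web page, or automate the browser. |
| trigger | The user asks to operate the browser, view or scrape web page content, take a web page screenshot, fill in a web form, click a button on a page, automate the browser, or extract data from the browser |
| do-not-trigger | The user asks for automated testing with Playwright, discusses how browsers work internally, or merely mentions the word "browser" without needing any actual operation |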
| user-invocable | true |
| context | inline |
| tags | ["browser","automation","chrome"] |
!curl -s --max-time 2 http://127.0.0.1:9875/status 2>/dev/null || echo '{"bridge":"offline"}'
Determine the current state from the result returned above, then proceed as follows:
"bridge":"offline" or an empty response → call manage_mcp_server(action: "ensure", name: "abu-browser-bridge")
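connected or reconnected: continue to Step 2
needs_config: tell the user that configuration is missing (this normally does not happen)
install_failed: tell the user that installation failed and that they may need to check their network or Node.js environment
wsPort field → the bridge service is already running; continue to Step 2
extensionConnected: true in the !command output → the environment is ready; skip to "Execute the user's task"
extensionConnected: false → run the extension installation guide:
The Abu installation directory ships with the Chrome extension files. Follow these steps strictly: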
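obtained from the extensionPath line)
I've opened the extension folder for you. Three steps and you're done:
chrome://extensions and press Enter
Prohibited actions: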
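Once the first two steps have passed, briefly tell the user "Connected to your browser", then immediately carry out the user's original request. Do not wait for the user to ask again.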
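As a rough illustration of the setup checks above, the decision logic might look like the following Python sketch. The function name next_action, the action labels, and the ENSURE_RESULT_ACTIONS table are hypothetical and not part of the skill. The sketch also assumes that the /status response is a JSON object and that any response without a wsPort field should be treated as offline, which the skill text does not state explicitly.

```python
import json

# Hypothetical labels for what to do after manage_mcp_server(action: "ensure", ...)
# returns one of the result values named in the skill text.
ENSURE_RESULT_ACTIONS = {
    "connected": "check_extension",
    "reconnected": "check_extension",
    "needs_config": "report_missing_config",
    "install_failed": "report_install_failure",
}


def next_action(raw_status: str) -> str:
    """Map the raw text returned by the /status probe to the next setup action."""
    try:
        status = json.loads(raw_status)
    except json.JSONDecodeError:
        # An empty or unparsable response is handled like {"bridge":"offline"}.
        return "ensure_bridge"

    if not isinstance(status, dict) or status.get("bridge") == "offline":
        return "ensure_bridge"

    if "wsPort" not in status:
        # Assumption: the source does not cover this case, so treat it as offline.
        return "ensure_bridge"

    # The bridge is running; the remaining question is whether the extension is connected.
    if status.get("extensionConnected") is True:
        return "run_user_task"
    return "guide_extension_install"
```

For example, next_action('{"bridge":"offline"}') yields "ensure_bridge", while a response that carries wsPort and extensionConnected: true yields "run_user_task".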
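Decide whether to load a reference document based on the complexity of the task:
Simple tasks (screenshot, reading text, a single click) → execute directly using the quick reference below; do not load any file
Medium tasks (form filling, multi-step interaction) → read_skill_file("guide-form-filling.md")
Complex tasks (batch scraping, cross-page operations) → read_skill_file("guide-scraping.md")
Automation tasks (record and replay, scheduled execution) → read_skill_file("guide-rpa.md")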
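The mapping above is essentially a lookup table. A minimal sketch in Python, where the category keys ("simple", "medium", "complex", "automation") and the function name guide_for are hypothetical labels chosen for illustration; only the three file names come from the skill itself:

```python
from typing import Optional

# Task category → reference file to load with read_skill_file, or None for no file.
GUIDE_BY_TASK_KIND = {
    "simple": None,                     # screenshot, reading text, a single click
    "medium": "guide-form-filling.md",  # form filling, multi-step interaction
    "complex": "guide-scraping.md",     # batch scraping, cross-page operations
    "automation": "guide-rpa.md",       # record and replay, scheduled execution
}


def guide_for(task_kind: str) -> Optional[str]:
    """Return the reference file for a task category; raises KeyError if unknown."""
    return GUIDE_BY_TASK_KIND[task_kind]
```

Raising KeyError for an unknown category is a deliberate choice in this sketch: it avoids silently loading the wrong guide.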
Available reference files (use read_skill_file tool to load when needed):
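abu-browser-bridge__get_tabs lists all tabs
the tab with focused: true, which is the one the user is currently looking at
snapshot returns the page structure and the element ref numbers
tabId must be passed as a number (e.g. 1203797111), without quotes
get_tabs to fetch the latest state; do not reuse stale data
get_tabs → extract_text / extract_table / snapshot
get_tabs → snapshot (to obtain a ref) → click / fill / select
get_tabs → screenshot (visible area) / screenshot_full_page (entire page)
wait_for to wait for the page to change before moving on to the next step
screenshot so the user can confirm
snapshot, because the page may have changed
keyboard shortcuts as a substitute
click to confirm focus, then fill
wait_for with a longer timeout
tools whose names start with abu-browser-bridge__ operate the user's real browser
playwright__ tools: these launch a brand-new blank browser, not the one the user is actually using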
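The tab-selection and tabId rules above can be sketched as follows. This is an illustration only: the shape of the get_tabs result (a list of objects with tabId and focused fields), the fallback to the first tab when none is focused, and the helper names pick_tab_id and tool_args are all assumptions, not the bridge's documented API.

```python
import json


def pick_tab_id(tabs):
    """Choose the focused tab from a (hypothetical) get_tabs result and return its id as an int."""
    if not tabs:
        raise LookupError("get_tabs returned no tabs")
    focused = [tab for tab in tabs if tab.get("focused") is True]
    # Assumption: fall back to the first tab when no tab is marked as focused.
    chosen = focused[0] if focused else tabs[0]
    return int(chosen["tabId"])


def tool_args(tab_id, **extra):
    """Build tool arguments, rejecting a tabId that is not a plain integer."""
    if isinstance(tab_id, bool) or not isinstance(tab_id, int):
        raise TypeError("tabId must be a number, not a quoted string")
    return {"tabId": tab_id, **extra}


tabs = [
    {"tabId": 11, "focused": False},
    {"tabId": 1203797111, "focused": True},
]
args = tool_args(pick_tab_id(tabs))
print(json.dumps(args))  # prints {"tabId": 1203797111}
```

Serialising with json.dumps makes the point of the rule visible: the id appears as a bare number, whereas a string value would be emitted with quotes.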