一键在 Manus 中运行任何 Skill

开始使用

hanmoto

版元ドットコム Playwright scraper for Taiwan-related books (Publication Intel v3.1)

在 Manus 中运行

概览

版元ドットコム Playwright scraper for Taiwan-related books (Publication Intel v3.1)

安装命令

npx skills add https://github.com/TuiTuiKoan/Tokyo_Taiwan_Radar --skill hanmoto

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

TuiTuiKoan/Tokyo_Taiwan_Radar

星标1

分支0

更新时间2026年6月3日 17:39

文件资源管理器

2 个文件

SKILL.md

readonly

name	hanmoto
description	版元ドットコム Playwright scraper for Taiwan-related books (Publication Intel v3.1)

hanmoto Scraper

機能説明

来源名: hanmoto
URL: https://www.hanmoto.com/bd/search/keyword/台湾/sdate_desc
資料類型: 版元ドットコムの台湾関連書籍（発売日降順検索）

版元ドットコムは日本出版社の統合書誌データベース。keyword=台湾 + sdate_desc（発売日降順）で台湾関連書籍を最大 3 ページ取得する。

技術規格

項目	詳細
エンジン	Playwright Chromium (headless=True)
ページング	`/order/desc/offset/{offset}`（20件/ページ、最大 3 ページ）
カード selector	`div.booklist-item` → `ul.bookList > li` → `div.list-item` → `.booklist .item`（優先度順）
`source_id` 形式	`hanmoto_{isbn13}` または `hanmoto_{md5(detail_url)[:12]}`
ISBN 取得	`data-isbn` 属性 → href パスの `/isbn/{13桁}` 部分
発売日形式	`YYYY年MM月DD日` または `YYYY-MM-DD`

selector 多段フォールバック

hanmoto.com のマークアップは変更されることがあるため、カード検出に複数 selector を試みる仕様。ページに表示されるカード数が _PAGE_SIZE(=20) 未満の場合はページング終了。

_CARD_SELECTORS = [
    "div.booklist-item",
    "ul.bookList > li",
    "div.list-item",
    ".booklist .item",
]

実際の selector が変わった場合: SKILL.md の _CARD_SELECTORS を更新し history.md に記録する。

来源分流説明

sdate_desc ソート

hanmoto は発売日降順でソートされるため、最初のページに最新書籍が並ぶ。過去書籍は 3 ページ（60 件）を超えると自然にフォールオフする（アーカイブ挙動）。 start_date が過去日付の書籍は Active ビューから外れるが is_active=true は維持される（仕様どおり）。

server-side 台湾フィルタ

版元ドットコムは keyword=台湾 でサーバー側フィルタを実施済みだが、 server 側のフィルタが崩れた場合に備えて client-side フィルタも必ず実施する。

ZERO_EVENT_OK について

hanmoto は server-side で台湾検索済みのため、0 件は異常（scraper 不具合の可能性）。 ZERO_EVENT_OK_SOURCES には 追加しない。

特殊規則

出版事件欄位模板: location_name / location_address / business_hours / price_info 統一填 新書購買請洽各通路，performer 填作者，organizer は出版社名，event_form = ["publication"]。
null-byte strip 必須
tzinfo=timezone.utc: datetime(y, m, d, tzinfo=timezone.utc) を使用
name_ja_locked = True
organizer_type = ["media"] 固定（各書籍の出版社名は organizer フィールドに格納）

既知の問題

NDL ↔ hanmoto 重複: 同一書籍が両ソースに登場する場合がある（既知の非バグ、少量の重複は許容）
selector ドリフト: hanmoto のマークアップ変更で 0 件になった場合は selector を更新する

同仓库更多 Skills

同仓库

engineer

TuiTuiKoan/Tokyo_Taiwan_Radar

Implementation rules for database migrations, Python scrapers, and Next.js web for the Engineer agent

2026-06-031

scraper-expert

TuiTuiKoan/Tokyo_Taiwan_Radar

BaseScraper contract, field rules, and Peatix-specific conventions for the Scraper Expert agent

2026-06-031

kawade-rss

TuiTuiKoan/Tokyo_Taiwan_Radar

河出書房新社 RDF/RSS 1.0 scraper for Taiwan-related books and events (Publication Intel v3.1)

2026-06-031

ndl-opensearch

TuiTuiKoan/Tokyo_Taiwan_Radar

NDL OpenSearch API scraper for Taiwan-related books (Publication Intel v3.1)

2026-06-031

scraper-expert

TuiTuiKoan/Tokyo_Taiwan_Radar

BaseScraper contract, field rules, and Peatix-specific conventions for the Scraper Expert agent

2026-06-031

google-news-rss

TuiTuiKoan/Tokyo_Taiwan_Radar

Platform rules, Taiwan filter, date extraction, and known quirks for the Google News RSS scraper

2026-06-021

来源

TuiTuiKoan

TuiTuiKoan/Tokyo_Taiwan_Radar

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

软件开发工程师计算机与数学类职业15-1252L4

name	hanmoto
description	版元ドットコム Playwright scraper for Taiwan-related books (Publication Intel v3.1)

hanmoto Scraper

機能説明

来源名: hanmoto
URL: https://www.hanmoto.com/bd/search/keyword/台湾/sdate_desc
資料類型: 版元ドットコムの台湾関連書籍（発売日降順検索）

版元ドットコムは日本出版社の統合書誌データベース。keyword=台湾 + sdate_desc（発売日降順）で台湾関連書籍を最大 3 ページ取得する。

技術規格

項目	詳細
エンジン	Playwright Chromium (headless=True)
ページング	`/order/desc/offset/{offset}`（20件/ページ、最大 3 ページ）
カード selector	`div.booklist-item` → `ul.bookList > li` → `div.list-item` → `.booklist .item`（優先度順）
`source_id` 形式	`hanmoto_{isbn13}` または `hanmoto_{md5(detail_url)[:12]}`
ISBN 取得	`data-isbn` 属性 → href パスの `/isbn/{13桁}` 部分
発売日形式	`YYYY年MM月DD日` または `YYYY-MM-DD`

selector 多段フォールバック

_CARD_SELECTORS = [
    "div.booklist-item",
    "ul.bookList > li",
    "div.list-item",
    ".booklist .item",
]

実際の selector が変わった場合: SKILL.md の _CARD_SELECTORS を更新し history.md に記録する。

来源分流説明

sdate_desc ソート

server-side 台湾フィルタ

版元ドットコムは keyword=台湾 でサーバー側フィルタを実施済みだが、 server 側のフィルタが崩れた場合に備えて client-side フィルタも必ず実施する。

ZERO_EVENT_OK について

hanmoto は server-side で台湾検索済みのため、0 件は異常（scraper 不具合の可能性）。 ZERO_EVENT_OK_SOURCES には 追加しない。

特殊規則

出版事件欄位模板: location_name / location_address / business_hours / price_info 統一填 新書購買請洽各通路，performer 填作者，organizer は出版社名，event_form = ["publication"]。
null-byte strip 必須
tzinfo=timezone.utc: datetime(y, m, d, tzinfo=timezone.utc) を使用
name_ja_locked = True
organizer_type = ["media"] 固定（各書籍の出版社名は organizer フィールドに格納）

既知の問題

NDL ↔ hanmoto 重複: 同一書籍が両ソースに登場する場合がある（既知の非バグ、少量の重複は許容）
selector ドリフト: hanmoto のマークアップ変更で 0 件になった場合は selector を更新する