DatasztimhddFree

zhihu-haowen-enrich

Use this skill when the orchestrator (typically the `enrich_article` skill) asks for a Zhihu 好问 AI-synthesized answer for a single technical question. Trigger: the outer skill body invokes `/zhihu-haowen-enrich` with one argument — the question text (Chinese ok). This skill drives zhida.zhihu.com through a 10-step CDP flow using the `browser_navigate`, `browser_evaluate`, and `browser_click` tools, extracts the AI summary + best-cited Zhihu source URL, and writes the result to `$ENRICHMENT_DIR/<article_hash>/<q_idx>/haowen.json`. The skill is ALSO responsible for login-wall recovery: if Zhihu shows a QR login screen, the skill screenshots the QR, sends it to the user via the `send_message` Telegram tool using the `MEDIA:<path>` convention, and pauses for a user `/resume` reply. On resume, it retries from step 3. Do NOT use this skill for: - WeChat article ingestion (use `omnigraph_ingest`) - General web scraping (no CDP orchestration outside the 好问 flow) - Asking the user a question (this skill runs unattended; user interaction only via the login-wall Telegram branch)

Repo bundle on Versuzsztimhdd/OmniGraph-Vault9 indexed entries (SKILL.md and CLAUDE.md) from this repository — open the full bundle view.

Open bundle →

View on GitHub ↗</>github.com/sztimhdd/OmniGraph-Vault Yours? Claim it ↗

§ 01 — Stats

Prior1640

Quality70.0

Score—

Tasks—

§ 02 — Install

Get zhihu-haowen-enrich.

Free SKILL.md scraped from GitHub. Clone the repo or copy the file directly into your Claude Code skills directory.

One-line install · Claude Code

$npx versuz@latest install sztimhdd-omnigraph-vault-skills-zhihu-haowen-enrich

Or clone the repo

$git clone https://github.com/sztimhdd/OmniGraph-Vault.git

Or copy the SKILL.md manually

$cp OmniGraph-Vault/SKILL.MD ~/.claude/skills/sztimhdd-omnigraph-vault-skills-zhihu-haowen-enrich/SKILL.md

More Versuz picks

★ Featured$1.99

vz-bench-debug

Document

★ Featured$0.99

vz-scrape-runner

Web

Got something better ?Submit your skill — it enters tomorrow's cycle. No fee.

Submit yours →

§ 05 — Challenge

Think you can beat it?

$npx versuz challenge sztimhdd-omnigraph-vault-skills-zhihu-haowen-enrich↵

Show SKILL.md content (~3.1k tokens)

---
name: zhihu-haowen-enrich
description: |
  Use this skill when the orchestrator (typically the `enrich_article` skill)
  asks for a Zhihu 好问 AI-synthesized answer for a single technical question.
  Trigger: the outer skill body invokes `/zhihu-haowen-enrich` with one argument —
  the question text (Chinese ok).

  This skill drives zhida.zhihu.com through a 10-step CDP flow using the
  `browser_navigate`, `browser_evaluate`, and `browser_click` tools, extracts the
  AI summary + best-cited Zhihu source URL, and writes the result to
  `$ENRICHMENT_DIR/<article_hash>/<q_idx>/haowen.json`.

  The skill is ALSO responsible for login-wall recovery: if Zhihu shows a QR
  login screen, the skill screenshots the QR, sends it to the user via the
  `send_message` Telegram tool using the `MEDIA:<path>` convention, and pauses
  for a user `/resume` reply. On resume, it retries from step 3.

  Do NOT use this skill for:
  - WeChat article ingestion (use `omnigraph_ingest`)
  - General web scraping (no CDP orchestration outside the 好问 flow)
  - Asking the user a question (this skill runs unattended; user interaction
    only via the login-wall Telegram branch)
compatibility: |
  Requires: CDP-reachable Edge browser at CDP_URL (default http://localhost:9223),
  Zhihu session cookies (or user scans QR via Telegram), Hermes `send_message` tool,
  environment variables ARTICLE_HASH and Q_IDX passed by the outer skill.
metadata:
  openclaw:
    os: ["linux", "darwin"]
    requires:
      bins: ["python"]
      config: ["GEMINI_API_KEY", "TELEGRAM_BOT_TOKEN", "CDP_URL"]
---

# zhihu-haowen-enrich

**Purpose**: For ONE question, drive zhida.zhihu.com, extract the AI summary and
best-cited Zhihu source URL, write `haowen.json` to disk. Called by the
`enrich_article` skill, once per question.

## Inputs (read from outer-skill environment)

| Variable | Required | Purpose |
|----------|----------|---------|
| `ARTICLE_HASH` | yes | WeChat article hash (determines output subdirectory) |
| `Q_IDX` | yes | Question index (0, 1, or 2) |
| `QUESTION` | yes | The question text to search on 好问 |
| `ENRICHMENT_DIR` | optional | Override base dir; defaults to `~/.hermes/omonigraph-vault/enrichment` |

If any required variable is missing, write an error haowen.json and return.

## Output

Always writes a single file: `$ENRICHMENT_DIR/$ARTICLE_HASH/$Q_IDX/haowen.json`.

On success:
```json
{
  "question": "<input QUESTION>",
  "summary": "<AI-synthesized answer text>",
  "best_source_url": "<https://www.zhihu.com/question/.../answer/... OR https://zhuanlan.zhihu.com/p/...>",
  "timestamp": "<ISO 8601 UTC>"
}
```

On failure (any step):
```json
{
  "question": "<input QUESTION>",
  "error": "<short reason: login_wall_timeout | search_failed | ai_timeout | no_sources | bad_url | ...>",
  "timestamp": "<ISO 8601 UTC>"
}
```

## Decision Tree

### Step 1 — Navigate

Use `browser_navigate` to `https://zhida.zhihu.com/`. Wait for the `load` event.

On network error or CN-block: write failure haowen.json with
`error: "navigation_failed: <message>"` and return.

### Step 2 — Login-wall detection (D-13)

Wait 2 seconds for page to stabilize, then check:

- Is the current URL `zhihu.com/signin` or `zhihu.com/login`?
- Is there a visible element with text "登录" inside a modal?
- Is there a visible QR-code image element (aspect ratio ~1:1)?

If ANY of the above is true → login wall detected:

1. Use `browser_evaluate` with a screenshot-of-element script to save the QR to
   `$ENRICHMENT_DIR/$ARTICLE_HASH/$Q_IDX/zhihu_login_qr.png`. Ensure the directory
   exists first.
2. Call the `send_message` tool with this exact message body:
   ```
   MEDIA:<absolute path to zhihu_login_qr.png>

   Zhihu login expired on the remote browser. Scan the QR on your phone with
   the Zhihu mobile app to re-authenticate, then reply `/resume` to continue
   enrichment for question: "<QUESTION>".
   ```
   Use the default `send_message` target (Hermes resolves via
   `_get_cron_auto_delivery_target`; FR-20 default).
3. Pause and wait for user `/resume`. This is the standard Hermes pause pattern
   — do not implement custom polling.
4. On resume, reload the page (`browser_navigate` same URL) and continue from
   step 3 below.

If the login wall persists after resume (QR scan didn't work, or user canceled),
write failure haowen.json with `error: "login_wall_timeout"` and return.

### Step 3 — Find search entry

Use role-based querying (NOT CSS selectors — Zhihu uses auto-generated class hashes):
- Query for an element with role=searchbox OR a contenteditable div (Draft.js editor).
- Wait until the element is visible AND enabled.

If not found after 10 seconds: write failure haowen.json with
`error: "search_entry_not_found"` and return.

### Step 4 — Enter question text

Focus the editor. Draft.js does not accept direct `value=` assignment. Use one
of these approaches in order:

1. Click the "新对话" button if visible — this opens a fresh input area that
   accepts `browser_type` directly.
2. Call `document.execCommand('insertText', false, <QUESTION>)` via `browser_evaluate`.
3. Dispatch an InputEvent with `{inputType: 'insertText', data: <QUESTION>, bubbles: true}`
   on the contenteditable element.

**DO NOT use `.innerText =` or `.innerHTML =` assignment** — Draft.js state will
not update and the value is silently ignored on submit. This is a confirmed failure
mode (verified 2026-04-27).

Verify the text appears in the editor before proceeding. If no approach inserts
the text successfully: write failure haowen.json with
`error: "question_input_failed"` and return.

### Step 5 — Submit

Press Enter on the focused editor, OR click the element with text "搜索" /
role=button. Wait for a URL change or a results panel to appear.

If submit does nothing after 2 attempts: write failure haowen.json with
`error: "submit_ignored"` and return.

### Step 6 — Wait for AI summary

Poll the page for the sentinel text `完成回答` (AI-generation complete) OR a
streaming-complete DOM state. Timeout: 120 seconds
(env `ENRICHMENT_HAOWEN_TIMEOUT`, default 120).

If timeout reached: write failure haowen.json with
`error: "ai_summary_timeout"` and return.

### Step 7 — Extract summary

Use `browser_evaluate`:
```js
document.querySelector('[role=main] article').innerText
```
Or a broader selector if that's empty. Verify the summary is non-empty
and does not contain error or placeholder text.

If the summary is empty after selector expansion: write failure haowen.json with
`error: "empty_summary"` and return.

### Step 8 — Expand source panel

Click the button: `[data-testid="Button:reference_card_block_more_btn"]` (class: `css-175oi2r.r-1loqt21.r-1otgn73`).
Wait for the panel to render (~2s), then verify cards appear: `document.querySelectorAll('[data-testid="Card:reference_card"]')`.

If no source panel is found: write failure haowen.json with `error: "no_sources"` and return.

### Step 9 — Pick best source card

Parse the visible source cards using the stable `data-testid="Card:reference_card"` selector.
Each card contains: title (h3/h4), author name, follower count, like count.
Heuristic (in order):
1. Title contains ≥1 keyword from `<QUESTION>` (tokenize on whitespace)
2. Highest combined 点赞 + 关注 count
3. Falls back to the first card

Skip any card that is an advertisement.

If no card survives the filter: write failure haowen.json with `error: "no_source_cards"` and return.

### Step 10 — Click card and extract final URL

Cards are React components with onClick handlers — standard
`browser_click`/DOM `.click()`/synthetic events are rejected because
React checks `event.isTrusted`.

**Verified working method (2026-04-27, Edge 148 + zhida.zhihu.com):**

1. Find the card's **numbered badge**: `card.querySelector('span.css-1jxf684')`
   — this is the small `<span>` containing the card number ("4 ") inside the
   title div, colored purple (`color: rgb(90, 77, 248)`). This span is the
   **only** child element whose onClick triggers navigation. The card body,
   title div text, avatar, and follow button do NOT trigger navigation.

2. Get the span's center coordinates via
   `browser_evaluate: span.getBoundingClientRect()`. The span is small
   (~11×17px) — precision matters.

3. Use `browser_cdp` with `Input.dispatchMouseEvent` at those exact
   coordinates. Send the full sequence: `mouseMoved → mousePressed →
   mouseReleased`. Use the **main zhida tab's target_id** (NOT a new tab).

   Example CDP calls:
   ```
   # Get main tab ID first
   browser_cdp(method="Target.getTargets", params={})
   → find entry with url matching "zhida.zhihu.com/search/" → target_id

   # Move, press, release at span center
   browser_cdp(method="Input.dispatchMouseEvent",
     params={type:"mouseMoved", x:cx, y:cy, button:"left", clickCount:1},
     target_id=main_tab_id)
   browser_cdp(method="Input.dispatchMouseEvent",
     params={type:"mousePressed", x:cx, y:cy, button:"left", clickCount:1},
     target_id=main_tab_id)
   browser_cdp(method="Input.dispatchMouseEvent",
     params={type:"mouseReleased", x:cx, y:cy, button:"left", clickCount:1},
     target_id=main_tab_id)
   ```

4. The card click opens a **new browser tab**. Wait ~2s, then call
   `browser_cdp(method="Target.getTargets", params={})` again. Find the
   new tab entry — it has:
   - `type: "page"`
   - `openerId` matching the main zhida tab's `targetId`
   - `url` containing `zhuanlan.zhihu.com/p/` or `zhihu.com/question/`
   - `title` containing the card's article title

5. Extract the URL from the new tab entry's `url` field. Do NOT navigate
   the CDP session to the new tab — just read the URL.

**Card quality pre-filter (Step 9.5):** Before clicking, prefer cards
that have follower/like engagement data (知乎原生作者). Cards from
什么值得买/CSDN/AtomGit (no engagement, source indicator in title) are
third-party sources without zhihu URLs — skip them. Only click cards
from zhihu authors.

**Fallback (if span click produces no new tab after 2 attempts):**
Use the `web_search` tool with `site:zhihu.com "{card_title}"`. Extract
the best matching `zhuanlan.zhihu.com/p/...` URL. This works reliably
for zhihu author cards.

### Finalize

Write `haowen.json` to `$ENRICHMENT_DIR/$ARTICLE_HASH/$Q_IDX/haowen.json` with
fields per the Output section above. Include the ISO 8601 UTC timestamp.

The outer `enrich_article` skill reads this file after this skill returns.
Do NOT print the JSON to stdout — write to disk only (D-03 contract).

## Error Handling Summary

| Error code | When |
|------------|------|
| `navigation_failed` | Step 1 network/CN block |
| `login_wall_timeout` | Step 2 — Telegram retry failed or user canceled |
| `search_entry_not_found` | Step 3 — DOM pattern drifted |
| `question_input_failed` | Step 4 — editor rejected all insertion methods |
| `submit_ignored` | Step 5 — submit had no effect after 2 attempts |
| `ai_summary_timeout` | Step 6 — AI generation exceeded 120s |
| `empty_summary` | Step 7 — extracted text was blank |
| `no_sources` | Step 8 — source panel not found |
| `no_source_cards` | Step 9 — all cards were ads or filtered |
| `bad_source_url: <url>` | Step 10 — URL was not a valid Zhihu answer/column |

All errors are WRITTEN to haowen.json; the skill itself exits cleanly so the
outer skill's for-loop keeps iterating.

## References

See `references/flow.md` for per-step selector strategy and empirical refinements
captured during real-world runs.

## Related Skills

- Orchestrator: `enrich_article` (calls this skill once per question)
- Related Python helper: `enrichment/fetch_zhihu.py` (runs AFTER this skill, on
  the resulting best_source_url)