ask-opencli
Ask Grok or Gemini via opencli (browser-session based, no API cost). Use when the user says "问 grok", "问 gemini", "ask grok", "ask gemini", "用 grok 查", "grok 怎么看", "让 gemini 分析", or wants a second opinion from Grok/Gemini without paying API tokens. Routes to opencli which drives the already-logged-in Chrome session.
git clone --depth 1 https://github.com/majiayu000/spellbook /tmp/ask-opencli && cp -r /tmp/ask-opencli/skills/ask-opencli ~/.claude/skills/ask-opencliSKILL.md
# Ask via OpenCLI (Grok / Gemini)
Use `opencli` to query Grok or Gemini through the user's already-logged-in Chrome session. Zero API cost, fully driven by the existing browser session.
> **opencli 项目**:https://github.com/jackwener/opencli
> **安装**:`npm install -g @jackwener/opencli`(需要 Node ≥20)
> **首次运行**:`opencli doctor` 会自动引导安装 Chrome Browser Bridge 扩展
> **登录**:用装了扩展的 Chrome profile 打开 grok.com 和 gemini.google.com 登录一次
> **必须环境变量**:`export OPENCLI_BROWSER_COMMAND_TIMEOUT=300`(写进 shell rc)
> **更深度的多 AI 并行调研工作流**:use the `multi-ai-research` skill instead
**When to trigger**:
- User explicitly asks to query Grok or Gemini ("问一下 grok", "用 gemini 查", "让 grok 看看")
- User wants a second opinion / external advisor from Grok or Gemini
- User says "ask grok"/"ask gemini" or mentions these two as the information source
**When NOT to trigger**:
- User wants to use native Gemini CLI (`gemini -p`) → use `ask-gemini` skill instead
- User wants Claude's own opinion → answer directly
- User wants other AI (Doubao, Yuanbao, ChatGPT, etc.) → use other tools
## Model selection
If the user doesn't specify which model, default to **Grok** for real-time / social / news / tech-trend questions, and **Gemini** for long-form reasoning, research, coding, and document analysis.
If still ambiguous, ask the user which one to use before sending.
## ⚠️ 常见错误命令(直接 fail fast)
| ❌ 错误 | ✅ 正确 | 原因 |
|---|---|---|
| `opencli gemini chat "..."` | `opencli gemini ask "..."` | 没有 `chat` 子命令 |
| `opencli grok chat "..."` | `opencli grok ask "..."` | 只有 `ask` 子命令 |
| `opencli grok new` | `opencli grok ask "..." --new true` | grok 的"新对话"是 `ask` 的参数 |
| `opencli gemini query "..."` | `opencli gemini ask "..."` | 没有 `query` 子命令 |
**完整子命令速查**(运行 `opencli grok --help` 或 `opencli gemini --help` 查看):
```
Grok:
ask <prompt> # 唯一命令
Gemini:
ask <prompt> # 最常用
new # 开新对话
deep-research <prompt> # Deep Research 模式
deep-research-result [query] # 取 Deep Research 结果
image <prompt> # 图片生成
```
## ⚠️ Critical: Two-layer timeout (from 2026-04-08 debugging)
opencli has **two independent timeout layers**, both must be set for grok to wait long enough:
1. **Inner**: `--timeout 300` — grok.com web response wait time
2. **Outer**: `OPENCLI_BROWSER_COMMAND_TIMEOUT=300` environment variable — entire browser command execution timeout (**default 60s**, hardcoded in `runtime.js:25`)
The outer takes **priority over** the inner. Setting only `--timeout 300` still gets killed at 60s. You MUST set both.
Source code: `.omx/reference/opencli/dist/src/runtime.js:25`:
```js
export const DEFAULT_BROWSER_COMMAND_TIMEOUT = parseEnvTimeout('OPENCLI_BROWSER_COMMAND_TIMEOUT', 60);
```
**Recommended**: add `export OPENCLI_BROWSER_COMMAND_TIMEOUT=300` to `~/.zshrc` so every grok/gemini call has room to breathe.
## Commands
### Grok (MANDATORY env var)
```bash
OPENCLI_BROWSER_COMMAND_TIMEOUT=300 opencli grok ask "{{PROMPT}}" --timeout 300 -f json
```
Or if env var already exported in shell:
```bash
opencli grok ask "{{PROMPT}}" --timeout 300 -f json
```
Flags:
- `--new true` — start a fresh chat (avoids polluting prior conversation context)
- `--web true` — use the hardened grok.com consumer web flow (better error messages)
- `--timeout 300` — MUST match or be lower than `OPENCLI_BROWSER_COMMAND_TIMEOUT`
- `-f json` — structured output (recommended over `--format plain` for parsing)
### Gemini
```bash
opencli gemini ask "{{PROMPT}}" --format plain
```
Gemini is more stable than grok and usually doesn't hit the 60s timeout. No env var required but adding it doesn't hurt:
```bash
OPENCLI_BROWSER_COMMAND_TIMEOUT=300 opencli gemini ask "{{PROMPT}}" --format plain
```
Flags:
- `--new true` — start a fresh chat
- `--timeout 90` — extend timeout for long answers (default 60s)
### Parallel dispatch (for multi-AI workflows)
To call grok and gemini in parallel from Claude Code, use the Bash tool with `run_in_background=true` for both calls in the same assistant turn. Claude Code will launch them concurrently and wait for task-completion notifications.
For deep cross-validation workflows (N internal sub-agents + 2 external AIs, confidence-tiered synthesis), **use the `multi-ai-research` skill** instead — it covers the full orchestration pattern.
## Prerequisites (check before running)
1. **opencli installed**: `opencli --version` should print a version.
2. **Chrome running with Browser Bridge extension loaded** — the extension is installed from `~/Desktop/code/AI/tools/opencli/extension/` via `chrome://extensions` → Developer mode → Load unpacked. If the extension is missing, `opencli doctor` will flag it.
3. **Logged in to the target site**:
- Grok: `grok.com` must be logged in
- Gemini: `gemini.google.com` must be logged in
4. **Daemon connectivity**: run `opencli doctor` once per session if unsure.
If any check fails, **do not fall back to another tool silently**. Report the missing prerequisite and ask the user to fix it (per U-23 no-silent-degradation rule).
## Execution protocol
1. Decide which model to use (Grok vs Gemini). If unclear, ask.
2. Build the prompt. Be explicit — these browser UIs have no system prompt so the full question must be self-contained.
3. Run the opencli command with `--format plain`.
4. Capture the response. If the command fails with a session/auth/challenge message, stop and surface the error to the user. Do NOT retry blindly.
5. Save an artifact (see below).
6. Report back to the user: model used, summary of response, and path to the artifact.
## Error handling
opencli will emit specific errors like:
- "Not logged in to grok.com" → ask user to log in
- "Composer not found" → DOM changed, ask user to open the site manually and retry
- "Session gated / challenge" → user needs to solve the challenge in Chrome manually
- Timeout → offer to retry with a longer `--timeout`
Do not swallow these. Report verbatSenior backend TypeScript architect specializing in Bun/Node.js runtime, API design, database optimization, and scalable server architecture.
Expert at exploring and understanding legacy and unfamiliar codebases. Maps dependencies, identifies patterns, and creates documentation for complex systems.
Kubernetes architect specializing in cluster design, manifests, Helm charts, GitOps workflows, security policies, and production operations.
Systematic open source contributor that analyzes projects, finds suitable issues, implements fixes, and creates high-quality PRs with high acceptance probability.
Application security expert specializing in SAST, vulnerability assessment, OWASP Top 10, compliance auditing, and security architecture review.
Fullstack code reviewer with 15+ years experience analyzing code for security vulnerabilities, performance bottlenecks, architectural decisions, and best practices.
Senior technical lead who analyzes complex projects and coordinates multi-step development tasks. Delegates to specialized agents and ensures quality delivery.
Use when the user explicitly asks to stage all current changes, create a commit, and push to the remote after safety checks.