Globales Ranking · von 600 Skills
use-my-browser AI Agent Skill
Quellcode ansehen: xixu-me/skills
CriticalInstallation
npx skills add xixu-me/skills --skill use-my-browser 54.5K
Installationen
Do not treat this skill as a generic browsing default. Route from the evidence you need, not from tool preference.
Every task must be classified before you choose a route:
static-capable: the evidence can be produced without live browser state, visible confirmation, or page interactionbrowser-required: the evidence depends on rendered state, interaction, live session behavior, or browser-only structures
Only static-capable tasks may fall back to static retrieval, curl, or other non-browser paths. Once a task is browser-required, stay on the browser path and mark missing capability as blocked instead of silently downgrading.
Prerequisite check
This skill is for work inside the user's live browser session, not for launching a separate fresh automation browser.
Before doing browser automation, confirm that your environment already has access to a live browser stack that can provide the capabilities the task depends on, such as page inventory, task-owned page creation, page selection, snapshots or visible-state reads, DOM inspection, text or form input, uploads, dialogs, console inspection, and network inspection. The exact stack does not matter here: confirm capability, not brand.
If the live browser stack is unavailable, do not attempt browser automation through this skill. Only static-capable work may fall back to static retrieval.
Live browser automation can trigger anti-bot or anti-automation defenses on some sites. Use browser interaction only when the task truly needs it, and avoid unnecessary repetitive actions once the needed evidence has been obtained.
Experience loop
Treat site patterns as part of the browser protocol, not as optional background reading.
For browser-required work, run this loop:
- As soon as the target domain is known, check whether a matching note already exists under
references/site-patterns/. - If a note exists, read it before the first meaningful browser mutation on that domain.
- During the run, watch for verified site-specific facts that would change how a future run should operate.
- Before you consider the task complete, decide whether the run produced a reusable fact, disproved an existing fact, or produced no reusable site-specific learning.
- If the run verified something reusable or disproved an existing claim, update the matching note before finishing.
Do not create a domain note for one-off noise. Do not skip the end-of-run review just because the task itself succeeded.
Writeback is expected when a run verifies any of the following:
- a stable route shape or required query parameter
- a login, session inheritance, or
isolatedContextquirk - a reliable interaction primitive such as hover, keyboard entry, upload sequencing, or a selector bridge pattern
- a domain where DOM-generated links are reliable but hand-built URLs are not
- predictable anti-automation friction or a misleading platform error state
- a reusable media extraction or iframe / Shadow DOM access pattern
Decision guide
Start with the outcome, not the tool. Make the user's goal explicit, define what counts as done, and choose the cheapest route that can still produce the right evidence.
Use this routing order:
- Decide whether the task is
static-capableorbrowser-required. - If the task is
static-capable, loadreferences/task-routing.mdand stay on the cheapest route that still satisfies the evidence target. - If the task is
browser-required, loadreferences/browser-playbook.md. - If browser-required capability is uncertain in a fresh host session, also load
references/browser-capability-matrix.md. - If the user already has an active browser debugging context, such as a selected inspector element or network request, also load
references/debug-handoff.md. - If the browser-required task touches a logged-in dashboard, admin surface, CMS, editor, or any save / publish / update flow, also load
references/control-plane-workflows.md. - If the current failure shape suggests a soft 404, content-unavailable state, suspicious no-op interaction, auth wall, rate limit, or anti-automation defense, also load
references/anti-automation-friction.md. - If the browser-required task includes iframe, Shadow DOM, collapsed content, or lazy-loaded evidence, also load
references/deep-dom.md. - If the important evidence lives in an image, audio clip, or video, also load
references/media-inspection.md. - If browser work can be divided across independent page owners or sub-agents, also load
references/parallel-browser-ownership.md. - If you already know a reliable selector but need an MCP-native
uidtarget, also loadreferences/selector-bridge.md. - If page actions leave state ambiguous, a page unexpectedly navigates, an old
uidmay have gone stale, or console / network inspection is now needed to explain the next browser decision, also loadreferences/browser-recovery.md. - If the target site already has a matching domain note under
references/site-patterns/, read that note before operating on the site.
Treat the following as browser-required by default:
localhost,127.0.0.1, or benchmark-style local fixtures- uploads, downloads, drag-and-drop, hover, keyboard-native entry, or visible confirmation states
- same-origin iframe inspection, Shadow DOM inspection,
details/ collapsed evidence, or lazy-loaded content - any task where "what the page visibly shows" is itself the evidence
The normal happy path for a common task is this entrypoint plus one or two references, not the entire reference set.
Hard rules
- Use browser interaction only when live browser state is part of the evidence or required action.
- Once a task is
browser-required, do not silently downgrade. - Treat this file as the entrypoint and each reference file as a single-purpose authority. Do not duplicate rules across files.
- Keep reference loading one level deep. Decide the next file from this entrypoint instead of turning one reference into a hub that links to more references.
- Do not ask the user to log in just because a page looks restricted. First confirm whether the target content or action is actually blocked.
- Prefer site-generated DOM links over hand-built URLs once the page has shown you the path it expects.
- Prefer MCP-native actions over script-driven interaction when the task is genuinely an in-browser action.
- Only close pages you created.
- Prefer primary sources over aggregators or repeated secondary reporting.
- If a matching site pattern note exists, read it before the first meaningful browser mutation on that domain.
- Do not finish a
browser-requiredtask without explicitly checking whether the run should create, update, downgrade, or remove a site-pattern claim. - If an existing site-pattern claim fails under comparable conditions, stop trusting it, fall back to the generic workflow, and update the note instead of retrying the stale assumption.
- Do not use
curl,Invoke-WebRequest, or shell HTTP fetches forbrowser-requiredtasks. - Do not treat a generic page-opening tool as evidence that localhost deep interaction is available.
- Do not switch routes just because a browser capability probe failed. Record the missing capability and stop.
- When the user indicates an active browser debugging context, prefer handoff from that current context over fresh reproduction from scratch.
Reference index
references/task-routing.md: static retrieval vs live browser routingreferences/browser-playbook.md: core page-action protocol and base browser loopreferences/browser-capability-matrix.md: capability proof for uncertain host sessionsreferences/debug-handoff.md: active debugging-context handoffreferences/control-plane-workflows.md: logged-in dashboard / CMS save-publish disciplinereferences/anti-automation-friction.md: soft 404 / auth / anti-automation classificationreferences/deep-dom.md: iframe, Shadow DOM, collapsed, or lazy-loaded evidencereferences/media-inspection.md: image, audio, and video evidencereferences/parallel-browser-ownership.md: multi-owner browser coordinationreferences/selector-bridge.md: selector-to-uidbridgingreferences/browser-recovery.md: staleuid, navigation drift, and console / network escalationreferences/site-patterns/README.md: site-pattern note maintenance rules- site-patterns/{domain}.md: existing domain-specific operating knowledge
Installationen
Sicherheitsprüfung
Quellcode ansehen
xixu-me/skills
Mehr aus dieser Quelle
Power your AI Agents with
the best open-source models.
Drop-in OpenAI-compatible API. No data leaves Europe.
Explore Inference APIGLM
GLM 5
$1.00 / $3.20
per M tokens
Kimi
Kimi K2.5
$0.60 / $2.80
per M tokens
MiniMax
MiniMax M2.5
$0.30 / $1.20
per M tokens
Qwen
Qwen3.5 122B
$0.40 / $3.00
per M tokens
So verwenden Sie diesen Skill
Install use-my-browser by running npx skills add xixu-me/skills --skill use-my-browser in your project directory. Führen Sie den obigen Installationsbefehl in Ihrem Projektverzeichnis aus. Die Skill-Datei wird von GitHub heruntergeladen und in Ihrem Projekt platziert.
Keine Konfiguration erforderlich. Ihr KI-Agent (Claude Code, Cursor, Windsurf usw.) erkennt installierte Skills automatisch und nutzt sie als Kontext bei der Code-Generierung.
Der Skill verbessert das Verständnis Ihres Agenten für use-my-browser, und hilft ihm, etablierte Muster zu befolgen, häufige Fehler zu vermeiden und produktionsreifen Code zu erzeugen.
Was Sie erhalten
Skills sind Klartext-Anweisungsdateien — kein ausführbarer Code. Sie kodieren Expertenwissen über Frameworks, Sprachen oder Tools, das Ihr KI-Agent liest, um seine Ausgabe zu verbessern. Das bedeutet null Laufzeit-Overhead, keine Abhängigkeitskonflikte und volle Transparenz: Sie können jede Anweisung vor der Installation lesen und prüfen.
Kompatibilität
Dieser Skill funktioniert mit jedem KI-Coding-Agenten, der das skills.sh-Format unterstützt, einschließlich Claude Code (Anthropic), Cursor, Windsurf, Cline, Aider und anderen Tools, die projektbezogene Kontextdateien lesen. Skills sind auf Transportebene framework-agnostisch — der Inhalt bestimmt, für welche Sprache oder welches Framework er gilt.
Chat with 100+ AI Models in one App.
Use Claude, ChatGPT, Gemini alongside with EU-Hosted Models like Deepseek, GLM-5, Kimi K2.5 and many more.