Claw Use — Device Control for AI Agents OpenClaw Skill

Control physical devices over HTTP with unified commands for screen reading, input actions, app launch, navigation, and audio output using the Claw Use proto...

v1.0.0 Recently Updated Updated Today

Installation

clawhub install claw-use

Requires npm i -g clawhub

View on ClawHub Download .zip

139

Downloads

0

Stars

0

current installs

0 all-time

1

Versions

Claw Use — Device Control for AI Agents

Give your AI agent eyes, hands, and a voice on real devices.

Claw Use is a protocol + skill for AI agents to control physical devices over HTTP. The cu CLI provides a unified interface — the same commands work across any device that implements the Claw Use API.

Supported Devices

Platform	Implementation	Status
Android	claw-use-android	✅ Available
iOS	claw-use-ios	🔮 Planned
Desktop	claw-use-desktop	🔮 Planned

Prerequisites

cu CLI installed (ships with claw-use-android, or install standalone)
At least one device running a Claw Use implementation
Device and agent on the same network (or connected via Tailscale)

Setup

            # Add a device with a friendly name
cu add redmi 192.168.0.105 <token>
cu add pixel 100.80.1.10 <token>

# List devices
cu devices
# ▸ redmi  192.168.0.105  online v1.2.0
#   pixel  100.80.1.10    offline

# Switch default
cu use pixel

# Target a specific device
cu -d redmi screenshot
          

Core API (all platforms)

Every Claw Use implementation exposes the same HTTP endpoints:

Perception — read the device

            cu screen              # UI tree (semantic: element text, bounds, state)
cu screen -c           # compact mode (interactive elements only)
cu screenshot          # visual capture (JPEG, configurable quality)
cu notifications       # system notifications
cu status              # device health dashboard
          

Action — control the device

            cu tap <x> <y>         # tap coordinates
cu click <text>        # tap by visible text (semantic click)
cu type "text"         # type text (CJK supported)
cu swipe up|down|left|right
cu scroll up|down|left|right
cu back / cu home      # system navigation
cu launch <app>        # open an application
cu open <url>          # open URL
cu intent '<json>'     # platform-specific intent (Android)
          

Audio

            cu tts "hello"         # speak through device speaker
cu say "你好"          # alias
          

Device State

            cu wake                # wake screen
cu lock / cu unlock    # lock/unlock (PIN required for unlock)
          

Workflow Patterns

Navigate and interact

            cu launch org.telegram.messenger
cu screen -c                        # see what's on screen
cu click "Search"
cu type "John"
cu click "John, last seen recently"
cu type "Hey!"
cu click "Send"
          

Visual + semantic dual-channel

            cu screen -c                         # semantic: what elements exist
cu screenshot 50 720 /tmp/look.jpg   # visual: what it actually looks like
          

Multi-device orchestration

            cu -d phone1 launch com.whatsapp
cu -d phone2 screenshot
cu -d tablet open "https://example.com"
          

For Agent Developers

Claw Use is designed as a protocol, not just an app. To add support for a new platform:

Implement the Claw Use HTTP API spec
Expose endpoints on a configurable port (default: 7333)
Support token auth via X-Bridge-Token header
Return JSON responses matching the documented schemas

The cu CLI and this skill work automatically with any compliant implementation.

Tips

cu screen -c is the primary perception tool — compact mode filters noise
cu click by text is more reliable than cu tap when text is visible
cu screenshot when you need visual context the UI tree can't capture
Auto-unlock is transparent: locked devices auto-unlock before any command
Combine with Tailscale for remote access from anywhere

Statistics

Downloads 139

Stars 0

Current installs 0

All-time installs 0

Versions 1

Comments 0

Created Mar 16, 2026

Updated Apr 4, 2026

Author

傅洋

@4ier

Latest Changes

v1.0.0 · Mar 16, 2026

Initial release: cu CLI + skill for controlling Android phones (and future devices) via HTTP API. 25 endpoints covering screen reading, screenshots, tap/click/type/swipe, TTS, intents, notifications, and device management. Multi-device support with named aliases.

Quick Install

clawhub install claw-use

Related Skills

Other popular skills you might find useful.

Agent Browser

MaTriXy

Headless browser automation CLI optimized for AI agents with accessibility tree snapshots and ref-based element selection

69.1k 248 v0.1.0

Browser Automation

peytoncasper

Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.

31.8k 46 v1.0.1

Code

Iván

Coding workflow with planning, implementation, verification, and testing for clean software development.

18.2k 35 v1.0.4

Agent Browser - Stagehand

peytoncasper

Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.

6.3k 4 v1.0.0

Browse all skills →

Made in Europe

Chat with 100+ AI Models in one App.

Use Claude, ChatGPT, Gemini alongside with EU-Hosted Models like Deepseek, GLM-5, Kimi K2.5 and many more.

Start for free View pricing