Global Rank · of 601 Skills
image-video-gen AI Agent Skill
View Source: bytedance/agentkit-samples
MediumInstallation
npx skills add bytedance/agentkit-samples --skill image-video-gen 47
Installs
Image Video Tool Workflow
描述
这是一个用于生成图片和视频的智能体工作流。它协调 byted-web-search, image-generate, 和 video-generate 工具来完成任务。
依赖技能
工作流程
理解用户意图:
- 接收用户输入的文本描述。
- 如果用户输入是故事或情节,直接调用
byted-web-search工具获取背景信息。 - 如果用户输入为其他类型(如问题、请求),则先调用
byted-web-search工具 (最多调用2次),找到合适的信息。
生成图片:
- 根据准备好的背景信息,调用
image-generate工具生成分镜图片。 - 生成后,以 Markdown 图片列表形式返回,例如:
- 根据准备好的背景信息,调用
生成视频 (可选):
- 根据用户输入,判断是否需要调用
video-generate工具生成视频。 - 返回视频 URL 时,使用 Markdown 视频链接列表,例如:
<video src="https://example.com/video1.mp4" width="640" controls>分镜视频1</video>- 根据用户输入,判断是否需要调用
注意事项
- 此技能本身没有 Python 执行脚本 (
scripts/目录下无脚本)。 - 它通过协调其他原子技能来工作。
- 输入输出中,任何涉及图片或视频的链接 url,绝对禁止任何形式的修改、截断、拼接或替换,必须 100% 保持原始内容的完整性与准确性。
Installs
Security Audit
View Source
bytedance/agentkit-samples
More from this source
Power your AI Agents with
the best open-source models.
Drop-in OpenAI-compatible API. No data leaves Europe.
Explore Inference APIGLM
GLM 5
$1.00 / $3.20
per M tokens
Kimi
Kimi K2.5
$0.60 / $2.80
per M tokens
MiniMax
MiniMax M2.5
$0.30 / $1.20
per M tokens
Qwen
Qwen3.5 122B
$0.40 / $3.00
per M tokens
How to use this skill
Install image-video-gen by running npx skills add bytedance/agentkit-samples --skill image-video-gen in your project directory. Run the install command above in your project directory. The skill file will be downloaded from GitHub and placed in your project.
No configuration needed. Your AI agent (Claude Code, Cursor, Windsurf, etc.) automatically detects installed skills and uses them as context when generating code.
The skill enhances your agent's understanding of image-video-gen, helping it follow established patterns, avoid common mistakes, and produce production-ready output.
What you get
Skills are plain-text instruction files — not executable code. They encode expert knowledge about frameworks, languages, or tools that your AI agent reads to improve its output. This means zero runtime overhead, no dependency conflicts, and full transparency: you can read and review every instruction before installing.
Compatibility
This skill works with any AI coding agent that supports the skills.sh format, including Claude Code (Anthropic), Cursor, Windsurf, Cline, Aider, and other tools that read project-level context files. Skills are framework-agnostic at the transport level — the content inside determines which language or framework it applies to.
Chat with 100+ AI Models in one App.
Use Claude, ChatGPT, Gemini alongside with EU-Hosted Models like Deepseek, GLM-5, Kimi K2.5 and many more.