Global Rank: #601 of 601 Skills

image-video-gen AI Agent Skill

View Source: bytedance/agentkit-samples

Medium

Installation

npx skills add bytedance/agentkit-samples --skill image-video-gen

Installs: 47

Image Video Tool Workflow

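Description

This is an agent workflow for generating images and videos. It coordinates the byted-web-search, image-generate, and video-generate tools to complete the task.

Skill Dependencies

Workflow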

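  1. Understand the user's intent

    • Receive the text description provided by the user.
    • If the input is a story or plot, call the byted-web-search tool directly to retrieve background information.
    • If the input is of another type (such as a question or a request), first call the byted-web-search tool (at most 2 times) to find suitable information.
  2. Generate images

    • Using the prepared background information, call the image-generate tool to generate storyboard images.
    • Return the generated images as a Markdown image list, for example:
    ![Storyboard image 1](https://example.com/image1.png)
  3. Generate videos (optional)

    • Based on the user's input, decide whether the video-generate tool needs to be called to generate a video.
    • When returning video URLs, use a Markdown list of video links, for example:
    <video src="https://example.com/video1.mp4" width="640" controls>Storyboard video 1</video>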
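
The three workflow steps can be sketched roughly as follows. This is only an illustration: the skill ships no Python code, and the agent invokes the real tools itself. The names `run_workflow`, `search`, `generate_images`, and `generate_videos` are hypothetical stand-ins, as is the rule of stopping the search loop as soon as a non-empty result comes back.

```python
from typing import Callable, List

# Hypothetical stand-ins for the byted-web-search, image-generate and
# video-generate tools. The real tools are called by the agent, not via Python.
SearchTool = Callable[[str], str]
MediaTool = Callable[[str], List[str]]

MAX_SEARCHES = 2  # the workflow allows at most two searches for non-story input


def run_workflow(
    user_input: str,
    is_story: bool,
    want_video: bool,
    search: SearchTool,
    generate_images: MediaTool,
    generate_videos: MediaTool,
) -> List[str]:
    """Return the Markdown/HTML lines that would be sent back to the user."""
    # Step 1: collect background information.
    # Assumption: one search for a story, up to two otherwise,
    # stopping early once a non-empty result is returned.
    budget = 1 if is_story else MAX_SEARCHES
    background = ""
    for _ in range(budget):
        background = search(user_input)
        if background:
            break

    prompt = f"{user_input}\n\nBackground:\n{background}"

    # Step 2: generate storyboard images and render them as Markdown images.
    lines = [
        f"![Storyboard image {i}]({url})"
        for i, url in enumerate(generate_images(prompt), start=1)
    ]

    # Step 3 (optional): generate videos and render them as <video> elements.
    if want_video:
        lines += [
            f'<video src="{url}" width="640" controls>Storyboard video {i}</video>'
            for i, url in enumerate(generate_videos(prompt), start=1)
        ]
    return lines
```

Each URL is interpolated into the output exactly as the tool returned it, which matches the requirement that links are passed through untouched.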

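Notes

  • This skill has no Python execution script of its own (the scripts/ directory contains no scripts).
  • It works by coordinating other atomic skills.
  • Any image or video URL that appears in inputs or outputs must never be modified, truncated, concatenated, or replaced in any form; the original content must be kept 100% complete and accurate.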
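
One way to check the URL-integrity rule after the fact is to extract the URLs from the rendered output and compare them with what the tools returned. The sketch below is an assumption-laden illustration, not part of the skill: `extract_urls` and `urls_unchanged` are hypothetical helpers, and the pattern only covers the two output shapes shown in the workflow (Markdown images and `src="..."` attributes) with URLs that contain no closing parenthesis.

```python
import re
from typing import List

# Matches the URL in a Markdown image "![alt](URL)" or in an HTML src="URL"
# attribute. Assumption: Markdown URLs contain no ")" or whitespace.
_URL_PATTERN = re.compile(r'!\[[^\]]*\]\(([^)\s]+)\)|src="([^"]+)"')


def extract_urls(rendered: str) -> List[str]:
    """Return every image/video URL found in the rendered output, in order."""
    return [md or html for md, html in _URL_PATTERN.findall(rendered)]


def urls_unchanged(original_urls: List[str], rendered: str) -> bool:
    """True only if the output contains exactly the original URLs, in order."""
    return extract_urls(rendered) == list(original_urls)
```

Comparing whole extracted URLs rather than doing a substring search catches truncation as well as appended or spliced content.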


Security Audit

ath Safe
socket Safe
Alerts: 0 Score: 90
snyk Medium

How to use this skill

  1. Install image-video-gen by running npx skills add bytedance/agentkit-samples --skill image-video-gen in your project directory. The skill file will be downloaded from GitHub and placed in your project.

  2. No configuration needed. Your AI agent (Claude Code, Cursor, Windsurf, etc.) automatically detects installed skills and uses them as context when generating code.

  3. The skill enhances your agent's understanding of image-video-gen, helping it follow established patterns, avoid common mistakes, and produce production-ready output.

What you get

Skills are plain-text instruction files — not executable code. They encode expert knowledge about frameworks, languages, or tools that your AI agent reads to improve its output. This means zero runtime overhead, no dependency conflicts, and full transparency: you can read and review every instruction before installing.

Compatibility

This skill works with any AI coding agent that supports the skills.sh format, including Claude Code (Anthropic), Cursor, Windsurf, Cline, Aider, and other tools that read project-level context files. Skills are framework-agnostic at the transport level — the content inside determines which language or framework it applies to.

Data sourced from the skills.sh registry and GitHub. Install counts and security audits are updated regularly.
