ai-avatar-video

Name: ai-avatar-video AI Agent Skill
Rating: 4.7 (260 reviews)
Author: inferen-sh


Creates virtual avatar videos with AI, producing realistic character animation and audio-synchronized speech. Typical applications include education, marketing, and entertainment.

Tags: ai-video-generation, digital-avatars, synthesia, deepfake, character-animation

Installation

npx skills add inferen-sh/skills --skill ai-avatar-video

Before / After Comparison

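Before

Live-action video production requires actors, locations, equipment rental, and complex post-production, so it is costly, slow, and hard to revise.

After

With AI avatar video technology, a realistic virtual presenter video can be generated from text or audio input alone. This greatly reduces production cost and time, so companies and individuals can produce large volumes of video content quickly and flexibly, with personalization at scale.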

SKILL.md

ai-avatar-video

AI Avatar & Talking Head Videos

Create AI avatars and talking head videos via inference.sh CLI.

Quick Start

Requires the inference.sh CLI (infsh); see its install instructions.

infsh login

# Create avatar video from image + audio
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'
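
When the image and audio URLs live in shell variables, the single-quoted JSON above cannot interpolate them. One possible workaround is a small helper that assembles the input string. `build_avatar_input` is a hypothetical name, not part of infsh, and the sketch assumes the URLs contain no double quotes or backslashes.

```shell
#!/bin/sh
# Build the --input JSON for an avatar app from an image URL and an audio URL.
# Assumption: neither URL contains a double quote or a backslash,
# so no JSON escaping is applied.
build_avatar_input() {
  printf '{"image_url": "%s", "audio_url": "%s"}\n' "$1" "$2"
}

# Usage (requires a logged-in infsh):
# infsh app run bytedance/omnihuman-1-5 \
#   --input "$(build_avatar_input "$IMAGE_URL" "$AUDIO_URL")"
```

The same helper should work for any app in this document that takes the `image_url` and `audio_url` pair.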

Available Models

| Model | App ID | Best For |
|---|---|---|
| OmniHuman 1.5 | bytedance/omnihuman-1-5 | Multi-character, best quality |
| OmniHuman 1.0 | bytedance/omnihuman-1-0 | Single character |
| Fabric 1.0 | falai/fabric-1-0 | Image talks with lipsync |
| PixVerse Lipsync | falai/pixverse-lipsync | Highly realistic |

Search Avatar Apps

infsh app list --search "omnihuman"
infsh app list --search "lipsync"
infsh app list --search "fabric"
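
The searches above can also be run in a loop. This is a minimal sketch that uses only the `infsh app list --search` form shown above; `search_avatar_apps` is a hypothetical wrapper name.

```shell
#!/bin/sh
# Run one app search per term and print a header before each result set.
search_avatar_apps() {
  for term in "$@"; do
    printf '== %s ==\n' "$term"
    infsh app list --search "$term"
  done
}

# Usage:
# search_avatar_apps omnihuman lipsync fabric
```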

Examples

OmniHuman 1.5 (Multi-Character)

infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Supports specifying which character to drive in multi-person images.

Fabric 1.0 (Image Talks)

infsh app run falai/fabric-1-0 --input '{
  "image_url": "https://face.jpg",
  "audio_url": "https://audio.mp3"
}'

PixVerse Lipsync

infsh app run falai/pixverse-lipsync --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Generates highly realistic lipsync from any audio.

Full Workflow: TTS + Avatar

# 1. Generate speech from text
infsh app run infsh/kokoro-tts --input '{
  "prompt": "Welcome to our product demo. Today I will show you..."
}' > speech.json

# 2. Create avatar video with the speech
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://presenter-photo.jpg",
  "audio_url": "<audio-url-from-step-1>"
}'
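
Step 2 needs the audio URL that step 1 wrote to speech.json. The output schema of the TTS app is not shown here, so the sketch below does not assume a field name and simply prints the first http(s) URL in the file. `first_url` is a hypothetical helper. It assumes the URL is stored as a plain JSON string without escaped slashes.

```shell
#!/bin/sh
# Print the first http(s) URL found in a file.
# Assumption: the saved TTS result contains the audio URL as a plain
# JSON string, and that URL is the first one in the file.
first_url() {
  grep -oE 'https?://[^"[:space:]]+' "$1" | head -n 1
}

# Usage:
# AUDIO_URL=$(first_url speech.json)
# infsh app run bytedance/omnihuman-1-5 \
#   --input "{\"image_url\": \"https://presenter-photo.jpg\", \"audio_url\": \"$AUDIO_URL\"}"
```

If the result contains several URLs, inspect speech.json first and confirm that the first match is the audio file.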

Full Workflow: Dub Video in Another Language

# 1. Transcribe original video
infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "https://video.mp4"}' > transcript.json

# 2. Translate text (manually or with an LLM)

# 3. Generate speech in new language
infsh app run infsh/kokoro-tts --input '{"text": "<translated-text>"}' > new_speech.json

# 4. Lipsync the original video with new audio
infsh app run infsh/latentsync-1-6 --input '{
  "video_url": "https://original-video.mp4",
  "audio_url": "<new-audio-url>"
}'
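
Translated text in step 3 often contains quotation marks, which would break a hand-written JSON string. The sketch below escapes the text before building the TTS input. Both helper names are hypothetical. The field name is a parameter because the examples in this document use both `prompt` and `text` for the TTS input. The sketch assumes single-line text.

```shell
#!/bin/sh
# Escape backslashes and double quotes so text can sit inside a JSON string.
# Assumption: the text is a single line; newlines and other control
# characters are not handled here.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Build a one-field JSON object: build_tts_input FIELD TEXT
build_tts_input() {
  printf '{"%s": "%s"}\n' "$1" "$(json_escape "$2")"
}

# Usage:
# infsh app run infsh/kokoro-tts \
#   --input "$(build_tts_input text "$TRANSLATED_TEXT")" > new_speech.json
```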

Use Cases

Marketing: Product demos with AI presenter
Education: Course videos, explainers
Localization: Dub content in multiple languages
Social Media: Consistent virtual influencer
Corporate: Training videos, announcements

Tips

Use high-quality portrait photos (front-facing, good lighting)
Audio should be clear with minimal background noise
OmniHuman 1.5 supports multiple people in one image
LatentSync is best for syncing existing videos to new audio

Related Skills

# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli

# Text-to-speech (generate audio for avatars)
npx skills add inference-sh/skills@text-to-speech

# Speech-to-text (transcribe for dubbing)
npx skills add inference-sh/skills@speech-to-text

# Video generation
npx skills add inference-sh/skills@ai-video-generation

# Image generation (create avatar images)
npx skills add inference-sh/skills@ai-image-generation

Browse all video apps: infsh app list --category video

Documentation

Running Apps - How to run apps via CLI
Content Pipeline Example - Building media workflows
Streaming Results - Real-time progress updates

Weekly Installs: 4.4K
Repository: inferen-sh/skills
GitHub Stars: 159
First Seen: 6 days ago
Security Audits: Gen Agent Trust Hub (Pass), Socket (Pass), Snyk (Warn)
Installed on: claude-code 3.5K, gemini-cli 3.1K, codex 3.1K, amp 3.1K, github-copilot 3.1K, kimi-cli 3.1K

Statistics

Installs: 54.8K
Rating: 4.7 / 5.0
Updated: May 21, 2026
Comparison cases: 1

Supported Platforms

Claude Code
OpenClaw
OpenCode
Codex
Gemini CLI
GitHub Copilot
Amp
Kimi CLI

Timeline

Created: March 17, 2026
Last updated: May 21, 2026