A

ai-avatar-video

by @inferen-shv
4.5(124)

Creates virtual avatar videos using AI technology, achieving realistic character animation and voice synchronization, widely applied in education, marketing, and entertainment.

AI Video GenerationDigital AvatarsSynthesiaDeepfakeCharacter AnimationGitHub
Installation
npx skills add inferen-sh/skills --skill ai-avatar-video
compare_arrows

Before / After Comparison

1
Before

Producing videos featuring real people is costly, requiring actors, venues, equipment rental, and complex post-production. The production cycle is long, and modifications are inconvenient.

After

Utilizing AI virtual avatar video technology, realistic virtual presenter videos can be generated simply by inputting text or audio. This significantly reduces production costs and time, enabling businesses and individuals to quickly and flexibly create a large volume of video content, achieving personalization and scalability.

description SKILL.md

ai-avatar-video

AI Avatar & Talking Head Videos

Create AI avatars and talking head videos via inference.sh CLI.

Quick Start

Requires inference.sh CLI (infsh). Install instructions

infsh login

# Create avatar video from image + audio
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Available Models

Model App ID Best For

OmniHuman 1.5 bytedance/omnihuman-1-5 Multi-character, best quality

OmniHuman 1.0 bytedance/omnihuman-1-0 Single character

Fabric 1.0 falai/fabric-1-0 Image talks with lipsync

PixVerse Lipsync falai/pixverse-lipsync Highly realistic

Search Avatar Apps

infsh app list --search "omnihuman"
infsh app list --search "lipsync"
infsh app list --search "fabric"

Examples

OmniHuman 1.5 (Multi-Character)

infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Supports specifying which character to drive in multi-person images.

Fabric 1.0 (Image Talks)

infsh app run falai/fabric-1-0 --input '{
  "image_url": "https://face.jpg",
  "audio_url": "https://audio.mp3"
}'

PixVerse Lipsync

infsh app run falai/pixverse-lipsync --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Generates highly realistic lipsync from any audio.

Full Workflow: TTS + Avatar

# 1. Generate speech from text
infsh app run infsh/kokoro-tts --input '{
  "prompt": "Welcome to our product demo. Today I will show you..."
}' > speech.json

# 2. Create avatar video with the speech
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://presenter-photo.jpg",
  "audio_url": "<audio-url-from-step-1>"
}'

Full Workflow: Dub Video in Another Language

# 1. Transcribe original video
infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "https://video.mp4"}' > transcript.json

# 2. Translate text (manually or with an LLM)

# 3. Generate speech in new language
infsh app run infsh/kokoro-tts --input '{"text": "<translated-text>"}' > new_speech.json

# 4. Lipsync the original video with new audio
infsh app run infsh/latentsync-1-6 --input '{
  "video_url": "https://original-video.mp4",
  "audio_url": "<new-audio-url>"
}'

Use Cases

  • Marketing: Product demos with AI presenter

  • Education: Course videos, explainers

  • Localization: Dub content in multiple languages

  • Social Media: Consistent virtual influencer

  • Corporate: Training videos, announcements

Tips

  • Use high-quality portrait photos (front-facing, good lighting)

  • Audio should be clear with minimal background noise

  • OmniHuman 1.5 supports multiple people in one image

  • LatentSync is best for syncing existing videos to new audio

Related Skills

# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli

# Text-to-speech (generate audio for avatars)
npx skills add inference-sh/skills@text-to-speech

# Speech-to-text (transcribe for dubbing)
npx skills add inference-sh/skills@speech-to-text

# Video generation
npx skills add inference-sh/skills@ai-video-generation

# Image generation (create avatar images)
npx skills add inference-sh/skills@ai-image-generation

Browse all video apps: infsh app list --category video

Documentation

Weekly Installs4.4KRepositoryinferen-sh/skillsGitHub Stars159First Seen6 days agoSecurity AuditsGen Agent Trust HubPassSocketPassSnykWarnInstalled onclaude-code3.5Kgemini-cli3.1Kcodex3.1Kamp3.1Kgithub-copilot3.1Kkimi-cli3.1K

forumUser Reviews (0)

Write a Review

Effect
Usability
Docs
Compatibility

No reviews yet

Statistics

Installs6.5K
Rating4.5 / 5.0
Version
Updated2026年3月17日
Comparisons1

User Rating

4.5(124)
5
0%
4
0%
3
0%
2
0%
1
0%

Rate this Skill

0.0

Compatible Platforms

🔧Claude Code
🔧OpenClaw
🔧OpenCode
🔧Codex
🔧Gemini CLI
🔧GitHub Copilot
🔧Amp
🔧Kimi CLI

Timeline

Created2026年3月17日
Last Updated2026年3月17日