---
id: sm-ai-avatar-video
name: "ai-avatar-video"
url: https://skills.yangsir.net/skill/sm-ai-avatar-video
author: inferen-sh
domain: multimedia
tags: ["ai-video-generation", "digital-avatars", "synthesia", "deepfake", "character-animation"]
install_count: 54800
rating: 4.70 (260 reviews)
github: https://github.com/inferen-sh/skills
---

# ai-avatar-video

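> Create virtual avatar videos with AI, with realistic character animation and speech synchronization, for use in education, marketing, entertainment, and similar fields.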

**Stats**: 54,800 installs · 4.7/5 (260 reviews)

## Readme

# AI Avatar & Talking Head Videos

Create AI avatars and talking head videos via [inference.sh](https://inference.sh) CLI.

## Quick Start

Requires the inference.sh CLI (`infsh`); see the [install instructions](https://raw.githubusercontent.com/inference-sh/skills/refs/heads/main/cli-install.md).

```
infsh login

# Create avatar video from image + audio
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

```
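Because every command in this guide depends on `infsh` being installed, a script can check for it before doing any work. The following is a minimal sketch; `require_cmd` is a hypothetical helper name, not part of the inference.sh CLI, and it relies only on the POSIX `command -v` builtin.

```shell
# Return 0 if the named command is on PATH; otherwise print a hint and return 1.
# require_cmd is a hypothetical helper, not something the infsh CLI provides.
require_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    return 0
  fi
  echo "Missing required command: $1" >&2
  return 1
}

# Example (commented out so that sourcing this file has no side effects):
# require_cmd infsh || exit 1
```

Calling `require_cmd infsh || exit 1` at the top of a pipeline script stops it early with a readable message instead of failing partway through.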

## Available Models

| Model | App ID | Best For |
|---|---|---|
| OmniHuman 1.5 | `bytedance/omnihuman-1-5` | Multi-character, best quality |
| OmniHuman 1.0 | `bytedance/omnihuman-1-0` | Single character |
| Fabric 1.0 | `falai/fabric-1-0` | Image talks with lipsync |
| PixVerse Lipsync | `falai/pixverse-lipsync` | Highly realistic |

## Search Avatar Apps

```
infsh app list --search "omnihuman"
infsh app list --search "lipsync"
infsh app list --search "fabric"

```

## Examples

### OmniHuman 1.5 (Multi-Character)

```
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

```

OmniHuman 1.5 lets you specify which character to drive when the image contains several people.

### Fabric 1.0 (Image Talks)

```
infsh app run falai/fabric-1-0 --input '{
  "image_url": "https://face.jpg",
  "audio_url": "https://audio.mp3"
}'

```

### PixVerse Lipsync

```
infsh app run falai/pixverse-lipsync --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

```

PixVerse Lipsync generates highly realistic lip movement matched to the supplied audio.
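The three examples above pass the same `image_url` and `audio_url` inputs, so one way to compare models is to loop over their app IDs. The sketch below assumes those apps need no other required inputs. `compare_avatar_models` and the `INFSH` override variable are hypothetical names introduced here for illustration.

```shell
# INFSH can be overridden (for example INFSH=echo) to preview the commands
# without running them. This variable is an assumption of this sketch.
INFSH="${INFSH:-infsh}"

# Run each app from the examples with the same image and audio URLs.
# Assumes the URLs contain no double quotes, since the JSON is built with printf.
compare_avatar_models() {
  image_url="$1"
  audio_url="$2"
  payload=$(printf '{"image_url": "%s", "audio_url": "%s"}' "$image_url" "$audio_url")
  for app in bytedance/omnihuman-1-5 falai/fabric-1-0 falai/pixverse-lipsync; do
    "$INFSH" app run "$app" --input "$payload"
  done
}

# Example (not run here):
# compare_avatar_models "https://portrait.jpg" "https://speech.mp3"
```

Setting `INFSH=echo` before calling the function prints the three commands instead of executing them, which is a cheap way to check the payload first.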

## Full Workflow: TTS + Avatar

```
# 1. Generate speech from text
infsh app run infsh/kokoro-tts --input '{
  "prompt": "Welcome to our product demo. Today I will show you..."
}' > speech.json

# 2. Create avatar video with the speech
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://presenter-photo.jpg",
  "audio_url": "<audio-url-from-step-1>"
}'

```
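Step 2 needs the audio URL that step 1 saved in `speech.json`, and the source does not show that file's layout. The following sketch pulls the URL out with `jq`, assuming the file holds plain JSON. The field path `.audio_url` is a guess, kept in the hypothetical `AUDIO_FIELD` variable so it can be changed once the real output has been inspected. `audio_url_from` is likewise a hypothetical helper.

```shell
# ASSUMPTION: the jq path to the audio URL in the TTS result is not documented
# in this guide. Inspect speech.json and adjust AUDIO_FIELD to match.
AUDIO_FIELD="${AUDIO_FIELD:-.audio_url}"

# Print the audio URL stored in a saved result file.
# jq -e makes the function fail when the field is missing or null.
audio_url_from() {
  jq -e -r "$AUDIO_FIELD" "$1"
}

# Example (not run here):
# infsh app run infsh/kokoro-tts --input '{"prompt": "Welcome to our product demo."}' > speech.json
# audio_url=$(audio_url_from speech.json) || exit 1
# infsh app run bytedance/omnihuman-1-5 --input "{\"image_url\": \"https://presenter-photo.jpg\", \"audio_url\": \"$audio_url\"}"
```

Failing on a missing field keeps the avatar step from running with an empty `audio_url`.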

## Full Workflow: Dub Video in Another Language

```
# 1. Transcribe original video
infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "https://video.mp4"}' > transcript.json

# 2. Translate text (manually or with an LLM)

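# 3. Generate speech in new language
# (this call passes "text", while the TTS step in the previous workflow passes "prompt";
# check which key the app expects before running)
infsh app run infsh/kokoro-tts --input '{"text": "<translated-text>"}' > new_speech.json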

# 4. Lipsync the original video with new audio
infsh app run infsh/latentsync-1-6 --input '{
  "video_url": "https://original-video.mp4",
  "audio_url": "<new-audio-url>"
}'

```
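Translated text often contains quotes, apostrophes, or line breaks, which break hand-written JSON inside single quotes. As a sketch, the inputs for steps 3 and 4 can be built with `jq` so that escaping is handled automatically. `tts_input` and `lipsync_input` are hypothetical helper names, and the keys are the ones used in the workflow above.

```shell
# Build the TTS input; jq escapes quotes and newlines in the translated text.
# The "text" key mirrors step 3 above.
tts_input() {
  jq -n -c --arg text "$1" '{text: $text}'
}

# Build the lipsync input from a video URL and an audio URL (keys from step 4).
lipsync_input() {
  jq -n -c --arg video "$1" --arg audio "$2" '{video_url: $video, audio_url: $audio}'
}

# Example (not run here):
# infsh app run infsh/kokoro-tts --input "$(tts_input "$translated_text")" > new_speech.json
# infsh app run infsh/latentsync-1-6 --input "$(lipsync_input "https://original-video.mp4" "$new_audio_url")"
```

Passing values through `--arg` means the text is never interpolated into the JSON by the shell, so a stray `"` in the translation cannot produce invalid input.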

## Use Cases

- **Marketing**: Product demos with AI presenter
- **Education**: Course videos, explainers
- **Localization**: Dub content in multiple languages
- **Social Media**: Consistent virtual influencer
- **Corporate**: Training videos, announcements

## Tips

- Use high-quality portrait photos (front-facing, good lighting)
- Audio should be clear with minimal background noise
- OmniHuman 1.5 supports multiple people in one image
- LatentSync is best for syncing existing videos to new audio

## Related Skills

```
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli

# Text-to-speech (generate audio for avatars)
npx skills add inference-sh/skills@text-to-speech

# Speech-to-text (transcribe for dubbing)
npx skills add inference-sh/skills@speech-to-text

# Video generation
npx skills add inference-sh/skills@ai-video-generation

# Image generation (create avatar images)
npx skills add inference-sh/skills@ai-image-generation

```

Browse all video apps: `infsh app list --category video`

## Documentation

- [Running Apps](https://inference.sh/docs/apps/running) - How to run apps via CLI
- [Content Pipeline Example](https://inference.sh/docs/examples/content-pipeline) - Building media workflows
- [Streaming Results](https://inference.sh/docs/api/sdk/streaming) - Real-time progress updates

---
*Source: https://skills.yangsir.net/skill/sm-ai-avatar-video*
*Markdown mirror: https://skills.yangsir.net/api/skill/sm-ai-avatar-video/markdown*