Turn text prompts into videos via AI-generated HTML. No coding required.
Features • Quick Start • How It Works • Comparison • Download
Html To Video Pixie is an AI-powered desktop application that transforms natural language descriptions into HTML/CSS animations, then renders them as high-quality MP4 videos or GIF animations.
Simply describe what you want in plain language, and the AI generates the HTML code. The built-in rendering engine converts it to video — no coding skills required.
- Natural Language to Video — Describe your animation in words, get a video
- Real-time Preview — See the HTML animation instantly as AI generates it
- Dual Output Formats — Export as MP4 video or animated GIF
- Built-in Code Editor — Fine-tune the generated HTML directly in the app
- Multi-Model Support — OpenAI, DeepSeek, Qwen, Zhipu AI, Xiaomi MiMo, and more
- Custom Providers — Add any OpenAI-compatible API service
- Streaming Output — Watch AI generate code in real-time
- Bilingual Interface — English and Chinese with one-click switching
- No Installation Hassle — Pre-packaged installer with FFmpeg included
- Auto-save — All creations and chat history preserved automatically
- Import HTML — Load existing HTML files for rendering
- Download
HtmlToVideoPixieInstaller.exe - Run installer (requires administrator privileges)
- Launch from desktop shortcut or Start Menu
# Install dependencies
pip install playwright requests Pillow tkinterweb
# Install Playwright browsers
playwright install chromium
# Ensure FFmpeg is in PATH
ffmpeg -version
# Run the application
python main_tkinter.py┌─────────────────────────────────────────────────────────────────┐
│ Html To Video Pixie │
├─────────────────────────────────────────────────────────────────┤
│ │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ User Input │ ──► │ AI Model │ ──► │ HTML/CSS │ │
│ │ (Natural │ │ (GPT-4o, │ │ Animation │ │
│ │ Language) │ │ DeepSeek..) │ │ Code │ │
│ └──────────────┘ └──────────────┘ └──────┬───────┘ │
│ │ │
│ ▼ │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ MP4/GIF │ ◄── │ FFmpeg │ ◄── │ Playwright │ │
│ │ Output │ │ Encoding │ │ Browser │ │
│ └──────────────┘ └──────────────┘ │ Rendering │ │
│ └──────────────┘ │
└─────────────────────────────────────────────────────────────────┘
Technical Stack:
- GUI: Python tkinter (lightweight, cross-platform)
- AI: OpenAI-compatible API with streaming
- Rendering: Playwright (headless Chromium/Edge)
- Encoding: FFmpeg (MP4/GIF output)
| Project | Core Idea | Target Users | Tech Stack |
|---|---|---|---|
| HtmlToVideoPixie | Natural Language → HTML → Video | Non-programmers, creators | Python, Tkinter, AI API |
| Remotion | React Components → Video | Frontend developers | React, TypeScript, Node.js |
| HyperFrames | HTML Source → Video | AI Agents, developers | HTML, CSS, JavaScript |
| html-video | Data → Multi-frame HTML → Video | Automation, Coding Agents | Python, Chrome, FFmpeg |
| Aspect | HtmlToVideoPixie | Sora/Kling |
|---|---|---|
| Control | Precise, deterministic (code-based) | Probabilistic (diffusion-based) |
| Text Rendering | Vector-sharp, perfect for subtitles | Often garbled/unreadable |
| Resolution | Unlimited (4K/8K possible) | Limited to 1080p |
| Style | Geometric, data-viz, presentations | Photorealistic, cinematic |
| Editability | Direct code editing | Re-prompt required |
- Exact Timing Control — "Rotate 45° at exactly 3 seconds" is possible
- Crystal Clear Text — Perfect for data visualization, PPT animations, subtitles
- Infinite Scalability — Vector graphics scale to any resolution
- Code-Level Precision — Every pixel is deterministic and reproducible
HtmlToVideoPixie/
├── main_tkinter.py # Main application
├── installer.py # Installer script
├── build_installer.bat # Build script
├── logo.ico # Application icon
├── logo.png # Original logo
├── config/
│ └── providers.json # AI provider configuration
├── core/
│ ├── ai_client.py # AI API client
│ ├── video_generator.py # Video rendering engine
│ └── i18n.py # Internationalization
├── README.md # This file
└── LICENSE # MIT License
- Open Settings → Model Settings
- Click Add Provider
- Fill in:
- Name: Display name (e.g., "My DeepSeek")
- API URL: Base URL (e.g.,
https://api.deepseek.com/v1) - API Key: Your API key
- Model ID: Model name (e.g.,
deepseek-chat)
- Click Save
Configuration is stored in %APPDATA%\HtmlToVideoPixie\providers.json
- OpenAI (GPT-4o, GPT-4-turbo)
- DeepSeek (deepseek-chat, deepseek-coder)
- Qwen (qwen-turbo, qwen-plus)
- Zhipu AI (glm-4)
- Xiaomi MiMo
- Any OpenAI-compatible API
Good Prompt:
Create a colorful bar chart animation showing monthly sales data:
- 5 bars for Jan to May
- Each bar grows from bottom with a smooth ease-out animation
- Bars appear one by one with 0.3s delay
- Show value labels on top of each bar
- Use gradient colors: blue to purple
Why it works:
- Specific element count and names
- Clear animation description
- Timing parameters
- Visual style details
- Generate — Let AI create initial code
- Preview — Check animation in preview pane
- Edit — Adjust values in the code editor
- Iterate — Regenerate if needed
- Export — Save as MP4 or GIF
- OS: Windows 10/11 (64-bit)
- RAM: 4GB minimum, 8GB recommended
- Disk: 500MB free space
- Network: Required for AI API calls
| Issue | Solution |
|---|---|
| FFmpeg not found | Install FFmpeg and add to PATH |
| Browser launch failed | Run playwright install chromium |
| API connection error | Check API key and network |
| Video generation slow | Reduce duration/FPS in settings |
MIT License - See LICENSE for details.
- Playwright — Browser automation
- FFmpeg — Video encoding
- Pillow — Image processing
- tkinterweb — HTML preview
- GitHub: FatFatYoung/HtmlToVideoPixie
- Issues: Report a bug
Made with ❤️ by FatFatYoung
