All AI Models, Ready to Create

Browse every model we support for video, image, audio, and text generation. Pick the one that fits your workflow and ship faster.

AI Video Models

HappyHorse logo

HappyHorse

HappyHorse is the team behind HappyHorse-1.0, the #1 ranked AI video generation model on the Artificial Analysis Video Arena leaderboard. Built on a 15-billion-parameter unified Transformer architecture, HappyHorse-1.0 generates synchronized audio and video in a single forward pass — no post-dubbing required. It supports both text-to-video and image-to-video workflows, multilingual lip-sync across 7 languages, and 50+ visual styles, delivering cinematic outputs at up to 1080p.

OpenAI

OpenAI Sora

OpenAI Sora is a cinematic AI video generator built for prompt-driven short clips, image-to-video animation, and fast iteration across scene composition, pacing, and visual style.

Google

Google Veo

Google Veo is an advanced AI video generation model family from Google DeepMind, introduced in May 2024. It is built for high-definition video generation from text prompts and images, with stronger cinematic control, prompt understanding, and flexible scene customization.

Kling

Kling AI Video

Kling AI is a leading AI video generator that offers features ranging from text-to-video and image-to-video to video extension and cinematic multi-shot sequences. Renowned for its prompt to screen capabilities and realistic output, it provides both free and premium plans for global users.

Grok

Grok Imagine Video

xAI Grok Imagine is xAI's visual generation lineup for image and video creation workflows, supporting fast concept iteration, expressive style exploration, and natural-language refinement.

ByteDance

ByteDance Seedance

Seedance 2.0 is ByteDance's cinematic audio-video generation lineup. It pairs stronger prompt comprehension with director-level camera moves, realistic physics, native soundtrack generation, and multi-shot continuity so every clip feels like a finished scene.

Hailuo

MiniMax Hailuo

Hailuo AI, developed by MiniMax, is an AI video generator that turns text prompts and images into short clips with fast turnaround, realistic AI voiceovers, and music that can sync with your visuals.

Luma

Luma AI

Luma Dream Machine is Luma Labs' AI video generator built for cinematic visuals, realistic motion, prompt-assisted generation, and a flexible mix of animated, realistic, and film-like video styles.

Lightricks

Lightricks

LTX-2.3 is Lightricks' open-source video model family built for sharp detail, fast generation, native audio, and practical production workflows from idea to final edit. The model supports image-to-video, multi-keyframe conditioning, keyframe-based animation, video extension (both forward and backward), video-to-video transformations, and any combination of these features.

Runway

Runway AI

Runway AI is a creative video platform from Runway Research that combines text-to-video, image-to-video, video transformation, character performance tools, lip sync, and frame interpolation in a single AI workflow.

Vidu

Vidu AI

Vidu AI, created by Shengshu Technology, is a fast AI video model focused on quick generation, flexible templates, ready-made inspiration content, and detailed control over style, duration, and output settings.

PixVerse

PixVerse AI

PixVerse AI is a browser-based AI video generator that focuses on fast web creation, varied styles, creative effects, and high-resolution video output without requiring a desktop app.

Qwen

Wan Video

Wan AI is Alibaba's AI video generation model for text and image driven creation, built around stronger prompt understanding, broad style coverage, realistic motion, and better multi-object interactions.

AI Image Models

OpenAI

OpenAI GPT-Image

OpenAI GPT-Image is OpenAI's image generation and editing lineup, designed for strong instruction following, high-quality visual output, natural-language image edits, and reliable text rendering inside images.

Google

Google nano banana

Google Nano Banana AI, also called Gemini 2.5 Flash Image, is an advanced AI image generation and editing model from Google DeepMind. It is designed for detail-rich image creation, highly consistent iterative edits, and fast natural-language-driven refinement across both original generation and image editing workflows.

Grok

Grok Imagine Image

xAI Grok Imagine is xAI's visual generation lineup for image and video creation workflows, supporting fast concept iteration, expressive style exploration, and natural-language refinement.

Kling

Kling AI Image

Kling AI is a leading AI video generator that offers features ranging from text-to-video and image-to-video to video extension and cinematic multi-shot sequences. Renowned for its prompt to screen capabilities and realistic output, it provides both free and premium plans for global users.

ByteDance

ByteDance Seedream

ByteDance Seedream is an advanced AI image generation and editing lineup built for natural language-guided image edits, strong character consistency, ultra-high-quality text-to-image output, and precise on-image text rendering.

Vidu

Vidu AI Image

Vidu AI, created by Shengshu Technology, is a fast AI video model focused on quick generation, flexible templates, ready-made inspiration content, and detailed control over style, duration, and output settings.

Qwen

Wan Image

Wan AI is Alibaba's AI video generation model for text and image driven creation, built around stronger prompt understanding, broad style coverage, realistic motion, and better multi-object interactions.

Flux

Flux AI

Flux AI is an advanced text-to-image generation lineup from Black Forest Labs, designed for highly detailed prompt-driven image creation, versatile artistic styles, high-resolution output, and deeper generation control.

Ideogram

Ideogram

Ideogram is an AI image generation platform focused on turning text prompts into images quickly, with strong prompt fidelity and standout text-in-image rendering quality.

Qwen

Qwen Image

Qwen Image is an open image generation model built for stylistically diverse image creation, intuitive natural-language editing, sophisticated multilingual text rendering, and strong prompt adherence across creative tasks.

Z.ai

Z-Image

Z-Image is Tongyi-MAI's image model family, including generation and editing variants designed for high-quality output, strong aesthetics, and controllable prompt-driven visual creation.

AI Audio Models

All AI Models | skills.video