A comprehensive, curated list of State-of-the-Art (SOTA) Image Generation models, Generative AI research, and text-to-image tools. Stay updated with the latest advancements in Stable Diffusion, FLUX.1, Midjourney, and DALL-E.
- 🔥 FLUX.1 by Black Forest Labs (Aug 2024): The new benchmark for open-weights models. Built on a 12B-parameter rectified flow transformer with strong prompt adherence.
- 🎨 Midjourney V7 (April 2025): Adds Omni Reference for more consistent subjects across images and Draft Mode for faster iteration.
- ⚡ Stable Diffusion 3.5 (Oct 2024): Stability AI's flagship release featuring MMDiT architecture, perfect for local fine-tuning and LoRA development.
- 🤖 Google Imagen 4 (2025): Professional-grade photorealism and advanced typography, now native in the Gemini ecosystem.
- 💬 OpenAI GPT-4o Image Generation (March 2025): Native, conversational image generation and iterative editing within ChatGPT.
- 🛠️ r/StableDiffusion: The heart of local AI generation and ComfyUI workflows.
- 🔬 r/MachineLearning: Deep dives into Rectified Flow and Diffusion Transformer (DiT) papers.
- 🔗 The rise of ComfyUI: Why node-based pipelines are the future of professional AI art.
| Model Name | Year | Pretrained Weights | Codebase | Research Paper | Quality | License |
|---|---|---|---|---|---|---|
| Flux.1 [schnell] 👑 | 2024 | 🤗 HF Hub | 💻 GitHub | 📄 Report | SOTA | Apache 2.0 |
| Stable Diffusion 3.5 ⚡ | 2024 | 🤗 HF Hub | 💻 GitHub | 📄 2403.03206 | SOTA | Community |
| DALL-E 3 🧠 | 2023 | API Only | Proprietary | 📄 System Card | SOTA | Proprietary |
| Midjourney V7 🖌️ | 2025 | Web/Discord | Proprietary | -- | SOTA | Proprietary |
| Janus-Pro (7B) 🌌 | 2025 | 🤗 HF Hub | 💻 GitHub | 📄 2501.17811 | A+ | MIT |
| Recraft V3 📐 | 2024 | API/Web | Proprietary | -- | A+ | Commercial |
| UNIT (Legacy) 🏛️ | 2017 | 💻 Model | 💻 Code | 📄 1703.00848 | B | CC Non-Comm |
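
To shortlist models from the table by license, the rows can be treated as plain data. The sketch below is only an illustration: `ModelEntry`, `PERMISSIVE`, and `permissive_open_weights` are hypothetical names, the rows are transcribed by hand from the table, and treating Apache 2.0 and MIT as the "permissive" set is an assumption you may want to change.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One row of the comparison table (link and emoji columns omitted)."""
    name: str
    year: int
    weights: str   # value of the "Pretrained Weights" column
    license: str   # value of the "License" column


# Rows transcribed by hand from the table above.
MODELS = [
    ModelEntry("Flux.1 [schnell]", 2024, "HF Hub", "Apache 2.0"),
    ModelEntry("Stable Diffusion 3.5", 2024, "HF Hub", "Community"),
    ModelEntry("DALL-E 3", 2023, "API Only", "Proprietary"),
    ModelEntry("Midjourney V7", 2025, "Web/Discord", "Proprietary"),
    ModelEntry("Janus-Pro (7B)", 2025, "HF Hub", "MIT"),
    ModelEntry("Recraft V3", 2024, "API/Web", "Commercial"),
    ModelEntry("UNIT (Legacy)", 2017, "Model", "CC Non-Comm"),
]

# Assumption: only these two licenses count as "permissive" in this sketch.
PERMISSIVE = {"Apache 2.0", "MIT"}


def permissive_open_weights(models):
    """Return names of models with Hub-hosted weights under a permissive license."""
    return [
        m.name
        for m in models
        if m.weights == "HF Hub" and m.license in PERMISSIVE
    ]


if __name__ == "__main__":
    for name in permissive_open_weights(MODELS):
        print(name)
```

Always read the full license text on the model card before commercial use; the one-word labels in the table are summaries, not legal terms.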
- ✨ Black Forest Labs: Official home of the FLUX.1 model family.
- 🖼️ Civitai: The ultimate repository for Stable Diffusion checkpoints, LoRAs, and embeddings.
- 🎮 Midjourney Explore: Interactive gallery and web generation.
- 🧪 Stability AI Platform: API access for SD3 and SDXL.
- 🎨 Seedream AI Studio: Multi-model image generation using Seedream 5.0/4.5/4.0 (ByteDance), with one-click Kling 2.1 video animation. Free tier available.
- 📖 Hugging Face Daily Papers: The pulse of the AI research community.
- 🔎 ArXiv: Computer Vision: Latest pre-prints in image synthesis.
- 🧠 Arxiv-sanity-lite: A better way to browse and search AI papers.
If this consolidation helps your research or creative work, please consider supporting the maintenance of this landscape:
- ☕ Support via PayPal
- 🪙 Bitcoin: `3LZazKXG18Hxa3LLNAeKYZNtLzCxpv1LyD`
Maintained by ishandutta2007. PRs are always welcome!