A high-performance Rust port of ComfyUI using the Candle ML framework
Bringing the power of ComfyUI workflows to Rust with native performance
comfy.rs is a ground-up reimplementation of ComfyUI in Rust, leveraging the Candle ML framework. Our goal is to provide a lightweight, fast, and memory-efficient alternative for running stable diffusion and other AI model workflows across all major platforms.
- 🚀 Performance: targeting 1.5-2x faster inference than Python implementations
- 💾 Memory Efficient: targeting a 20-30% lower memory footprint with smart model management
- 📦 Lightweight Deployment: Single binary < 100MB (no Python runtime needed)
- 🔒 Type Safety: Rust's type system catches errors at compile time
- ⚡ Serverless Ready: Fast cold starts, perfect for cloud deployments
- 🌍 Cross-Platform: Native support for Linux, macOS, Windows, and server environments
- 🎮 GPU Acceleration: CUDA, Metal, ROCm support via Candle
```
comfy.rs/
├── comfy-core/           # Core workflow engine
│   ├── graph/            # Node graph representation & execution
│   ├── node/             # Node trait and type system
│   ├── cache/            # Execution caching system
│   └── memory/           # Smart memory management
│
├── comfy-nodes/          # Standard node implementations
│   ├── loaders/          # Model, VAE, CLIP, LoRA loaders
│   ├── samplers/         # KSampler, schedulers, noise generation
│   ├── conditioning/     # CLIP text encode, conditioning ops
│   ├── latent/           # Latent space operations
│   └── image/            # Image processing, VAE encode/decode
│
├── comfy-models/         # Model implementations using Candle
│   ├── clip/             # CLIP models
│   ├── vae/              # VAE implementations
│   ├── unet/             # U-Net diffusion models
│   ├── controlnet/       # ControlNet support
│   └── samplers/         # Sampling algorithms (DPM++, Euler, etc.)
│
├── comfy-server/         # REST API & WebSocket server
│   ├── api/              # HTTP endpoints
│   ├── queue/            # Job queue system
│   ├── websocket/        # Real-time progress updates
│   └── storage/          # Output and cache management
│
├── comfy-cli/            # Command-line interface
│   ├── execute/          # Workflow execution
│   ├── validate/         # Workflow validation
│   └── convert/          # Model conversion utilities
│
└── examples/             # Example workflows and usage
    ├── workflows/        # Sample JSON workflows
    └── tutorials/        # Getting started guides
```
- Modular Architecture: Each component is independent and reusable
- Zero-Copy Where Possible: Minimize memory allocations and copies
- Async-First: Built on Tokio for efficient I/O and parallelism
- Type-Safe Node System: Strongly-typed inputs/outputs with runtime validation
- Compatible: Support ComfyUI workflow JSON format
- Extensible: Plugin system for custom nodes in Rust
Goals: Establish core architecture and workflow engine
- Project structure and workspace setup
- Core traits definition (`Node`, `Tensor`, `WorkflowGraph`, `ExecutionContext`)
- JSON workflow parser
- Node graph validation and dependency resolution
- Execution engine with topological sorting
- Basic caching system (content-based hashing)
- Memory management foundation
- Error handling framework
Deliverables:
- Workflow can be loaded from ComfyUI JSON
- Dependency graph is correctly built and validated
- Mock nodes can be executed in correct order
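The execution engine above boils down to a topological sort of the node dependency graph. A minimal std-only sketch using Kahn's algorithm (the real engine would operate on typed graph structures rather than string ids; a `None` result signals a dependency cycle):

```rust
use std::collections::{HashMap, VecDeque};

/// Compute an execution order where every node runs after its
/// dependencies. `deps` maps node id -> ids it depends on.
/// Returns `None` if the graph contains a cycle.
fn execution_order(deps: &HashMap<&str, Vec<&str>>) -> Option<Vec<String>> {
    // Count unmet dependencies per node.
    let mut indegree: HashMap<&str, usize> =
        deps.keys().map(|&n| (n, deps[n].len())).collect();
    // Reverse edges: dependency -> dependents.
    let mut dependents: HashMap<&str, Vec<&str>> = HashMap::new();
    for (&node, ds) in deps {
        for &d in ds {
            dependents.entry(d).or_default().push(node);
        }
    }
    // Start with nodes that have no dependencies.
    let mut queue: VecDeque<&str> = indegree
        .iter()
        .filter(|(_, &deg)| deg == 0)
        .map(|(&n, _)| n)
        .collect();
    let mut order = Vec::new();
    while let Some(n) = queue.pop_front() {
        order.push(n.to_string());
        for &m in dependents.get(n).map(|v| v.as_slice()).unwrap_or(&[]) {
            let e = indegree.get_mut(m).unwrap();
            *e -= 1;
            if *e == 0 {
                queue.push_back(m);
            }
        }
    }
    // Any node left with unmet dependencies means a cycle.
    if order.len() == deps.len() { Some(order) } else { None }
}
```

In a txt2img graph, this guarantees the checkpoint loader runs before the text encoder, which in turn runs before the sampler.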
Goals: Implement core nodes for basic txt2img workflow
- SafeTensors loader integration
- CheckpointLoaderSimple node
- SD 1.5 support
- SDXL support
- SD 3.x support
- VAELoader node
- CLIPLoader node
- Model path configuration system
- Device selection (CPU/CUDA/Metal)
- KSampler node implementation
- Noise generation
- Schedulers:
- DPM++ 2M
- DPM++ 2M Karras
- Euler
- Euler A
- LMS
- CFG (Classifier-Free Guidance)
- Latent image tensor handling
- CLIPTextEncode node
- ConditioningCombine
- ConditioningSetArea
- VAEDecode node
- VAEEncode node
- EmptyLatentImage node
- LoadImage node
- SaveImage node
- PreviewImage node
- ImageScale node
Deliverables:
- Complete txt2img workflow works end-to-end
- Can load any SD1.5/SDXL checkpoint
- Generate images comparable to ComfyUI output
- Basic img2img support
Test Workflow: Standard txt2img (checkpoint → CLIP encode → KSampler → VAE decode → save)
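The "Karras" schedulers listed above derive their noise levels by interpolating in σ^(1/ρ) space between σ_max and σ_min (ρ = 7 in the Karras et al. paper). A sketch of the schedule, with illustrative constants and a function name that is not the final API (samplers typically append a final σ = 0 step, omitted here):

```rust
/// Karras-style noise schedule: interpolate linearly in sigma^(1/rho)
/// space from sigma_max down to sigma_min, then raise back to the
/// rho-th power. Requires n >= 2.
fn karras_sigmas(n: usize, sigma_min: f64, sigma_max: f64, rho: f64) -> Vec<f64> {
    let inv = 1.0 / rho;
    (0..n)
        .map(|i| {
            let t = i as f64 / (n - 1) as f64; // 0.0 at first step, 1.0 at last
            let s = sigma_max.powf(inv) + t * (sigma_min.powf(inv) - sigma_max.powf(inv));
            s.powf(rho)
        })
        .collect()
}
```

The schedule front-loads large noise levels and packs steps densely near σ_min, which is why Karras variants of DPM++ often look better at low step counts.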
Goals: Production-ready API server
- HTTP server with Axum
- REST API endpoints:
  - `POST /api/prompt` - Queue workflow
  - `GET /api/queue` - Get queue status
  - `GET /api/history` - Get execution history
  - `DELETE /api/queue/:id` - Cancel job
  - `POST /api/interrupt` - Interrupt current execution
- Job queue system with priorities
- Concurrent execution management
- WebSocket server
- Progress events
- Preview image streaming
- Node execution events
- Error notifications
- Output directory management
- Temporary file cleanup
- Workflow history persistence (SQLite)
- Model cache management
- Configuration file support (TOML/YAML)
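The priority job queue can be sketched with a `BinaryHeap` whose ordering prefers higher priority and breaks ties first-in-first-out via a submission counter. This is a toy synchronous model; the real queue would be async and persisted:

```rust
use std::cmp::Ordering;
use std::collections::BinaryHeap;

/// A queued job: higher `priority` runs first; ties go to the
/// earlier submission (lower `seq`).
#[derive(Eq, PartialEq)]
struct Job {
    priority: u8,
    seq: u64, // monotonically increasing submission counter
    id: String,
}

impl Ord for Job {
    fn cmp(&self, other: &Self) -> Ordering {
        // Max-heap: compare priority first, then reverse-compare seq
        // so that the earlier submission is the "greater" job on ties.
        self.priority
            .cmp(&other.priority)
            .then(other.seq.cmp(&self.seq))
    }
}
impl PartialOrd for Job {
    fn partial_cmp(&self, other: &Self) -> Option<Ordering> {
        Some(self.cmp(other))
    }
}

struct JobQueue {
    heap: BinaryHeap<Job>,
    next_seq: u64,
}

impl JobQueue {
    fn new() -> Self {
        Self { heap: BinaryHeap::new(), next_seq: 0 }
    }
    fn enqueue(&mut self, id: &str, priority: u8) {
        self.heap.push(Job { priority, seq: self.next_seq, id: id.to_string() });
        self.next_seq += 1;
    }
    fn dequeue(&mut self) -> Option<String> {
        self.heap.pop().map(|j| j.id)
    }
}
```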
- `comfy-rs run <workflow.json>` - Execute workflow
- `comfy-rs validate <workflow.json>` - Validate workflow
- `comfy-rs serve` - Start API server
- `comfy-rs models list` - List available models
- `comfy-rs info` - System information
- Interactive mode for workflow building
Deliverables:
- Production-ready API server
- CLI tool for local execution
- API-compatible with the ComfyUI frontend
- Docker image available
- LoRALoader node
- Multiple LoRA support
- Strength adjustment
- Hypernetwork support
- Embedding/Textual Inversion
- Model merging nodes
- ControlNet loader
- ControlNetApply node
- Preprocessors:
- Canny edge detection
- Depth estimation
- OpenPose
- Scribble
- Segmentation
- ImageUpscaleWithModel node
- ESRGAN support
- Real-ESRGAN support
- SwinIR support
- Tiled upscaling for large images
- Batch processing
- Advanced schedulers (DDIM, PLMS, etc.)
- Img2Img with denoising strength
- Inpainting support
- Outpainting
- Area composition (regional prompting)
- IP-Adapter support
- Flash Attention v2 integration
- Quantization support (GGML/GGUF)
- Model offloading strategies
- Mixed precision inference
- Batch processing optimization
- Smart VRAM management
- Memory profiling tools
Deliverables:
- Support for 95% of common ComfyUI workflows
- Performance benchmarks showing improvements
- Production-ready deployment guides
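Smart VRAM management ultimately comes down to deciding which loaded models to evict when a new one will not fit. A toy byte-budget LRU sketch (model names and sizes are illustrative; a real implementation would track actual device memory):

```rust
use std::collections::HashMap;

/// LRU cache with a byte budget: loading past the budget evicts
/// the least-recently-used models first.
struct ModelCache {
    budget: u64,
    used: u64,
    sizes: HashMap<String, u64>,
    lru: Vec<String>, // front = least recently used
}

impl ModelCache {
    fn new(budget: u64) -> Self {
        Self { budget, used: 0, sizes: HashMap::new(), lru: Vec::new() }
    }

    /// Mark a model as most recently used.
    fn touch(&mut self, name: &str) {
        self.lru.retain(|n| n != name);
        self.lru.push(name.to_string());
    }

    /// Load a model, evicting LRU entries as needed.
    /// Returns the names of evicted models.
    fn load(&mut self, name: &str, size: u64) -> Vec<String> {
        let mut evicted = Vec::new();
        if self.sizes.contains_key(name) {
            self.touch(name); // already resident: just refresh recency
            return evicted;
        }
        while self.used + size > self.budget && !self.lru.is_empty() {
            let victim = self.lru.remove(0);
            self.used -= self.sizes.remove(&victim).unwrap();
            evicted.push(victim);
        }
        self.used += size;
        self.sizes.insert(name.to_string(), size);
        self.touch(name);
        evicted
    }
}
```

In practice, eviction would offload weights from GPU to CPU (or drop them entirely) rather than simply forgetting them.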
- Stable Video Diffusion (SVD)
- AnimateDiff support
- Frame interpolation
- Audio generation models
- Video upscaling
- Batch frame processing
- Plugin system for custom nodes
- Node development SDK
- Web UI (standalone or integrated)
- Model hub integration (HuggingFace)
- Workflow marketplace
- Documentation site
- Tutorial videos
- Community custom nodes repository
| Component | Technology | Rationale |
|---|---|---|
| ML Framework | Candle | Rust-native, GPU support, lightweight |
| Async Runtime | Tokio | Industry standard, mature ecosystem |
| Web Server | Axum | Fast, ergonomic, built on Tokio |
| Serialization | Serde | JSON workflow parsing |
| CLI | Clap | Powerful argument parsing |
| Database | SQLite (via rusqlite) | Embedded, zero-config |
| Image Processing | image | Pure Rust, format support |
| Logging | tracing | Structured logging |
| Testing | Built-in Rust testing + criterion | Benchmarking |
- NVIDIA GPUs: CUDA support via Candle (cuDNN optional)
- AMD GPUs: ROCm support (Linux)
- Apple Silicon: Metal acceleration (M1/M2/M3)
- Intel GPUs: Experimental support
- CPU: Optimized with BLAS (MKL/OpenBLAS/Accelerate)
⚠️ Note: comfy.rs is currently in the planning/early development phase. These instructions are forward-looking.
- Rust 1.70 or higher
- CUDA Toolkit 11.8+ (for NVIDIA GPU support)
- 8GB+ RAM (16GB+ recommended)
```bash
# Clone the repository
git clone https://github.com/satishbabariya/comfy.rs
cd comfy.rs

# Build all components (CPU only)
cargo build --release

# Build with CUDA support
cargo build --release --features cuda

# Build with Metal support (macOS)
cargo build --release --features metal
```

```bash
# Start the API server
comfy-rs serve --host 0.0.0.0 --port 8188

# Execute a workflow from CLI
comfy-rs run examples/workflows/txt2img.json --output ./outputs

# Validate a workflow
comfy-rs validate my_workflow.json

# List available models
comfy-rs models list
```

Create a `config.toml` in your working directory:
```toml
[paths]
models = ["./models", "~/ComfyUI/models"]
output = "./output"
temp = "./temp"

[server]
host = "127.0.0.1"
port = 8188
max_queue_size = 100

[execution]
default_device = "cuda:0" # or "cpu", "metal"
max_vram_gb = 10
enable_model_offload = true
cache_size_gb = 4

[performance]
num_threads = 8
enable_flash_attention = true
```

```rust
pub trait Node: Send + Sync {
    /// Unique identifier for this node type
    fn node_type(&self) -> &str;

    /// Input slot definitions
    fn inputs(&self) -> Vec<InputSlot>;

    /// Output slot definitions
    fn outputs(&self) -> Vec<OutputSlot>;

    /// Execute the node
    async fn execute(
        &self,
        ctx: &ExecutionContext,
        inputs: NodeInputs,
    ) -> Result<NodeOutputs>;

    /// Validate inputs before execution
    fn validate(&self, inputs: &NodeInputs) -> Result<()>;
}
```

```rust
pub enum TensorData {
    Image(Tensor),        // [B, C, H, W]
    Latent(Tensor),       // [B, C, H/8, W/8]
    Conditioning(Tensor), // [B, T, D]
    Mask(Tensor),         // [B, 1, H, W]
}

pub enum NodeValue {
    Tensor(TensorData),
    Model(Arc<dyn ModelType>),
    Integer(i64),
    Float(f64),
    String(String),
    Boolean(bool),
    List(Vec<NodeValue>),
}
```

comfy.rs uses the ComfyUI-compatible JSON format:
```json
{
  "1": {
    "class_type": "CheckpointLoaderSimple",
    "inputs": {
      "ckpt_name": "sd_xl_base_1.0.safetensors"
    }
  },
  "2": {
    "class_type": "CLIPTextEncode",
    "inputs": {
      "text": "a beautiful sunset over mountains",
      "clip": ["1", 0]
    }
  },
  "3": {
    "class_type": "KSampler",
    "inputs": {
      "model": ["1", 0],
      "positive": ["2", 0],
      "negative": ["2", 0],
      "latent_image": ["4", 0],
      "seed": 42,
      "steps": 20,
      "cfg": 7.5,
      "sampler_name": "dpmpp_2m",
      "scheduler": "karras"
    }
  }
}
```

| Metric | Target | Baseline (Python) |
|---|---|---|
| Cold Start | < 3s | ~10s |
| Model Load | < 5s | ~8s |
| SDXL 1024x1024 (20 steps) | < 8s | ~12s |
| SD1.5 512x512 (20 steps) | < 2s | ~3.5s |
| Memory Usage | -25% | 100% |
| Binary Size | < 100MB | ~2GB (with Python) |
- Memory:
  - Smart model offloading (GPU ↔ CPU)
  - Aggressive tensor deallocation
  - Memory pooling for tensors
  - Quantization support
- Speed:
  - Flash Attention v2
  - Fused kernels
  - Parallel node execution
  - Efficient scheduling
- Deployment:
  - Static binary (no runtime dependencies)
  - Cross-compilation support
  - Minimal Docker images (< 500MB)
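Memory pooling for tensors can be illustrated with a toy pool that recycles freed buffers of a given length instead of reallocating; a real pool would also key on device and dtype:

```rust
use std::collections::HashMap;

/// Toy buffer pool: freed `Vec<f32>` allocations are kept per length
/// and handed back out on the next request of the same size.
struct BufferPool {
    free: HashMap<usize, Vec<Vec<f32>>>,
}

impl BufferPool {
    fn new() -> Self {
        Self { free: HashMap::new() }
    }

    /// Reuse a freed buffer of this length if one exists,
    /// otherwise allocate fresh.
    fn acquire(&mut self, len: usize) -> Vec<f32> {
        match self.free.get_mut(&len).and_then(|v| v.pop()) {
            Some(mut buf) => {
                buf.iter_mut().for_each(|x| *x = 0.0); // zero recycled memory
                buf
            }
            None => vec![0.0; len],
        }
    }

    /// Return a buffer to the pool for later reuse.
    fn release(&mut self, buf: Vec<f32>) {
        self.free.entry(buf.len()).or_default().push(buf);
    }
}
```

During sampling, intermediate latents of identical shape are produced at every step, so pooling avoids an allocate/free cycle per step.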
We welcome contributions! Here's how you can help:
```bash
# Clone with submodules
git clone --recursive https://github.com/satishbabariya/comfy.rs

# Run tests
cargo test --all

# Run benchmarks
cargo bench

# Check formatting
cargo fmt --check

# Run linter
cargo clippy -- -D warnings
```

- Create node implementation in `comfy-nodes/src/`
- Implement the `Node` trait
- Add tests in `tests/nodes/`
- Register in `comfy-nodes/src/lib.rs`
- Add example workflow in `examples/workflows/`
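To make the steps above concrete, here is a deliberately simplified, synchronous stand-in for the planned `Node` trait with a hypothetical `ImageInvert` node. The actual trait is async and carries slot metadata; this sketch only shows the shape of a custom-node implementation:

```rust
/// Simplified stand-in for the planned `Node` trait
/// (no async, no slot metadata, flat f32 inputs).
trait SimpleNode {
    fn node_type(&self) -> &str;
    fn execute(&self, inputs: &[f32]) -> Result<Vec<f32>, String>;
}

/// Hypothetical example node: inverts a normalized image tensor.
struct ImageInvert;

impl SimpleNode for ImageInvert {
    fn node_type(&self) -> &str {
        "ImageInvert"
    }

    fn execute(&self, inputs: &[f32]) -> Result<Vec<f32>, String> {
        // Validation step, analogous to the trait's `validate` hook.
        if inputs.iter().any(|&x| !(0.0..=1.0).contains(&x)) {
            return Err("pixel values must be in [0, 1]".into());
        }
        Ok(inputs.iter().map(|&x| 1.0 - x).collect())
    }
}
```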
- Node implementations (see Phase 2)
- Documentation and examples
- Performance optimization
- Test coverage
- Model compatibility testing
Track our progress:
- Project ideation and brainstorming
- Architecture design
- Development roadmap
- Repository setup
- Core traits implementation
- First proof-of-concept node
| Feature | Status | ComfyUI Parity |
|---|---|---|
| Workflow Loading | 🔴 Not Started | 0% |
| Basic Nodes | 🔴 Not Started | 0% |
| SD 1.5 | 🔴 Not Started | 0% |
| SDXL | 🔴 Not Started | 0% |
| LoRA | 🔴 Not Started | 0% |
| ControlNet | 🔴 Not Started | 0% |
| API Server | 🔴 Not Started | 0% |
Legend: 🔴 Not Started | 🟡 In Progress | 🟢 Complete
- Native UI: Cross-platform desktop app with Tauri/egui
- Cloud Service: Managed hosting for comfy.rs workflows
- WASM Support: Run lightweight models in browsers
- Distributed Execution: Multi-GPU, multi-node execution
- Model Training: Not just inference, but fine-tuning support
- Custom Model Support: Easy integration of new model architectures
- Novel sampling algorithms optimized for Rust
- Advanced caching strategies
- Distributed inference
- On-device mobile deployment
MIT License - see LICENSE file for details
- ComfyUI team for the original implementation and workflow design
- HuggingFace for the Candle framework
- Stability AI for Stable Diffusion models
- The Rust ML community
- Discussions: GitHub Discussions
- Issues: GitHub Issues
- Discord: Coming soon
- Twitter/X: Coming soon
Built with ❤️ in Rust
Making AI workflows faster, lighter, and more reliable
⭐ Star us on GitHub | 📖 Documentation | 🤝 Contributing