RabbitMQ Worker for KoboldAI

A message queue worker that enables parallel request processing for KoboldAI instances. This worker bridges RabbitMQ message queues with KoboldAI's inference API, allowing multiple generation requests to be queued and processed efficiently.

Background

KoboldAI is structured to process only one generation request at a time. This worker solves that limitation by acting as an intermediary - receiving requests from a RabbitMQ queue, forwarding them to KoboldAI, and returning results to another queue for downstream processing.

Features

Message Queue Integration: Connects to RabbitMQ to consume requests and publish results
Dual API Mode Support: Compatible with both OpenAI format (/v1/completions) and KoboldAI legacy format (/api/v1/generate)
Docker Support: Containerized deployment with Docker and docker-compose
Graceful Shutdown: Proper connection management on shutdown

Technology Stack

Language: Python 3.9+
Message Broker: RabbitMQ (via pika)
HTTP Client: requests
Deployment: Docker, docker-compose

Installation

Install Python dependencies:

pip install -r requirements.txt

Set up RabbitMQ (optional - using docker-compose):

docker-compose up -d

Configuration

All parameters are passed via command-line arguments:

Parameter	Description
`-u`, `--user`	RabbitMQ username
`-p`, `--password`	RabbitMQ password
`-rh`, `--rabbitmq-host`	RabbitMQ host address
`-rp`, `--rabbitmq-port`	RabbitMQ port (default: 5672)
`-pl`, `--poll-queue`	Input queue name to consume from
`-pu`, `--push-queue`	Output queue name to publish results
`-kh`, `--koboldai-host`	KoboldAI server URL
`-am`, `--api-mode`	API mode: `openai` or `legacy` (default: legacy)

Usage

Start the worker:

python rabbitmq.py \
  -u RABBITMQ_USER \
  -p RABBITMQ_PASSWORD \
  -rh rabbitmq.host \
  -rp 5672 \
  -pl "pygmalion_requests" \
  -pu "pygmalion_results" \
  -kh http://koboldai.host:5000

Send a test message:

python rabbitmq_test.py

Message Format

Request:

{
  "MessageID": "unique-id",
  "MessageBody": {
    "prompt": "Your prompt here...",
    "temperature": 0.7,
    "max_tokens": 500
  },
  "MessageMetadata": {}
}

Response:

{
  "MessageID": "unique-id",
  "MessageMetadata": {},
  "ResultStatus": "success",
  "ResultBody": {
    "text": "Generated text..."
  }
}

Docker Deployment

# Build the image
docker build -t koboldai-rabbitmq-worker .

# Run the container
docker run -d \
  -e RABBITMQ_USER=username \
  -e RABBITMQ_PASSWORD=password \
  -e RABBITMQ_HOST=rabbitmq \
  -e POLL_QUEUE=requests \
  -e PUSH_QUEUE=results \
  -e INFERENCE_SERVER_HOST=http://koboldai:5000 \
  koboldai-rabbitmq-worker

Project Structure

koboldai-rabbitmq-worker/
├── rabbitmq.py           # Main worker implementation
├── rabbitmq_test.py      # Test script for sending messages
├── requirements.txt      # Python dependencies
├── Dockerfile            # Docker image definition
├── docker-compose.yml    # RabbitMQ service for local development
└── run.sh                # Startup script with auto-restart

Documentation Links

License

AGPL - See LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RabbitMQ Worker for KoboldAI

Background

Features

Technology Stack

Installation

Configuration

Usage

Message Format

Docker Deployment

Project Structure

Documentation Links

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.planning/codebase		.planning/codebase
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
rabbitmq.py		rabbitmq.py
rabbitmq_test.py		rabbitmq_test.py
requirements.txt		requirements.txt
run.sh		run.sh

Folders and files

Latest commit

History

Repository files navigation

RabbitMQ Worker for KoboldAI

Background

Features

Technology Stack

Installation

Configuration

Usage

Message Format

Docker Deployment

Project Structure

Documentation Links

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages