feat: Add LlamaCpp support for local model hosting for faster inference #5
Open
MAXNORM8650 wants to merge 7 commits into context-labs:main from
Conversation
- Add LlamaCppProvider with background server management
- Support for TinyLlama, Gemma-3-4B, and SmolLM3 models
- Automatic model downloading from Hugging Face
- Silent server operation with clean command output
- Add sample configurations for different models
- Update README with LlamaCpp setup instructions
- Server runs in background until manually stopped

Supports models:
- tinyllama-1.1b (fast, basic responses)
- gemma-3-4b (balanced quality/speed)
- smollm3-3b (small, efficient)

Usage: Set LLAMA_DIR env var and use config commands to switch models
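The "background server management" and "silent server operation" described above can be sketched as follows. This is a minimal illustration, not the PR's actual provider code: it assumes the `llama-server` binary that ships with llama.cpp is on `PATH`, and the helper names are hypothetical.

```python
import subprocess

def server_argv(model_path: str, port: int = 8080) -> list[str]:
    # llama.cpp ships a `llama-server` binary; -m and --port are its
    # standard flags for model path and listening port.
    return ["llama-server", "-m", model_path, "--port", str(port)]

def start_llama_server(model_path: str, port: int = 8080) -> subprocess.Popen:
    # Silent, backgrounded server: discard the server's log output so
    # CLI command output stays clean, and start a new session so the
    # process keeps running until stopped manually.
    return subprocess.Popen(
        server_argv(model_path, port),
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
        start_new_session=True,
    )
```

The detached process keeps serving after the launching command exits, matching the "server runs in background until manually stopped" behavior in the description.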
Contributor
@MAXNORM8650 Can you resolve conflicts?
added 6 commits on August 18, 2025 at 00:47
Author
I have resolved all the conflicts except the ones related to the new features, e.g.
Author
Can you please look into these two conflicts, which are related to the new llama.cpp features? I am not sure how to resolve them, as they both involve the new llama.cpp code.
Contributor
Please fix the conflicts.
@samheutmaker Hey, can I look into resolving the conflicts?
Supports models through a simple config and can be extended with all SmolLM models:
Usage: Set LLAMA_DIR env var and use config commands to switch models
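A usage sketch of the environment setup described above. The `LLAMA_DIR` variable comes from the PR description; the config commands are not spelled out there, so the ones shown in comments are hypothetical placeholders for whatever the updated README documents.

```shell
# LLAMA_DIR tells the provider where local GGUF models live
# (downloaded automatically from Hugging Face on first use).
export LLAMA_DIR="$HOME/models/llama"

# Hypothetical config commands to switch models -- the actual command
# names are in the README added by this PR:
#   <cli> config set model tinyllama-1.1b   # fast, basic responses
#   <cli> config set model gemma-3-4b       # balanced quality/speed
#   <cli> config set model smollm3-3b       # small, efficient
```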