Skip to content
@llm-d-incubation

llm-d incubation

Incubating components of @llm-d, a Kubernetes-native high-performance distributed LLM inference framework

Popular repositories Loading

  1. llm-d-infra llm-d-infra Public archive

    llm-d helm charts and deployment examples

    Go Template 58 57

  2. llm-d-modelservice llm-d-modelservice Public

    helm charts for deploying models with llm-d

    Go Template 31 63

  3. llm-d-planner llm-d-planner Public

    Python 21 10

  4. llm-d-fast-model-actuation llm-d-fast-model-actuation Public

    Kubernetes controllers for fast model actuation using vLLM sleep/wake and launcher-based model swapping

    Go 16 16

  5. llm-d-async llm-d-async Public

    Asynchronous Processor for Inference Gateway. Orchestrator of queues

    Go 10 21

  6. py-inference-scheduler py-inference-scheduler Public

    Python based inference-scheduler for Reinforcement Learning

    Python 9 9

Repositories

Showing 10 of 18 repositories

Top languages

Loading…

Most used topics

Loading…