Skip to content
@llm-d-incubation

llm-d incubation

Incubating components of llm-d, a Kubernetes-native high-performance distributed LLM inference framework

Popular repositories Loading

  1. llm-d-infra llm-d-infra Public

    llm-d helm charts and deployment examples

    Shell 36 34

  2. inferno-autoscaler inferno-autoscaler Public

    Go 14 10

  3. llm-d-modelservice llm-d-modelservice Public

    helm charts for deploying models with llm-d

    Smarty 14 22

  4. llm-d-ci llm-d-ci Public

    Shell 2 2

  5. ig-wva ig-wva Public

    Workload Variant Autoscaler is a service to compute the cost-optimal provisioning of heterogeneous accelerators for inference workloads with varying request latency objectives

    Jupyter Notebook 1 1

  6. llm-d-fast-model-actuation llm-d-fast-model-actuation Public

    5

Repositories

Showing 6 of 6 repositories

Top languages

Loading…

Most used topics

Loading…