
Add interactive installer script (GAIE + BBR) under tools/installation #2527

Open
mkoushni wants to merge 5 commits into kubernetes-sigs:main from mkoushni:tools/installation-script

Conversation

@mkoushni mkoushni commented Mar 9, 2026

Goals

Adds an interactive installer script at tools/installation/install.sh that runs the full getting-started flow for the Gateway API Inference Extension (GAIE) with Body-Based Routing (BBR).

Flow

  1. Prerequisites – Checks for kubectl, helm, kind, jq, curl
  2. Cluster – Create a kind cluster or use an existing one
  3. LoadBalancer – Optional MetalLB install (for kind)
  4. Gateway API CRDs – Standard or Experimental channel, per the Gateway API installation guide (required before Istio). Uses v1.2.1 and server-side apply for compatibility with older Kubernetes versions
  5. Model server – vLLM Simulator (no GPU)
  6. Inference Extension CRDs – From latest GAIE release
  7. Gateway controller – Istio with inference extension enabled
  8. Inference Gateway – Deploy Istio gateway
  9. InferencePool + EPP – Helm install from registry.k8s.io
  10. Verify – HTTPRoute and InferencePool status
  11. InferenceObjective – Optional
  12. Test – Sample completion request
  13. BBR – Body-Based Routing (second pool, LoRA configmap, multi-pool routing)
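
Step 1 of the flow above could be sketched roughly as follows. This is a minimal illustration, not the code in `install.sh`: the names `required_tools`, `missing_tools`, and `check_prerequisites` are hypothetical, and only the tool list comes from the description.

```sh
# Hypothetical sketch of a prerequisites check; names are illustrative,
# not taken from install.sh. The tool list mirrors step 1 above.
required_tools="kubectl helm kind jq curl"

missing_tools() {
  # Print the name of each argument that cannot be found on PATH.
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

check_prerequisites() {
  # Return 0 when every required tool is present, 1 otherwise.
  # $required_tools is intentionally unquoted so it splits into words.
  missing=$(missing_tools $required_tools)
  if [ -n "$missing" ]; then
    printf 'Missing prerequisites: %s\n' "$(printf '%s' "$missing" | tr '\n' ' ')" >&2
    return 1
  fi
  printf 'All prerequisites found.\n'
}
```

A script built this way would typically call `check_prerequisites || exit 1` before touching the cluster, so a missing tool is reported up front rather than midway through the install.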

References

Usage

```sh
./tools/installation/install.sh            # interactive install
./tools/installation/install.sh --dry-run  # print commands only
./tools/installation/install.sh --cleanup  # remove installed resources
./tools/installation/install.sh --help     # help
```
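
One common way to support flags like these is a small argument parser plus a wrapper that prints commands instead of running them in dry-run mode. The sketch below assumes that pattern; `DRY_RUN`, `MODE`, `parse_args`, and `run` are hypothetical names and may not match what `install.sh` actually does.

```sh
# Hypothetical sketch of flag handling and a dry-run wrapper; the variable
# and function names are illustrative, not taken from install.sh.
DRY_RUN=false
MODE=install

parse_args() {
  # Map each supported flag onto the two state variables above.
  for arg in "$@"; do
    case "$arg" in
      --dry-run) DRY_RUN=true ;;
      --cleanup) MODE=cleanup ;;
      --help)    MODE=help ;;
      *) printf 'Unknown option: %s\n' "$arg" >&2; return 2 ;;
    esac
  done
}

run() {
  # In dry-run mode, print the command; otherwise execute it unchanged.
  if [ "$DRY_RUN" = true ]; then
    printf '+ %s\n' "$*"
  else
    "$@"
  fi
}
```

With this shape, every cluster-changing call goes through `run` (for example `run kubectl apply -f manifest.yaml`), so `--dry-run` needs no special handling at individual call sites.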

- Install script: kind, MetalLB, Gateway API CRDs (v1.2.1), vLLM simulator,
  Inference Extension CRDs, Istio, Inference Gateway, InferencePool/EPP, and BBR
- Prerequisites: kubectl, helm, kind, jq, curl
- Follows gateway-api-inference-extension.sigs.k8s.io and
  gateway-api.sigs.k8s.io (Gateway API CRDs before Istio)
- Gateway API installed with server-side apply; v1.2.1 for older K8s compatibility
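
The Gateway API CRD step might look something like the sketch below. It assumes the manifests are fetched from the `standard-install.yaml` and `experimental-install.yaml` assets on the Gateway API GitHub release page; the variable and function names are hypothetical and not taken from `install.sh`.

```sh
# Hypothetical sketch of the Gateway API CRD install step; names are
# illustrative. The version and the use of server-side apply come from
# the PR description; the URL pattern is an assumption.
GATEWAY_API_VERSION="v1.2.1"

gateway_api_manifest_url() {
  # $1 is the release channel: "standard" or "experimental".
  case "$1" in
    standard|experimental)
      printf 'https://github.com/kubernetes-sigs/gateway-api/releases/download/%s/%s-install.yaml\n' \
        "$GATEWAY_API_VERSION" "$1" ;;
    *)
      printf 'Unknown channel: %s\n' "$1" >&2
      return 1 ;;
  esac
}

install_gateway_api_crds() {
  # Resolve the manifest URL, then apply it with server-side apply.
  url=$(gateway_api_manifest_url "$1") || return 1
  kubectl apply --server-side -f "$url"
}
```

Calling `install_gateway_api_crds standard` before installing Istio matches the ordering noted above (Gateway API CRDs first).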

netlify bot commented Mar 9, 2026

Deploy Preview for gateway-api-inference-extension ready!

| Name | Link |
|---|---|
| 🔨 Latest commit | 4379f31 |
| 🔍 Latest deploy log | https://app.netlify.com/projects/gateway-api-inference-extension/deploys/69c120f08432880008c4ae84 |
| 😎 Deploy Preview | https://deploy-preview-2527--gateway-api-inference-extension.netlify.app |

@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: mkoushni
Once this PR has been reviewed and has the lgtm label, please assign danehans for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

linux-foundation-easycla bot commented Mar 9, 2026

CLA Signed

The committers listed above are authorized under a signed CLA.

  • ✅ login: mkoushni / name: Marina Koushnir (4379f31)

@k8s-ci-robot
Contributor

Welcome @mkoushni!

It looks like this is your first PR to kubernetes-sigs/gateway-api-inference-extension 🎉. Please refer to our pull request process documentation to help your PR have a smooth ride to approval.

You will be prompted by a bot to use commands during the review process. Do not be afraid to follow the prompts! It is okay to experiment. Here is the bot commands documentation.

You can also check if kubernetes-sigs/gateway-api-inference-extension has its own contribution guidelines.

You may want to refer to our testing guide if you run into trouble with your tests not passing.

If you are having difficulty getting your pull request seen, please follow the recommended escalation practices. Also, for tips and tricks in the contribution process you may want to read the Kubernetes contributor cheat sheet. We want to make sure your contribution gets all the attention it needs!

Thank you, and welcome to Kubernetes. 😃

@k8s-ci-robot k8s-ci-robot added needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. cncf-cla: no Indicates the PR's author has not signed the CNCF CLA. labels Mar 9, 2026
@k8s-ci-robot
Contributor

Hi @mkoushni. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work.

Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@k8s-ci-robot k8s-ci-robot added the size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. label Mar 9, 2026
@k8s-ci-robot k8s-ci-robot added size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. cncf-cla: no Indicates the PR's author has not signed the CNCF CLA. labels Mar 9, 2026
@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 18, 2026
Resolve lora-syncer initContainer conflicts: keep registry.k8s.io
lora-syncer:v1.2.1 with restartPolicy and IfNotPresent from upstream.

Made-with: Cursor
@k8s-ci-robot k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 23, 2026
@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Apr 3, 2026
@k8s-ci-robot
Contributor

PR needs rebase.


