Skip to content

Commit 126da57

Browse files
committed
Add additional Ollama README.md updates
1 parent 75a09bb commit 126da57

1 file changed

Lines changed: 5 additions & 4 deletions

File tree

README.md

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ If you want to implement this for yourself, please follow the [setup document](.
1919
- Metal LB: An on-prem native software load balancer
2020
- Monitoring: Prometheus Operator with Grafana using storage PVC through Longhorn
2121
- n8n: Workflow automation platform with persistent storage
22+
- Ollama: Runtime for AI models on a GPU node
2223
- Dashboard UI for:
2324
- Argo CD: For controlling deployments and rollbacks
2425
- Grafana: For building dashboards against Prometheus data
@@ -73,7 +74,7 @@ Here's the hardware list of what each of the control/worker nodes is using:
7374
3. [Samsung 2TB NVMe SSD](https://www.amazon.com/dp/B0DHLCRF91)
7475
4. [256gb Micro SD Card](https://www.amazon.com/dp/B08TJZDJ4D)
7576

76-
> It's worth noting that one of my nodes is a computer running Ubuntu with a nice GPU, but that's really outside the scope of any guides I'd give for deploying this repository. The only part of this that will impact you is any apps that have node affinity for that setup, but you can easily remove that from your own deployments.
77+
> It's worth noting that one of my nodes is a computer running Ubuntu with a nice GPU, but that's really outside the scope of any guides I'd give for deploying this repository. The only part of this that will impact you is any apps that have node affinity for that setup (like the `nvidia-device-plugin-app` and `ollama-app` deployments), but you can easily remove that from your own deployments.
7778
>
7879
> The rest of the nodes are Raspberry Pi 5s as described above.
7980
@@ -83,19 +84,19 @@ Here's the hardware list of what each of the control/worker nodes is using:
8384
- [Cert Manager](https://cert-manager.io/)
8485
- [Grafana](https://grafana.com/)
8586
- [Grafana Loki](https://grafana.com/docs/loki/latest/)
86-
- [Grafana Promtail](https://grafana.com/docs/loki/latest/send-data/promtail/) (soon to be removed for Grafana Alloy)
87+
- [Grafana Promtail](https://grafana.com/docs/loki/latest/send-data/promtail/)
8788
- [Helm](https://helm.sh/docs/)
8889
- [Kubernetes](https://kubernetes.io/), specifically [K3s](https://k3s.io/)
8990
- [Longhorn](https://longhorn.io/)
9091
- [Metal LB](https://metallb.io/)
9192
- [n8n](https://n8n.io/)
9293
- [Nvidia Container Toolkit](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/index.html)
94+
- [Ollama](https://ollama.com/) (vLLM does not support RTX 5090 FE cards right now)
9395
- [OpenFaaS](https://www.openfaas.com/) (coming soon)
94-
- [Prometheus](https://prometheus.io/) (including Prometheus Operator)
96+
- [Prometheus](https://prometheus.io/) and [Prometheus Operator](https://github.com/prometheus-operator/prometheus-operator)
9597
- [Sealed Secrets](https://github.com/bitnami-labs/sealed-secrets)
9698
- [Terraform](https://developer.hashicorp.com/terraform)
9799
- [Traefik](https://traefik.io/traefik)
98-
- [vLLM](https://docs.vllm.ai/en/latest/index.html)
99100

100101
## 🙇🏻‍♂️ Acknowledgements
101102

0 commit comments

Comments
 (0)