Kubenatives Newsletter

Kubenatives Newsletter

Share this post

Kubenatives Newsletter
Kubenatives Newsletter
Kubernetes Was Never Meant for LLMs - So Why Are We Doing It Anyway?

Kubernetes Was Never Meant for LLMs - So Why Are We Doing It Anyway?

Sharon Sahadevan's avatar
Sharon Sahadevan
Jun 25, 2025
∙ Paid
2

Share this post

Kubenatives Newsletter
Kubenatives Newsletter
Kubernetes Was Never Meant for LLMs - So Why Are We Doing It Anyway?
1
Share

In 2023, Kubernetes became the go-to platform for everything.
In 2024, we started asking it to serve Large Language Models (LLMs).
And in 2025, it’s starting to buckle under the weight.

Because let’s face it:

Kubernetes wasn’t designed to schedule 350GB models across eight GPUs with 400MB/s read speeds from a PVC.

But we’re doing it anyway.

Why?

Keep reading with a 7-day free trial

Subscribe to Kubenatives Newsletter to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 Sharon Sahadevan
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share