AI & LLMs

Practical guides for running AI workloads, this section was built while learning the domain, with AI as a collaborator. From local LLMs on your machine to deploying models on Kubernetes with GPU scheduling, autoscaling, and observability.

Understand the foundations

Run locally first

The fastest way to get started with LLMs is to run them on your own machine. No API keys, no subscription, full control over the model.

Bring it to Kubernetes

Coming soon

The next step: taking these models to Kubernetes. GPU node pools, model serving, and more.