Olares and HAMi: Desktop AI Workstation Inflection
HAMi moves from cluster to desktop with Olares.
Olares and HAMi: Desktop AI Workstation Inflection
HAMi moves from cluster to desktop with Olares.
Why GPU Is the Foundation of AI
A GPU explainer for Kubernetes veterans new to AI. Maps token, model, training, inference, Transformer, Tensor Core, HBM, and KV cache to concepts you already know.
From GPU utilization to productive GPU-hours.
From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure
From GPU hardware, Kubernetes scheduling, inference engines to token cost — understanding the 8-layer observability architecture for modern AI infrastructure.
Token Is More Than a Billing Unit, It's Becoming the Resource Unit of the AI Era
The Linux Foundation’s Tokenomics Foundation signals a shift: tokens are becoming a core resource in the AI era, much like CPUs in the cloud era.
Kubernetes as the GPU Control Plane for AI
Observations on the evolution of AI infrastructure control planes, focusing on HAMi v2.9, GPU scheduling, and Kubernetes resource models.
Kubernetes's Anxiety and Rebirth in the AI Wave
At KubeCon EU 2026, I witnessed Kubernetes’ anxiety and transformation in the AI era. This article explores the challenges and future opportunities for Kubernetes in the age of AI.
KubeCon EU 2026 Day One Observations
KubeCon Europe 2026 Day One: How Kubernetes is adapting to the AI infrastructure wave and the evolution of the GPU resource layer.
When GPUs Move Toward Open Scheduling: Structural Shifts in AI Native Infrastructure
A CTO/VP view on open GPU scheduling: CDI, Kubernetes DRA, virtualization data planes, ecosystem governance, and lock-in risk.
Standing on Giants' Shoulders: The Traditional Infrastructure Powering Modern AI
Before ChatGPT and TensorFlow, there was Hadoop, Kafka, and Kubernetes. This post honors the traditional open source infrastructure that became the foundation of today’s AI revolution.
From Cloud Native to AI Native: Why Kubernetes Is the Foundation for Next-Gen AI Agents
Explores why AI Agents need Kubernetes infrastructure and how Agent orchestration, MCP services, and AI gateways enable production-ready AI architectures.
How ARK uses cloud-native architecture and declarative runtime to drive engineering adoption of multi-agent systems and shape the Agentic Runtime ecosystem.