Why GPU Is the Foundation of AI
A GPU explainer for Kubernetes veterans new to AI. Maps token, model, training, inference, Transformer, Tensor Core, HBM, and KV cache to concepts you already know.
In-depth articles and insights on open source, AI, cloud-native, DevOps, and software engineering.
From GPU utilization to productive GPU-hours.
Agentic AI Infrastructure Reliability
A practical AI infrastructure review of agentic AI reliability, covering a five-dimension framework, fault tolerance, recovery, observability, and hybrid architecture design.
From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure
From GPU hardware and Kubernetes scheduling to inference engines and token cost: understanding the 8-layer observability architecture for modern AI infrastructure.
How I built a personal AI infrastructure using ChatGPT, OpenClaw, Obsidian, GitHub, Lark, GLM-5.1, and a Mac mini M4.
Token Is More Than a Billing Unit, It's Becoming the Resource Unit of the AI Era
The Linux Foundation’s Tokenomics Foundation signals a shift: tokens are becoming a core resource in the AI era, much like CPUs in the cloud era.
AI Native Landscape Launches as a Standalone Site
AI Native Landscape has moved to landscape.jimmysong.io with 600+ curated open-source projects, AI skill search support, and a call for community contributions.
Kubernetes as the GPU Control Plane for AI
Observations on the evolution of AI infrastructure control planes, focusing on HAMi v2.9, GPU scheduling, and Kubernetes resource models.
Kubernetes's Anxiety and Rebirth in the AI Wave
At KubeCon EU 2026, I witnessed Kubernetes's anxiety and transformation in the AI era. This article explores the challenges and opportunities ahead for Kubernetes.
KubeCon EU 2026 Day One Observations
KubeCon Europe 2026 Day One: How Kubernetes is adapting to the AI infrastructure wave and the evolution of the GPU resource layer.