Olares and HAMi: Desktop AI Workstation Inflection
HAMi moves from cluster to desktop with Olares.
Olares and HAMi: Desktop AI Workstation Inflection
HAMi moves from cluster to desktop with Olares.
Why GPU Is the Foundation of AI
A GPU explainer for Kubernetes veterans new to AI. Maps token, model, training, inference, Transformer, Tensor Core, HBM, and KV cache to concepts you already know.
From GPU utilization to productive GPU-hours.
Agentic AI Infrastructure Reliability
A practical AI Infra review of Agentic AI reliability, covering a five-dimension framework, fault tolerance, recovery, observability, and hybrid architecture design.
From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure
From GPU hardware, Kubernetes scheduling, inference engines to token cost — understanding the 8-layer observability architecture for modern AI infrastructure.
Token Is More Than a Billing Unit, It's Becoming the Resource Unit of the AI Era
The Linux Foundation’s Tokenomics Foundation signals a shift: tokens are becoming a core resource in the AI era, much like CPUs in the cloud era.
Standing on Giants' Shoulders: The Traditional Infrastructure Powering Modern AI
Before ChatGPT and TensorFlow, there was Hadoop, Kafka, and Kubernetes. This post honors the traditional open source infrastructure that became the foundation of today’s AI revolution.
2025 Year in Review: How AI Is Shifting the Focus of Software Engineering
In 2025, software engineering shifts from code-centric to runtime and cost governance. AI and Agents move complexity to runtime, compute, and budget layers, reshaping engineering value.
Analyzing Ark from architecture, semantics, community activity, and engineering paradigms to reveal its impact on 2026 AI Infra trends and the ArkSphere community.