AI + machine learning

How UC Berkeley students use AI as a learning partner

Deven Goratela 27 March 2026

AI has made it easier than ever for student developers to work efficiently, tackle harder problems, and...

llm-d officially a CNCF Sandbox project

Deven Goratela 26 March 2026

Beyond intelligent routing, orchestrating multi-node AI deployments requires bulletproof underlying primitives, which is why Google leads the...

Kubernetes device management with DRA Dynamic Resource Allocation

Deven Goratela 25 March 2026

The explosion of large language models (LLMs) has increased demand for high-performance accelerators like GPUs and TPUs....

FabCon and SQLCon 2026: Unifying databases and Fabric on a single data platform

Deven Goratela 24 March 2026

We’re bringing attendees together to share real experiences and solve challenges side by side. Only together can we move...

Training large models on Ironwood TPUs

Deven Goratela 23 March 2026

This technical overview explores the specific methods and tools within the JAX and MaxText ecosystems designed to...

Google Cloud AI infrastructure at NVIDIA GTC 2026

Deven Goratela 22 March 2026

The era of agentic AI is fundamentally changing enterprise infrastructure needs. As organizations build systems capable of...

Multi-cluster GKE Inference Gateway helps scale AI workloads

Deven Goratela 18 March 2026

With this release, the system uses Kubernetes Custom Resources to manage your distributed inference service. InferencePool resources...

Microsoft at NVIDIA GTC: New solutions for Microsoft Foundry, Azure AI infrastructure and Physical AI

Deven Goratela 17 March 2026

Microsoft combines accelerated computing with cloud scale engineering to bring advanced AI capabilities to our customers. For...

Introducing Fireworks AI on Microsoft Foundry: Bringing high performance, low latency open model inference to Azure

Deven Goratela 15 March 2026

We’re announcing the public preview of Fireworks AI on Microsoft Foundry, bringing high‑performance open model inference into...

Reduce 429 errors on Vertex AI

Deven Goratela 15 March 2026

Default options: The default option with Gemini on Vertex AI is Standard Pay-as-you-go (Paygo). For Standard Pay-as-you-go...
