Architecting in Azure

PTU Spillover and Token Tracking in Microsoft Foundry

May 18, 2026

Why spillover matters for cost attribution, chargeback, and observability in provisioned AI deployments

In this post, I’ll walk through how PTU spillover works in Microsoft Foundry, why it complicates token tracking, and what you…
Token-Level Chargeback for Azure OpenAI Batch API Using APIM, Event Hub, and Durable Functions

April 3, 2026

Batch APIs are fantastic for high-volume, asynchronous workloads—but they come with a challenge every architect eventually runs into: How do you track and charge back token consumption when thousands of completions are bundled into…
Tracking Azure OpenAI Token Usage by an Application’s Client ID in APIM

January 30, 2026

Because “who’s using my tokens?” is always a fun question—especially at chargeback time. Disclaimer: The views and opinions expressed here are my own and do not necessarily reflect those of my employer or any organization…
Implementing Token‑Based Chargeback for Azure OpenAI PTU Deployments Using API Management

December 31, 2025

Disclaimer: The views and opinions expressed here are my own and do not necessarily reflect those of my employer or any organization with which I am affiliated. Hello, my name is Scott Ray, and…

This is how it all started…

After 30 years in IT—working across engineering, architecture, operations, and now serving as a Principal Cloud Solution Architect at Microsoft—I realized it was time to create a place where I can share the ideas, patterns, and practical guidance that continue to shape my work. This blog is the beginning of that journey. I’ll be using it to publish posts that break down real-world architecture decisions, explore emerging cloud capabilities, and highlight the lessons learned from helping customers modernize and innovate at scale. While the primary focus will be on Azure and cloud architecture, I’ll also weave in the occasional personal perspective—because after three decades in this industry, the future isn’t just about the technology I work with, but the experiences I continue to grow from. This space will evolve with the challenges, opportunities, and conversations that lie ahead.

DISCLAIMER
The views and opinions expressed here are my own and do not necessarily reflect those of my employer or any organization with which I am affiliated.

Scott Ray

Principal Cloud Solution Architect
