-
PTU Spillover and Token Tracking in Microsoft Foundry
Why spillover matters for cost attribution, chargeback, and observability in provisioned AI deployments. In this post, I’ll walk through how PTU spillover works in Microsoft Foundry, why it complicates token tracking, and what you…
-
Token-Level Chargeback for Azure OpenAI Batch API Using APIM, Event Hub, and Durable Functions
Batch APIs are fantastic for high-volume, asynchronous workloads—but they come with a challenge every architect eventually runs into: How do you track and charge back token consumption when thousands of completions are bundled into…
-
Tracking Azure OpenAI Token Usage by an Application’s Client ID in APIM
Because “who’s using my tokens?” is always a fun question—especially at chargeback time. Disclaimer: The views and opinions expressed here are my own and do not necessarily reflect those of my employer or any organization…
-
Implementing Token‑Based Chargeback for Azure OpenAI PTU Deployments Using API Management
Disclaimer: The views and opinions expressed here are my own and do not necessarily reflect those of my employer or any organization with which I am affiliated. Hello, my name is Scott Ray, and…








