408 and 503 returns from Mistral Document AI
Ian Van Dort
We are experiencing a sustained reliability issue with the Mistral Document AI (mistral-document-ai-2512) model deployed via Azure AI Foundry.
Issue summary:
We are seeing a recurring pattern of two error classes from the Mistral Document AI endpoints:
- HTTP 503 "no healthy upstream" (returned after ~10–15 seconds)
- HTTP 408 "upstream request timeout" (returned after ~60 seconds)
These failures occur multiple times per day across multiple EU regions and are directly impacting our production document processing pipeline.
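To make the two error classes concrete, here is a minimal sketch of how failed responses like these could be bucketed per region before sharing counts. It is an illustration only, not our production code: `Failure`, `classify`, `tally`, and the region labels are hypothetical names, and the matching assumes the error text appears in the response body exactly as quoted above.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass(frozen=True)
class Failure:
    """One failed request (hypothetical record shape)."""
    status: int        # HTTP status code returned by the endpoint
    elapsed_s: float   # wall-clock seconds until the error came back
    body: str          # response body text


def classify(failure: Failure) -> str:
    """Map a failed response onto the error classes described above."""
    body = failure.body.lower()
    if failure.status == 503 and "no healthy upstream" in body:
        return "503_no_healthy_upstream"
    if failure.status == 408 and "upstream request timeout" in body:
        return "408_upstream_timeout"
    if failure.status == 429:
        # Tracked separately so rate limiting can be ruled in or out.
        return "429_rate_limited"
    return "other"


def tally(events: Iterable[Tuple[str, Failure]]) -> Counter:
    """Count failures per (region, error class) pair."""
    return Counter((region, classify(f)) for region, f in events)


if __name__ == "__main__":
    # Made-up sample events; "region-a" and "region-b" are placeholders.
    sample = [
        ("region-a", Failure(503, 12.4, "no healthy upstream")),
        ("region-a", Failure(408, 60.1, "upstream request timeout")),
        ("region-b", Failure(503, 10.9, "no healthy upstream")),
    ]
    for key, count in sorted(tally(sample).items()):
        print(key, count)
```

A per-region tally of this kind separates the fast 503s from the slow 408s and makes the absence of 429s explicit.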
Setup context:
- We run multiple regional Mistral Document AI deployments across EU Azure regions for redundancy.
- We use a load-balancing proxy that enforces rate caps well below documented Azure quotas.
- Real traffic is well below both our self-imposed caps and any documented quota limits.
- We are seeing no HTTP 429 responses — only 503 and 408 — so we do not believe this is a rate-limit issue.
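For context, the regional redundancy described above can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not our actual proxy: `call_with_failover`, the `send` callable, and the region labels are hypothetical, and the real transport, authentication, and rate capping are omitted.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# Statuses treated as "this region is unhealthy, try the next one".
RETRYABLE_STATUSES = frozenset({408, 503})

Response = Tuple[int, str]  # (HTTP status, response body)


def call_with_failover(
    regions: Sequence[str],
    send: Callable[[str], Response],
) -> Tuple[Optional[str], Response, List[Tuple[str, int]]]:
    """Try each regional deployment in order until one does not return 408/503.

    Returns (region that answered or None, last response, attempt log).
    Any other status, including 429, is returned to the caller unchanged.
    """
    if not regions:
        raise ValueError("at least one region is required")
    attempts: List[Tuple[str, int]] = []
    last: Response = (0, "")
    for region in regions:
        status, body = send(region)
        attempts.append((region, status))
        last = (status, body)
        if status not in RETRYABLE_STATUSES:
            return region, last, attempts
    # Every region returned 408 or 503.
    return None, last, attempts


if __name__ == "__main__":
    # Stand-in for the real HTTP call: canned responses per placeholder region.
    canned = {
        "region-a": (503, "no healthy upstream"),
        "region-b": (200, "ok"),
    }
    region, response, attempts = call_with_failover(
        ["region-a", "region-b"], lambda r: canned[r]
    )
    print(region, response, attempts)
```

With this kind of failover, a request only fails outright when every regional deployment returns 408 or 503 for it.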
Questions / requests:
- Are Mistral Document AI deployments in EU regions being scaled appropriately for steady production load? We are seeing repeated failures in specific regions over the past several weeks.
- Is there a known issue with mistral-document-ai-2512 on Azure AI Foundry returning 503/408 under load instead of proper 429 responses?
- If a known issue exists, please add us to the impact list and notify us when a fix is deployed.
We are happy to provide traces, request logs, and load patterns upon request.
Foundry Models