Batch scoring
Mass batch via Data Factory/Functions on Blob/Data Lake; ideal for reports, segmentations, periodic enrichment.
Reference architectures and patterns to bring Azure AI into enterprise apps and processes.
Technology Cluster · Back to Azure AI Technologies · Azure ML
Mass batch via Data Factory/Functions on Blob/Data Lake; ideal for reports, segmentations, periodic enrichment.
Online endpoints (AKS/ACI) for interactive experiences: recommendations, fraud detection, assistants.
Event Grid/Hub + Stream Analytics/Functions for low‑latency pipelines and controlled back‑pressure.
Enrich LLM with enterprise sources (index + embeddings) for cite‑back, up‑to‑date answers.
Registry, environments, CI/CD, monitoring and retraining; clear roles and environments separation.
Private networks, managed identity, key vault, encryption, logging, DLP and retention policies.
Ingest → Storage → Features → Train → Registry → Deploy → Monitor → Retrain
Automate with pipelines and quality gates.
Pattern | Latency | Volume | Examples |
---|---|---|---|
Batch | High (minutes‑hours) | Very high | Reports, nightly scoring |
Real‑time | Low (ms‑s) | Medium | In‑session recommendations |
Event‑driven | Low (ms‑s) | High | IoT, logs, clickstream |
RAG | Medium | Variable | Knowledge & assistants |
Track requests, queues, timings, errors and costs; define SLOs and alerts.
Retry/backoff, idempotency, circuit breakers and rate limiting on APIs.
Model/API versioning, canary/blue‑green, compatibility and rollback plan.
Autoscaling, async queues and caching; separate compute planes to avoid contention.
Latency P50/P95, error rate, throughput, cost per call, model quality (F1/AUC), drift.
Key Vault, managed identity, secret rotation and least‑privilege policies.