Azure Compare Decision Center

ContainersRegistry access impact

Best match

ACR admin user vs managed identity pull

Should image pulls use a broad registry credential or a scoped Azure identity?

ACR admin user enables broad registry username/password access; managed identity pull uses Azure identity and RBAC for secretless image access.

When to use ACR admin user

Use managed identity pull when an Azure workload can authenticate to ACR with scoped role assignment and no shared password.

When to use Managed identity pull

Use the ACR admin user only for narrow break-glass or legacy scenarios where identity-based pull is not available.

Key differences

Managed identity can be scoped, rotated by platform lifecycle, and audited through role assignments.
ACR admin user is a registry-wide credential and can become a shared secret.
Kubernetes, Container Apps, App Service, and VM image pulls each have different identity wiring steps.

Common mistake

Leaving the ACR admin user enabled because it fixed a pull failure once, then spreading the same credential across environments.

Safe next steps

Check current registry auth settings and role assignments before enabling or disabling admin user.
Verify the workload identity has AcrPull at the right scope.
Test image pull with a non-production revision, pod, or deployment before production rollout.

Related terms

Open the next path

Related runbook Command path Search all

ContainersCluster / workload impact

Best match

ACR tag vs digest

Do you need a human-friendly image label or an immutable reference to exact image content?

A tag is a mutable human-friendly reference; a digest is a content-addressed identifier for an image manifest.

When to use ACR tag

Use a tag when humans and pipelines need a readable release label such as latest, prod, or v1.2.3.

When to use digest

Use a digest when the deployment must prove the exact image bytes that are running.

Key differences

Tags can be moved to point at different image content.
Digests identify immutable image content and are better for evidence, rollback, and supply-chain checks.
Production investigations should record both the tag used and the resolved digest.

Common mistake

Deploying by a mutable tag and later being unable to prove what image actually ran during an incident.

Safe next steps

Resolve the image digest before rollout and capture it in release evidence.
Check running pods, revisions, or web app containers for the resolved image reference.
Avoid using latest for production unless the pipeline records immutable provenance elsewhere.

Related terms

Open the next path

Related runbook Command path Search all

MonitoringAlerting / notification impact

Best match

Action group vs Alert rule

Are you defining the condition that detects a problem, or the notification/action path after detection?

An alert rule decides when a signal should fire; an action group decides who or what gets notified or triggered.

When to use Alert rule

Use an alert rule to define scope, signal, condition, threshold, evaluation frequency, and severity.

When to use Action group

Use an action group to define recipients, webhooks, automation, ITSM, or other actions triggered by one or more alerts.

Key differences

Alert rules detect; action groups notify or trigger response.
One action group can be reused by many alert rules.
Bad action groups create noise even when alert logic is correct.

Common mistake

Troubleshooting alert noise only by changing thresholds, while the real problem is a shared action group or receiver configuration.

Safe next steps

List alert rules and their linked action groups.
Confirm receivers, escalation path, and suppression rules before enabling.
Test notification routing with non-production scope where possible.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataAI solution organization

Best match

AI Foundry project vs Azure AI services resource

Are you organizing an AI development workspace or controlling the deployed service endpoint that apps call?

AI Foundry projects organize development assets and collaboration; Azure AI services resources expose service endpoints, keys, identity, networking, and billing boundaries.

When to use AI Foundry project

Use an AI Foundry project when the exam scenario is about building, evaluating, or coordinating model, prompt, agent, and app assets.

When to use Azure AI services resource

Use an Azure AI services resource when the scenario is about endpoint access, keys, network rules, identity, region, quota, or billing.

Key differences

A project is a workspace-level organizing concept; a service resource is the deployable Azure control-plane boundary.
Resources carry keys, endpoints, managed identity, private endpoints, and diagnostic settings.
Projects help learners reason about app lifecycle, but production security still lives at the resource, identity, and network layers.

Common mistake

Assuming a project setting secures all deployed endpoints. Exam scenarios often separate development workspace choices from resource-level access and network controls.

Safe next steps

Identify whether the question is asking about development organization or runtime endpoint control.
Check service resource keys, identity, network rules, and deployments before release.
Open the AI services runbook before changing keys, networking, or model deployments.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataSearch ingestion path

Best match

AI Search indexer vs manual ingestion

Should the search service crawl/pull from a data source or should your application push indexed documents?

Indexers pull content from supported data sources; manual ingestion pushes prepared documents into an index from app or pipeline code.

When to use AI Search indexer

Use an indexer when the source is supported and scheduled pull/indexing behavior fits the data refresh pattern.

When to use Manual ingestion

Use manual ingestion when the content requires custom transforms, event-driven updates, non-supported sources, or strict app-controlled indexing.

Key differences

Indexers depend on data-source credentials, skillsets, schedules, and indexer status.
Manual ingestion shifts transformation, retries, and error handling into app/pipeline code.
Exam questions often reveal the answer through data-source support, refresh cadence, or required transformations.

Common mistake

Debugging query relevance while ignoring indexer failures or stale source-to-index refresh status.

Safe next steps

Check indexer status and error messages before changing index schema.
Confirm whether the source is supported by an indexer.
Use manual ingestion when transformations or event timing require app ownership.

Related terms

Open the next path

Related runbook Command path Search all

ContainersCluster / workload impact

Best match

AKS system vs user node pool

Is the node pool meant to keep AKS system components healthy, or to run application workloads?

System node pools host critical cluster components; user node pools are meant for application workloads and specialized capacity.

When to use AKS system

Use the system node pool for critical cluster services and add-ons required for the AKS control experience.

When to use user node pool

Use user node pools for application workloads, scaling profiles, GPU/spot nodes, isolation, or workload-specific sizing.

Key differences

System pools protect cluster operations; user pools carry app workload capacity.
Scaling or tainting the wrong pool can break DNS, ingress, monitoring, or critical add-ons.
User pools let teams separate cost, autoscale behavior, OS image, SKU, and workload placement.

Common mistake

Running most application pods on the system pool, then making node pool changes that affect cluster reliability.

Safe next steps

List node pools, modes, taints, labels, and current pod placement before changes.
Check system add-on health and pending pods before scaling or upgrading pools.
Plan workload movement and surge capacity before changing production node pools.

Related terms

Open the next path

Related runbook Command path Search all

ContainersCluster identity impact

Best match

AKS Workload Identity vs Managed Identity

Does a Kubernetes workload need its own Azure identity, or is the identity attached at the Azure resource level enough?

AKS Workload Identity federates Kubernetes service accounts to Entra identities; managed identity is the Azure identity used by resources or cluster components.

When to use AKS Workload Identity

Use AKS Workload Identity when individual pods or service accounts need scoped Azure access without node-level secrets.

When to use Managed Identity

Use managed identity directly when the Azure resource itself can hold the identity and pod-level federation is not needed.

Key differences

Workload Identity maps Kubernetes service accounts to Azure identity through federation.
Managed identity is the Azure identity object, but AKS workloads still need a safe way to assume it.
The blast radius depends on service account binding, federated credential setup, and assigned Azure roles.

Common mistake

Granting broad identity permissions to the cluster or node path instead of the specific workload that needs access.

Safe next steps

List service accounts, federated credentials, and role assignments before enabling access.
Use the smallest Azure scope and role that satisfies the workload.
Test token acquisition and resource access from a non-production pod before rollout.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataAI access control

Best match

API key vs managed identity for AI service access

Should the app call the AI service with a stored key or Azure identity?

API keys are shared secrets for service access; managed identity uses Azure identity and RBAC where supported for secretless access.

When to use API key

Use API keys when the service path or SDK requires key-based access and key storage/rotation is controlled.

When to use Managed identity

Use managed identity when the app and service support identity-based access and you can scope roles without storing secrets.

Key differences

Keys are simple but can leak and must be rotated.
Managed identity reduces secret handling but requires role assignment and supported service integration.
Exam scenarios often mention Key Vault, app settings, identity, or least privilege as clues.

Common mistake

Putting an AI service key directly in app settings or code because it made the first demo work.

Safe next steps

Check whether the target AI service supports identity-based access for the operation.
If keys are used, store them in Key Vault and rotate them.
List role assignments and app identity before changing access.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingReachability impact

Best match

API Management vs Application Gateway

Do you need API product management and policies, or Layer 7 ingress and WAF routing?

API Management governs APIs with products, subscriptions, operations, and policies; Application Gateway routes web traffic and can apply WAF controls.

When to use API Management

Use API Management when the job is publishing APIs, applying API policies, managing subscriptions, versioning, and developer access.

When to use Application Gateway

Use Application Gateway when the job is regional HTTP routing, TLS termination, backend pools, path rules, and WAF protection.

Key differences

API Management understands APIs, operations, products, subscriptions, policies, and developer portals.
Application Gateway understands listeners, routing rules, probes, backend pools, and WAF at the application ingress layer.
They can be used together: gateway for ingress/WAF, APIM for API governance.

Common mistake

Expecting Application Gateway to replace API lifecycle management, or using APIM as a generic WAF/load balancer.

Safe next steps

Draw the request path from client to backend before choosing.
Check TLS, private networking, WAF, authentication, API policies, and subscription-key needs.
Review existing routes, products, policies, and backend health before changing exposure.

Related terms

Open the next path

Related runbook Command path Search all

App hostingHosting / scale impact

Best match

App Service plan vs Functions hosting plan

Are you hosting a long-running web app/API, or event-driven functions with plan-specific scaling behavior?

App Service plans allocate web app compute; Functions hosting plans determine event-driven compute scale, cold-start, networking, and runtime behavior.

When to use App Service plan

Use an App Service plan when hosting web apps, APIs, and containers that need app service features, scaling, and predictable web hosting behavior.

When to use Functions hosting plan

Use a Functions hosting plan when the workload is event-driven and plan choice affects scale, cold start, networking, and execution limits.

Key differences

App Service plan capacity is shared by App Service apps.
Functions plans include Consumption, Premium, and Dedicated choices with different scaling and networking behavior.
A Function App can run on an App Service plan, but that is not the same as the Functions Consumption or Premium model.

Common mistake

Treating all Functions deployments as serverless consumption, then missing Premium plan, VNet, and cold-start requirements.

Safe next steps

Review runtime, scaling, networking, and cost requirements.
List app settings and deployment source before moving plans.
Use deployment runbooks before production slot or scale changes.

Related terms

Open the next path

Related runbook Command path Search all

App hostingDeployment impact

Best match

App Service slot swap vs direct production deployment

Can the release be warmed, tested, and swapped, or is a direct production deployment acceptable?

Slot swap stages a release in a deployment slot and swaps traffic; direct production deployment changes the live app path immediately.

When to use Slot swap

Use a deployment slot when you need warm-up, validation, sticky settings, quick rollback, or lower-risk production cutover.

When to use Direct production deployment

Use direct production deployment only for low-risk changes where rollback, warm-up, and configuration drift are already controlled elsewhere.

Key differences

Slot swap changes routing between slots; direct deployment changes the active production app in place.
Slots need attention to sticky app settings, connection strings, health checks, and warm-up behavior.
Direct deployment is simpler but leaves less room to catch startup, dependency, or configuration failures before users see them.

Common mistake

Swapping without checking slot-specific settings, then sending production traffic to an app with staging secrets, wrong endpoints, or cold startup failures.

Safe next steps

Compare app settings, connection strings, runtime stack, and health checks between slots.
Warm and test the target slot before swap; record the rollback path.
Use Activity Log and deployment logs to capture who changed what and when.

Related terms

Open the next path

Related runbook Command path Search all

App hostingDeployment impact

Best match

App Service vs Azure Functions

Is this a continuously hosted web/API app, or event-driven code that should run on triggers?

App Service is application hosting; Azure Functions is event-driven serverless execution with triggers and bindings.

When to use App Service

Use App Service for web apps, APIs, containers, or long-running app processes that need a stable hosting surface.

When to use Azure Functions

Use Azure Functions when work is trigger-driven, event-based, scheduled, or naturally split into small units of execution.

Key differences

App Service centers on sites, plans, slots, runtime stacks, and always-on application hosting.
Functions centers on triggers, bindings, scale plans, host settings, and execution duration.
App Service usually gives simpler web-app control; Functions can reduce idle cost but adds trigger, cold-start, and plan-choice concerns.

Common mistake

Choosing Functions for a normal web app because it sounds cheaper, then fighting cold starts, long execution, and trigger complexity.

Safe next steps

Identify request pattern, execution duration, state needs, and deployment workflow first.
Check runtime stack, scaling plan, VNet/private access, and observability needs before choosing.
If moving between platforms, map secrets, managed identity, custom domains, and CI/CD separately.

Related terms

Open the next path

Related runbook Command path Search all

App hostingDeployment impact

Best match

App Service vs Static Web Apps

Do you need a server-side app platform, or mostly static front-end hosting with managed build and routing?

App Service hosts dynamic web apps and APIs on App Service plans; Static Web Apps focuses on static front ends, preview environments, and integrated backend APIs.

When to use App Service

Use App Service when the app needs server-side runtime control, custom containers, background processes, slots, or richer networking options.

When to use Static Web Apps

Use Static Web Apps for static front ends, JAMstack-style sites, branch environments, and simple integrated API/back-end patterns.

Key differences

App Service hosts dynamic server workloads; Static Web Apps optimizes static assets, routes, auth, and preview environments.
App Service exposes more plan, runtime, networking, and deployment-slot controls.
Static Web Apps can simplify front-end delivery but may not fit complex server-side hosting or custom network requirements.

Common mistake

Putting a dynamic app into Static Web Apps and later discovering that runtime, private networking, or deployment needs belong in App Service.

Safe next steps

Confirm whether the workload serves static files, dynamic server code, APIs, or containers.
Check custom domain, auth, API, and environment requirements before migrating.
Review deployment tokens, GitHub workflow, and backend links before production changes.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingReachability impact

Best match

Application Gateway vs Front Door

Is the entry point regional and VNet-oriented, or global edge-facing?

Application Gateway is regional HTTP load balancing; Front Door is global edge routing with endpoints, origins, routes, caching, and WAF policies.

When to use Application Gateway

Use Application Gateway for regional Layer 7 load balancing, private backend integration, and WAF close to the VNet/application region.

When to use Front Door

Use Front Door for global edge entry, anycast routing, acceleration, global failover, and edge WAF behavior.

Key differences

Application Gateway is regional; Front Door is a global edge service.
Application Gateway often sits near private backends; Front Door fronts global public entry and origin routing.
DNS, TLS, WAF policy, health probes, and origin exposure differ significantly.

Common mistake

Choosing based only on WAF without deciding where traffic should enter: global edge or regional network boundary.

Safe next steps

Map client locations, origin regions, private/public origin requirements, and failover needs.
Inspect current listeners, origins/backends, probes, TLS, and WAF policies.
Test health probe and failover behavior before changing production routing.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingTraffic filtering impact

Best match

Application Security Group vs Network Security Group

Are you grouping application NICs for rule targeting, or defining the actual network allow/deny rules?

An Application Security Group groups VM NICs for rule targeting; an NSG is the actual traffic-filtering control applied to subnets or NICs.

When to use Application Security Group

Use an Application Security Group to group VM NICs by application role so NSG rules can target them cleanly.

When to use Network Security Group

Use a Network Security Group to define inbound and outbound security rules on subnets or NICs.

Key differences

ASGs are labels/targets; NSGs are the rule enforcement point.
NSGs attach to subnets or NICs and evaluate priority-based rules.
ASGs can make NSG rules more readable but do not filter traffic by themselves.

Common mistake

Creating ASGs and assuming traffic is filtered without confirming NSG rules reference them.

Safe next steps

List NIC ASG membership and NSG associations before changing rules.
Check effective security rules on the VM/NIC to see what is actually applied.
Review rule priority, direction, source, destination, and port before allowing exposure.

Related terms

Open the next path

Related runbook Command path Search all

Backup & resilienceResilience / availability impact

Best match

Availability zone vs Availability set

Do you need zone-level regional resiliency, or only VM fault/update-domain separation for a single workload tier?

Availability zones separate resources across datacenter zones in a region; availability sets spread VMs across fault and update domains inside a datacenter placement model.

When to use Availability zone

Use availability zones when supported services need regional-zone separation and the architecture can handle zone-aware networking, storage, and failover behavior.

When to use Availability set

Use availability sets for classic VM availability grouping where zone placement is not required or not supported by the deployment pattern.

Key differences

Zones are a regional resiliency feature; availability sets are a VM placement feature.
Zones usually affect network, storage, and cross-zone latency design; availability sets mostly affect VM fault/update-domain placement.
Newer architectures often prefer zones where service and region support are available; availability sets still appear in VM-focused scenarios.

Common mistake

Assuming an availability set protects against a full zone failure. It does not provide the same regional-zone isolation as availability zones.

Safe next steps

Check whether the region and target service support zones.
List VM, disk, public IP, load balancer, and app dependencies before choosing a resiliency model.
Open related backup and VM runbooks before changing production placement.

Related terms

Open the next path

Related runbook Command path Search all

CostGovernance / optimization impact

Best match

Azure Advisor recommendation vs Policy compliance

Are you looking at optimization guidance, or checking compliance against assigned rules?

Advisor recommendations suggest improvements; Policy compliance reports whether resources meet assigned governance rules.

When to use Advisor recommendation

Use Advisor recommendations to find cost, reliability, performance, operational, and security improvement opportunities.

When to use Policy compliance

Use Policy compliance when resources must be evaluated against required organizational rules and effects.

Key differences

Advisor is recommendation-driven; Policy is assignment/evaluation-driven.
Advisor findings are not the same as policy non-compliance.
Policy can deny, audit, append, modify, or deploy settings depending on effect.

Common mistake

Treating Advisor cleanup as mandatory compliance remediation without checking business owner, exemption, and policy requirements.

Safe next steps

Separate recommendation triage from compliance remediation.
Check resource owner, tag, and impact before acting.
Use related runbooks to collect evidence before changes.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataResponsible AI control

Best match

Azure AI Content Safety vs Prompt Shields

Are you filtering harmful content or defending the prompt/tool flow from malicious instructions?

Content Safety classifies harmful content; Prompt Shields help detect prompt-injection or jailbreak-style attacks against AI systems.

When to use Content Safety

Use Content Safety when the scenario is detecting or filtering harmful user/model content categories.

When to use Prompt Shields

Use Prompt Shields when the scenario is about prompt injection, jailbreaks, or malicious instructions in user/input documents.

Key differences

Content Safety focuses on content category risk; Prompt Shields focus on attack patterns against instructions and grounding.
Production systems may need both, plus human review and logging.
Prompt-injection risk becomes more important when agents, tools, or retrieval are involved.

Common mistake

Assuming content moderation alone protects agent tools or system prompts from malicious retrieved text.

Safe next steps

Classify the threat: harmful content, prompt injection, data leakage, or unsafe tool action.
Log and review blocked/flagged cases before loosening controls.
Test with adversarial prompts and retrieved content, not only normal happy-path prompts.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataSearch / generation impact

Best match

Azure AI Search vs Azure OpenAI

Are you trying to retrieve grounded content, generate model output, or combine both in a RAG flow?

AI Search retrieves and ranks grounded content; Azure OpenAI generates, summarizes, reasons over, or transforms content through deployed models.

When to use Azure AI Search

Use Azure AI Search when the hard problem is indexing, filtering, ranking, or retrieving approved content from your own corpus.

When to use Azure OpenAI

Use Azure OpenAI when the hard problem is generation, summarization, extraction, reasoning, or transformation through a deployed model.

Key differences

AI Search owns indexes, analyzers, vector fields, data sources, indexers, and retrieval ranking.
Azure OpenAI owns model deployments, prompt behavior, token usage, and generation quality.
Many production designs use both: Search retrieves evidence, then OpenAI produces the answer grounded on that evidence.

Common mistake

Treating Azure OpenAI as a knowledge base. Without retrieval, permissions, and grounding, model output may not reflect your approved data.

Safe next steps

List the data sources, indexes, and model deployments involved before choosing the pattern.
Test a real query with expected citations or source IDs, not only a demo prompt.
Check indexing cost, token cost, access controls, and private-network requirements before production rollout.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataVision/document extraction

Best match

Azure AI Vision OCR vs Document Intelligence

Are you reading text from images or extracting structured fields from documents?

AI Vision OCR reads text from images; Document Intelligence is built for structured document extraction and document-specific models.

When to use AI Vision OCR

Use AI Vision OCR when the scenario is image text extraction or visual analysis without document-specific field extraction.

When to use Document Intelligence

Use Document Intelligence when the scenario is forms, invoices, layouts, tables, fields, or document-processing workflows.

Key differences

Vision OCR is image-centric; Document Intelligence is document-centric.
Document Intelligence adds layout, prebuilt models, and custom extraction choices.
The right answer often depends on whether the exam scenario asks for fields/tables or raw text.

Common mistake

Using generic OCR for document workflows that require field confidence, tables, or model-specific extraction.

Safe next steps

Identify whether the input is an image, document, form, or invoice.
Check whether raw text is enough or structured fields are required.
Review endpoint, key, region, and network access before deployment.

Related terms

Open the next path

Related runbook Command path Search all

CostCost governance impact

Best match

Azure budget vs Cost alert

Are you setting planned spend guardrails, or detecting unusual cost movement after it happens?

An Azure budget tracks planned spend and threshold notifications; cost alerts or anomaly signals help detect unexpected changes in actual spend.

When to use Azure budget

Use budgets to define expected spend boundaries and threshold notifications for a scope.

When to use Cost alert / anomaly signal

Use cost alerts, anomaly review, Advisor, or cost analysis when you need to investigate unexpected cost changes and drivers.

Key differences

Budgets are proactive guardrails; cost anomaly review is investigative.
Budgets need correct scope and recipients.
Cost investigation usually needs tags, resource ownership, and usage evidence.

Common mistake

Creating a budget without validating scope, contact path, or the resources that actually drive spend.

Safe next steps

Check budget scope, thresholds, and action groups.
Review month-to-date cost by service, resource group, and tag.
Use cost runbooks before deleting or resizing resources.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingReachability impact

Best match

Azure Firewall vs Network Security Group

Do you need centralized stateful network policy, or subnet/NIC-level traffic filtering?

Azure Firewall is a managed network security service for centralized inspection and policy; an NSG filters traffic at subnet or network-interface scope.

When to use Azure Firewall

Use Azure Firewall for centralized inspection, rule collections, threat intelligence, routing control, and shared network security policy.

When to use Network Security Group

Use Network Security Groups for subnet or NIC allow/deny rules close to workloads.

Key differences

Azure Firewall is a managed firewall appliance/service in the network path.
NSGs are distributed rule sets applied to subnets or NICs.
Firewall policy often works with route tables; NSG behavior is inspected through effective security rules.

Common mistake

Adding firewall rules while traffic still bypasses the firewall because routes or subnet associations were not checked.

Safe next steps

Map routes, subnets, next hop, and NSG rules before editing firewall policy.
Check firewall logs and rule collection priority before adding broad allows.
Use effective routes and effective NSG output to prove the actual path.

Related terms

Open the next path

Related runbook Command path Search all

MonitoringSignal selection impact

Best match

Azure Monitor metrics vs logs

Can a numeric time-series signal answer the question, or do you need queryable event/detail evidence?

Metrics are numeric time-series signals optimized for fast alerting and charts; logs are queryable records that provide richer event and diagnostic context.

When to use Metrics

Use Azure Monitor metrics for fast numeric signals such as CPU, requests, latency, failures, or capacity counters.

When to use Logs

Use logs when you need detailed records, correlation, joins, text search, or custom KQL investigation.

Key differences

Metrics are lightweight, structured time series and usually better for fast alerting.
Logs are richer and queryable but depend on ingestion, retention, schema, and cost.
Good observability uses metrics for symptoms and logs for explanation.

Common mistake

Sending everything to logs and creating expensive noisy alerts for questions that a simple metric could answer.

Safe next steps

Define the question before choosing signal type: symptom, cause, audit, or evidence.
Check available metrics, diagnostic settings, tables, and retention before enabling new ingestion.
Estimate alert noise and ingestion cost before production rollout.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceAccess / policy impact

Best match

Azure Policy vs Management Locks

Do you need to govern what configurations are allowed, or prevent writes/deletes at a scope?

Policy evaluates compliance and can deny or remediate; locks directly restrict delete or update operations at a scope.

When to use Azure Policy

Use Azure Policy to audit, deny, append, or remediate resources based on compliance rules.

When to use Management Locks

Use management locks when the immediate requirement is to prevent accidental delete or write operations at a scope.

Key differences

Policy evaluates resource configuration and can enforce governance at scale.
Locks block operations regardless of why the operation was requested.
Locks can interrupt legitimate automation; policy can block deployments if definitions or exemptions are wrong.

Common mistake

Using a lock as a governance program or using policy to protect a critical resource from accidental deletion when a lock is the direct control.

Safe next steps

Check current policy assignments, exemptions, and locks before changing either control.
Confirm the target scope and whether automation relies on write/delete operations.
Record compliance or lock evidence before applying enforcement.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceAccess / policy impact

Best match

Azure RBAC vs Azure Policy

Is the problem who can act, or what configurations are allowed once someone acts?

RBAC controls who can act; Azure Policy controls what resource states are allowed or remediated.

When to use Azure RBAC

Use Azure RBAC to control which identities can perform actions at management or data scopes.

When to use Azure Policy

Use Azure Policy to control, audit, or remediate resource configuration regardless of which allowed identity deployed it.

Key differences

RBAC answers who can perform an operation.
Policy answers whether the resulting resource state is allowed or compliant.
Strong governance usually uses both: least-privilege RBAC plus policy guardrails.

Common mistake

Trying to fix broad Owner access with policy alone, or trying to enforce tagging and regions with RBAC alone.

Safe next steps

List role assignments and policy assignments at the same scope before changing access.
Identify whether the failure is authorization or policy denial.
Use exemptions and custom roles carefully, with evidence and owner approval.

Related terms

Open the next path

Related runbook Command path Search all

OperationsDiscovery vs telemetry

Best match

Azure Resource Graph vs Log Analytics KQL

Are you trying to find Azure resources and configuration or analyze collected logs and telemetry?

Resource Graph queries Azure resource inventory/control-plane state; Log Analytics queries collected telemetry, logs, and table data.

When to use Azure Resource Graph

Use Resource Graph when the question is inventory, configuration, exposure, ownership, policy, or resource relationships across subscriptions.

When to use Log Analytics KQL

Use Log Analytics KQL when the question is runtime events, logs, metrics-derived tables, errors, alerts, or historical telemetry in a workspace.

Key differences

Resource Graph is read-only resource inventory; Log Analytics depends on data collection and workspace retention.
Resource Graph is excellent before changes; Log Analytics is excellent after activity has happened.
Exam scenarios often reveal the answer by saying “resources configured as” versus “logs/events show.”

Common mistake

Expecting Resource Graph to contain app logs or expecting Log Analytics to know every resource that lacks diagnostic settings.

Safe next steps

Classify the question as inventory/configuration or telemetry/events.
Use Resource Graph before risky changes to find affected resources.
Use Log Analytics after diagnostics are enabled and data has been collected.

Related terms

Open the next path

Related runbook Command path Search all

DataData platform impact

Best match

Azure SQL Database vs SQL Managed Instance

Do you need a managed database surface, or SQL Server instance-level compatibility?

Azure SQL Database is database-scoped PaaS; SQL Managed Instance preserves more instance-level SQL Server compatibility. Also covers: Azure SQL Database vs SQL Managed Instance.

When to use Azure SQL Database

Use Azure SQL Database when the unit of management is a database or elastic pool and platform-managed simplicity is the priority.

When to use SQL Managed Instance

Use SQL Managed Instance when the workload needs broader SQL Server compatibility, instance-level features, or easier lift-and-shift behavior.

Key differences

Azure SQL Database optimizes database-level PaaS management and scaling.
Managed Instance keeps more instance-level capabilities and network isolation patterns.
Compatibility, cross-database behavior, SQL Agent needs, networking, and migration path should drive the choice.

Common mistake

Choosing Azure SQL Database for a lift-and-shift workload that quietly depends on instance-level SQL Server features.

Safe next steps

Inventory SQL Server features, cross-database dependencies, jobs, linked servers, and network requirements.
Check restore, failover, backup, and maintenance expectations before choosing.
Run a migration compatibility assessment before committing the target service.

Related terms

Open the next path

Related runbook Command path Search all

DataData platform impact

Best match

Azure SQL vs Cosmos DB

Is the workload relational with SQL-style constraints, or globally distributed NoSQL with partition-first access patterns?

Azure SQL is managed relational data; Cosmos DB is globally distributed NoSQL and multi-model data with tunable consistency and throughput.

When to use Azure SQL

Use Azure SQL when relational modeling, joins, transactions, SQL compatibility, and structured reporting are central.

When to use Cosmos DB

Use Cosmos DB when the workload needs globally distributed, low-latency NoSQL access with a well-designed partition key.

Key differences

Azure SQL favors relational schema, query consistency, and mature SQL ecosystem behavior.
Cosmos DB favors partitioned NoSQL scale, multiple APIs, global distribution, and RU-based throughput planning.
The Cosmos DB partition key and access pattern are architecture decisions, not afterthoughts.

Common mistake

Choosing Cosmos DB for scale without proving the partition key, query pattern, and RU cost model.

Safe next steps

Map entities, query patterns, transaction boundaries, and data growth before choosing.
Estimate throughput, indexing, region, and consistency requirements.
Prototype the highest-volume query path and measure latency and cost.

Related terms

Open the next path

Related runbook Command path Search all

Backup & resilienceRecovery impact

Best match

Backup vault vs Recovery Services vault

Which vault type matches the workload you need to protect and the recovery operation you must prove?

Backup vault is used by newer Azure Backup workloads; Recovery Services vault supports classic backup and site recovery scenarios.

When to use Backup vault

Use Backup vault for workloads supported by Azure Backup's newer data-protection model and policy flow.

When to use Recovery Services vault

Use Recovery Services vault where the workload, migration state, or recovery feature set still depends on that vault model.

Key differences

Vault choice is workload-driven; not every protected item can move freely between vault types.
Backup policies, recovery points, soft delete, and cross-region behavior must be checked per vault.
Operational teams need to know which vault owns the protected item before testing restore.

Common mistake

Creating a vault because the name sounds right, then discovering the workload or restore flow is supported by the other vault type.

Safe next steps

List protected items, policies, and recent backup jobs before changing vault setup.
Confirm restore requirements with the application or data owner, not only the infrastructure team.
Run a non-destructive restore-readiness review before any production recovery action.

Related terms

Open the next path

Related runbook Command path Search all

App hostingDeployment impact

Best match

Bicep vs ARM Template

Do you want cleaner infrastructure authoring, or raw ARM JSON compatibility for generated or legacy templates?

Bicep is a cleaner language that compiles to ARM template JSON; ARM templates are the JSON deployment representation.

When to use Bicep

Use Bicep when humans will author and maintain the Azure infrastructure code.

When to use ARM Template

Use ARM templates directly when you already receive generated JSON, need exact low-level representation, or maintain legacy deployments.

Key differences

Bicep is a higher-level authoring language that compiles to ARM JSON.
ARM templates are the underlying deployment format, but they are more verbose and harder to review by hand.
Both deploy through ARM, so what-if, scope, parameters, and deployment mode still matter.

Common mistake

Assuming Bicep makes a deployment safe. The same wrong scope, parameters, or complete-mode deployment can still create or delete resources.

Safe next steps

Run what-if at the correct scope before applying either format.
Review parameters, deployment mode, and target scope with the resource owner.
Check deployment history and operation errors before rerunning a failed deployment.

Related terms

Open the next path

Related runbook Command path Search all

StorageData access impact

Best match

Blob Storage vs Azure Files

Does the workload need object storage, or a managed file share protocol?

Blob Storage is object storage; Azure Files provides managed SMB/NFS file shares.

When to use Blob Storage

Use Blob Storage for object storage, HTTP/REST access, data lakes, backups, media, logs, and application objects.

When to use Azure Files

Use Azure Files when applications or users need SMB/NFS-style shared file access with directory semantics.

Key differences

Blob is object storage; Azure Files is managed file share storage.
Azure Files has share quota, protocol, identity, mount, and snapshot considerations.
Blob has containers, tiers, lifecycle, versioning, immutability, and SAS/public access considerations.

Common mistake

Using Azure Files as a general object store or Blob Storage as a drop-in replacement for a mounted file share.

Safe next steps

Confirm required access protocol, permissions, performance, backup, and restore behavior.
Check public access, SAS/key usage, private endpoints, and lifecycle rules before exposing data.
Test client access from the actual network and identity path.

Related terms

Open the next path

Related runbook Command path Search all

StorageData access impact

Best match

Blob Storage vs Data Lake Storage Gen2

Do you need general object storage, or hierarchical namespace and analytics-oriented lake behavior?

Blob Storage is object storage; Data Lake Storage Gen2 adds hierarchical namespace and analytics-friendly ACL semantics.

When to use Blob Storage

Use Blob Storage for general object workloads where containers, blobs, tiers, and lifecycle policy are the main concerns.

When to use Data Lake Storage Gen2

Use Data Lake Storage Gen2 when analytics workloads need hierarchical namespace, directory semantics, and lake-oriented access patterns.

Key differences

Data Lake Storage Gen2 is built on storage accounts with hierarchical namespace enabled.
Hierarchical namespace changes directory operations, ACL behavior, and analytics integration.
The choice affects access control, tooling, migration, and analytics performance patterns.

Common mistake

Enabling or choosing lake behavior without checking how existing apps, permissions, and tools interact with hierarchical namespace.

Safe next steps

Identify analytics engines, access methods, ACL needs, and migration expectations.
Check storage account hierarchical namespace, private access, and identity model.
Test representative read/write/list operations before migrating production data.

Related terms

Open the next path

Related runbook Command path Search all

ContainersCluster / workload impact

Best match

Container Apps vs AKS

Do you need a managed container app platform, or full Kubernetes cluster control?

Container Apps hides most orchestration and infrastructure; AKS gives Kubernetes-level control for platform teams that need it.

When to use Container Apps

Use Container Apps when you want revisions, ingress, scale rules, and managed container hosting without operating a Kubernetes cluster.

When to use AKS

Use AKS when you need Kubernetes APIs, node pools, custom networking, cluster add-ons, or deep platform control.

Key differences

Container Apps abstracts much of the cluster and gives app-level revisions and scaling.
AKS exposes the Kubernetes control surface and requires cluster, node, network, and upgrade ownership.
The right choice depends on operational ownership as much as application architecture.

Common mistake

Choosing AKS for every container workload before confirming the team is ready to operate Kubernetes safely.

Safe next steps

Identify required ingress, private networking, secrets, registry access, scale rules, and observability.
Check whether the team needs Kubernetes-native APIs or just managed container deployment.
Compare production change workflow: Container Apps revision rollout vs AKS manifests, ingress, and node upgrades.

Related terms

Open the next path

Related runbook Command path Search all

ContainersCluster / workload impact

Best match

Container Apps vs Container Instances

Is this a managed app with ingress, revisions, and scale rules, or a simple container run?

Container Apps is an application platform with revisions, ingress, Dapr, jobs, and scale rules; Container Instances is simpler container execution.

When to use Container Apps

Use Container Apps for long-running services, APIs, event-driven apps, revisions, traffic splitting, and managed scaling.

When to use Container Instances

Use Container Instances for simple container groups, short-lived jobs, quick tests, or cases where app-platform features are unnecessary.

Key differences

Container Apps provides app platform behavior such as revisions, ingress, environment, and scale rules.
Container Instances is lower-level and simpler for direct container execution.
Secrets, registry auth, networking, and logs are managed differently across the two services.

Common mistake

Building a production service on Container Instances and then recreating app-platform features manually.

Safe next steps

Confirm whether the workload needs ingress, revision rollback, autoscaling, or service discovery.
Check image pull, secrets, environment variables, and networking before choosing.
For production services, document rollout and rollback behavior before deployment.

Related terms

Open the next path

Related runbook Command path Search all

ArchitectureOperations boundary impact

Best match

Control Plane vs Data Plane

Are you managing the Azure resource itself, or accessing the workload/data inside that resource?

The control plane manages resource configuration; the data plane accesses or operates service data.

When to use Control Plane

Think control plane when the action creates, configures, scales, secures, or deletes Azure resources.

When to use Data Plane

Think data plane when the action reads, writes, processes, queries, or serves the data/application inside the resource.

Key differences

Control-plane permissions usually flow through Azure Resource Manager and RBAC at scope.
Data-plane permissions may use service-specific roles, keys, tokens, connection strings, or network rules.
A user can have control-plane access without data-plane access, or the reverse, depending on service design.

Common mistake

Granting Contributor and assuming that automatically allows reading secrets, blobs, databases, or messages.

Safe next steps

Identify whether the requested action is resource management or data access.
Check both Azure RBAC and service-specific access controls before changing permissions.
Collect the exact error message because authorization failures often reveal the missing plane.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataLanguage workload choice

Best match

Conversational Language Understanding vs custom text classification

Is the app interpreting user intent in conversation or classifying text/documents into categories?

CLU predicts conversational intents/entities; custom text classification assigns documents or text into categories.

When to use CLU

Use CLU when the app needs intents, entities, utterances, and conversational routing.

When to use Custom text classification

Use custom text classification when the app needs to label text or documents into categories without conversational intent routing.

Key differences

CLU is dialog/intent oriented; classification is category-label oriented.
CLU design depends on utterances, intents, and entities.
Classification design depends on labeled examples and category boundaries.

Common mistake

Using a conversational intent model for every NLP problem even when the requirement is document classification.

Safe next steps

Read the requirement for intents, entities, categories, and expected output.
Choose sample data that mirrors real user utterances or documents.
Check endpoint, deployment, and confidence thresholds before production.

Related terms

Open the next path

Related runbook Command path Search all

DataData modeling impact

Best match

Cosmos DB database vs container

Are you defining a namespace and throughput boundary, or the item collection with partitioning and indexing?

A Cosmos DB database is a logical namespace; a container is the unit where items, partitioning, indexing, and throughput are usually applied.

When to use Cosmos DB database

Use the Cosmos DB database as the logical grouping and, when configured, a shared throughput boundary for containers.

When to use container

Use the container as the place where items live, partition keys are enforced, indexing is configured, and most query behavior is shaped.

Key differences

Databases organize containers and can hold shared throughput in some designs.
Containers hold items and define partition key, indexing policy, TTL, and throughput if not shared.
Most performance and cost problems show up at the container and partition-key level.

Common mistake

Treating the database as the main scale boundary and ignoring the container partition key that actually governs item distribution.

Safe next steps

List databases, containers, throughput mode, and partition keys before schema changes.
Check hot partitions, indexing policy, and RU consumption at the container level.
Do not change partitioning strategy without a migration plan.

Related terms

Open the next path

Related runbook Command path Search all

StorageData access impact

Best match

Cosmos DB logical vs physical partition

Are you reasoning about your partition key value, or the backend partitions Azure uses to serve data?

Logical partitions group items by partition key value; physical partitions are internal storage and throughput partitions managed by Cosmos DB.

When to use Cosmos DB logical

Use logical partition thinking when designing item distribution, partition key values, and query patterns.

When to use physical partition

Use physical partition thinking when investigating throughput distribution, hot partitions, storage growth, and backend scaling behavior.

Key differences

Logical partitions are based on your partition key values.
Physical partitions are Azure-managed infrastructure that host logical partitions.
Hot logical partitions can create throttling even when total account throughput looks sufficient.

Common mistake

Looking only at total RU/s and missing that one logical partition key is carrying most of the traffic.

Safe next steps

Inspect partition key design, high-volume keys, RU consumption, and throttling evidence.
Review queries that do not include the partition key.
Do not change partition key strategy without a data migration plan.

Related terms

Open the next path

Related runbook Command path Search all

DataData platform impact

Best match

Cosmos DB manual vs autoscale throughput

Is demand predictable enough for fixed RU/s, or variable enough to pay for autoscale headroom?

Manual throughput fixes provisioned RU/s; autoscale adjusts RU/s within a configured range as workload demand changes. Also covers: Cosmos DB manual throughput vs autoscale; Manual vs Autoscale Throughput.

When to use Cosmos DB manual

Use manual throughput when workload demand is steady, understood, and managed against a fixed RU/s target.

When to use autoscale throughput

Use autoscale throughput when traffic varies and you need the service to scale within a configured maximum.

Key differences

Manual throughput gives predictable capacity and cost when usage is stable.
Autoscale can absorb bursts but requires careful max RU/s selection to avoid surprise spend.
Both modes still depend on partition key quality and query efficiency.

Common mistake

Switching to autoscale to hide inefficient queries or hot partitions instead of fixing the access pattern.

Safe next steps

Review RU consumption, throttling, hot partitions, and peak windows before switching modes.
Set autoscale max based on measured demand, not guesswork.
Capture before/after metrics after changing throughput mode.

Related terms

Open the next path

Related runbook Command path Search all

DataScale / cost impact

Best match

Cosmos DB serverless vs Provisioned throughput

Is the workload intermittent and unpredictable, or does it need predictable reserved throughput and scale controls?

Serverless charges for request units consumed; provisioned throughput reserves RU/s capacity at database or container scope.

When to use Cosmos DB serverless

Use serverless when usage is intermittent, smaller, or early-stage and you want consumption-based billing without managing RU/s.

When to use Provisioned throughput

Use provisioned throughput when workloads need predictable RU/s, autoscale/manual controls, or production capacity planning.

Key differences

Serverless removes RU/s planning but has different limits and cost behavior.
Provisioned throughput requires partition and RU planning.
Autoscale provisioned throughput is different from serverless billing.

Common mistake

Choosing serverless for a workload that needs predictable sustained throughput, then being surprised by limits or cost shape.

Safe next steps

Estimate workload request patterns before choosing mode.
Review partition key and hot partition risk.
Use Cosmos runbooks before scaling or changing backup/failover settings.

Related terms

Open the next path

Related runbook Command path Search all

DataData platform impact

Best match

Data Factory vs Synapse Pipelines

Do you need standalone data integration orchestration, or orchestration inside a Synapse analytics workspace?

Data Factory focuses on data integration; Synapse pipelines bring similar orchestration into an analytics workspace. Also covers: Data Factory Pipeline vs Synapse Pipeline.

When to use Data Factory

Use Data Factory when the primary job is data movement, transformation orchestration, integration runtime, and pipeline scheduling across services.

When to use Synapse Pipelines

Use Synapse Pipelines when orchestration is part of a Synapse workspace with SQL pools, Spark, and analytics development in the same environment.

Key differences

Data Factory is the standalone integration service and often fits cross-platform orchestration.
Synapse Pipelines share workspace context with Synapse SQL, Spark, linked services, and workspace security.
Integration runtime, private endpoints, triggers, and rerun behavior still need safe review in either service.

Common mistake

Choosing based on UI similarity and overlooking workspace governance, runtime networking, and who owns reruns after failures.

Safe next steps

List pipelines, triggers, linked services, and integration runtimes before migration or rerun.
Confirm whether the pipeline depends on Synapse workspace assets.
Capture failed activity output before changing triggers or rerunning production pipelines.

Related terms

Open the next path

Related runbook Command path Search all

DataData platform impact

Best match

Dedicated SQL Pool vs Serverless SQL Pool

Do you need provisioned warehouse capacity, or on-demand SQL over lake data?

Dedicated SQL pools reserve data warehouse capacity; serverless SQL pools query lake data on demand.

When to use Dedicated SQL Pool

Use a dedicated SQL pool when workloads need provisioned MPP warehouse capacity, predictable performance, and controlled loading/query patterns.

When to use Serverless SQL Pool

Use a serverless SQL pool for exploratory or on-demand querying over files in the lake without provisioning a warehouse.

Key differences

Dedicated pools have allocated capacity and require pause/resume, scale, distribution, and workload management decisions.
Serverless pools charge per query/data processed and depend heavily on file layout, format, and query efficiency.
Dedicated is a data warehouse choice; serverless is often an exploration, external table, or lightweight query choice.

Common mistake

Running broad serverless queries over poorly partitioned data and being surprised by performance or scan cost.

Safe next steps

Confirm workload pattern, concurrency, data size, and performance objective.
Review pool state, SKU, pause/resume behavior, and query history before changes.
Test representative queries against real file layout before committing.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceIdentity permission impact

Best match

Delegated vs Application Permission

Should the app act as the signed-in user, or act on its own without a user present?

Delegated permission acts with a signed-in user; application permission lets an app act as itself without a user.

When to use Delegated

Use delegated permissions when the app should operate in the context of a signed-in user and respect that user's access.

When to use Application Permission

Use application permissions for daemon, service, or automation scenarios where the app acts as itself.

Key differences

Delegated permission combines app consent with the user's identity and effective access.
Application permission can operate without a user and often has broader tenant or resource impact.
Admin consent, token claims, and audit evidence differ between the two models.

Common mistake

Granting application permission to solve a user-flow issue, creating an app-only identity with more access than intended.

Safe next steps

Confirm whether a user is present and whether the app should inherit user restrictions.
Review app registration API permissions and granted consent before changing scopes.
Collect sign-in/audit logs after consent changes.

Related terms

Open the next path

Related runbook Command path Search all

MonitoringEvidence and telemetry impact

Best match

Diagnostic setting vs Activity Log

Are you collecting resource telemetry, or investigating control-plane changes in Azure?

Activity Log records subscription-level control-plane events; diagnostic settings route selected platform logs and metrics to destinations like Log Analytics, Storage, or Event Hubs.

When to use Diagnostic setting

Use diagnostic settings to route supported resource logs and metrics to a destination such as Log Analytics, Event Hubs, or Storage.

When to use Activity Log

Use Activity Log when the question is who changed Azure resources, when, and whether the control-plane operation succeeded.

Key differences

Diagnostic settings are configured per resource or scope to export telemetry categories.
Activity Log records Azure control-plane operations at subscription scope.
Diagnostic settings can increase ingestion cost; Activity Log may not contain data-plane details.

Common mistake

Looking for application or data-plane logs in Activity Log, or enabling every diagnostic category without cost review.

Safe next steps

Identify whether the evidence needed is control-plane change, resource log, metric, or application telemetry.
List current diagnostic settings and destinations before enabling more categories.
Capture Activity Log around the incident window before changing resources.

Related terms

Open the next path

Related runbook Command path Search all

MonitoringTelemetry collection

Best match

Diagnostic settings vs data collection rule

Are you routing Azure platform logs from a resource or configuring an agent/data collection pipeline?

Diagnostic settings route platform logs and metrics from resources; data collection rules define how monitoring agents and collection pipelines gather telemetry.

When to use Diagnostic settings

Use diagnostic settings when the scenario is sending resource platform logs/metrics to Log Analytics, Event Hubs, or Storage.

When to use Data collection rule

Use a data collection rule when the scenario is Azure Monitor Agent, VM/guest telemetry, collection transforms, or DCR associations.

Key differences

Diagnostic settings are attached to Azure resources and categories.
DCRs are agent/pipeline configuration objects with associations.
Both can affect cost and data volume, so exam scenarios often include destination and collection scope clues.

Common mistake

Enabling all log categories everywhere without estimating ingestion cost or retention impact.

Safe next steps

List supported diagnostic categories first.
Check Log Analytics retention and table cost before turning on verbose logs.
Verify DCR associations when troubleshooting missing VM telemetry.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataDocument extraction choice

Best match

Document Intelligence layout model vs prebuilt model

Do you need generic document structure or fields from a known document type?

Layout extracts structure from documents; prebuilt models target known document types and common fields.

When to use Layout model

Use layout when the task is extracting text, tables, selection marks, and structure across varied documents.

When to use Prebuilt model

Use a prebuilt model when the document type matches a supported form, invoice, receipt, ID, or other known pattern.

Key differences

Layout is broader and structure-oriented; prebuilt models are targeted and field-oriented.
Prebuilt models reduce custom training when the document type matches.
Layout is often a step before deciding whether custom extraction is needed.

Common mistake

Training a custom model before testing whether layout or a prebuilt model already solves the extraction need.

Safe next steps

Identify the document type and fields required.
Try layout/prebuilt extraction with sample documents.
Use custom extraction only when the generic or prebuilt path misses required fields.

Related terms

Open the next path

Related runbook Command path Search all

DataDatabase purchasing impact

Best match

DTU vs vCore

Do you want a simplified blended sizing model, or more transparent compute, memory, and licensing control?

DTU is a bundled performance model; vCore separates compute, memory, and storage choices more explicitly.

When to use DTU

Use DTU when a simple bundled performance model is enough and the workload is not asking for detailed hardware or license control.

When to use vCore

Use vCore when you need clearer CPU/memory/storage sizing, hardware family choice, reserved capacity, or Azure Hybrid Benefit planning.

Key differences

DTU bundles compute, storage, and IO into an abstract unit.
vCore exposes compute and storage choices more directly and aligns better with SQL Server licensing conversations.
Migration between models should be tested with workload metrics, not only pricing tables.

Common mistake

Changing purchasing model to reduce cost without checking CPU, IO, storage, and query wait evidence.

Safe next steps

Review current database metrics, wait stats, storage, and service objective.
Estimate cost with production workload peaks and licensing assumptions.
Test the target tier before changing a production database.

Related terms

Open the next path

Related runbook Command path Search all

IntegrationMessaging / integration impact

Best match

Event Grid vs Event Hubs

Are you routing discrete events to handlers, or ingesting a high-volume event stream?

Event Grid routes discrete events; Event Hubs ingests high-volume streams.

When to use Event Grid

Use Event Grid when producers emit discrete events that subscribers should react to through filtering and delivery semantics.

When to use Event Hubs

Use Event Hubs when the workload ingests telemetry, logs, or ordered streams at high throughput for consumers to process.

Key differences

Event Grid is event routing with subscriptions, filters, retry, and dead-letter behavior.
Event Hubs is streaming ingestion with partitions, consumer groups, throughput, and capture options.
Event Grid pushes events to handlers; Event Hubs is read by consumers from a stream.

Common mistake

Using Event Grid as a telemetry stream or Event Hubs as a simple notification router without checking delivery and consumer semantics.

Safe next steps

Define event volume, ordering needs, subscriber count, retry behavior, and dead-letter requirements.
Inspect existing subscriptions, filters, consumer groups, and throughput before changes.
Test failure handling with a non-production event before changing production routing.

Related terms

Open the next path

Related runbook Command path Search all

IntegrationMessaging / integration impact

Best match

Event Hubs vs Service Bus

Is the workload a telemetry/event stream, or brokered business messaging?

Event Hubs is high-throughput streaming; Service Bus is enterprise brokered messaging for commands, workflows, and business transactions.

When to use Event Hubs

Use Event Hubs for high-throughput event ingestion where consumers read from partitions and handle stream processing.

When to use Service Bus

Use Service Bus when messages need broker features such as queues/topics, locks, sessions, dead-lettering, and business process reliability.

Key differences

Event Hubs optimizes stream ingestion and multiple consumer groups.
Service Bus optimizes commands, work items, pub/sub messages, and delivery guarantees.
Ordering, replay, lock duration, sessions, and dead-letter handling are key decision points.

Common mistake

Choosing Event Hubs for work queues that need per-message settlement, retries, sessions, or dead-letter operations.

Safe next steps

Clarify whether consumers process streams or individual business messages.
Check ordering, retry, dead-letter, replay, and throughput requirements.
Inspect current lag, dead-letter count, and consumer behavior before migration.

Related terms

Open the next path

Related runbook Command path Search all

IntegrationMessaging / integration impact

Best match

Event Hubs vs Service Bus vs Event Grid

Do you need stream ingestion, brokered messaging, or event routing?

Event Hubs is for high-throughput event streams, Service Bus is for brokered enterprise messaging, and Event Grid is for reactive event routing.

When to use Event Hubs

Use Event Hubs when the core need is high-volume stream ingestion and consumer-group processing.

When to use Service Bus / Event Grid

Use Service Bus for durable business messaging or Event Grid for event notification and routing to handlers.

Key differences

Event Hubs is the stream; Service Bus is the broker; Event Grid is the event router.
Consumer behavior differs: stream readers, message receivers, or event handlers.
Retry, ordering, dead-letter, filtering, and replay expectations should decide the service.

Common mistake

Picking the service with the familiar name instead of writing down delivery semantics and failure behavior.

Safe next steps

Classify the workload as telemetry stream, work queue, pub/sub message, or event notification.
Document ordering, replay, dead-letter, retry, and fan-out needs.
Run a small failure-mode test before changing integration routes.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingHybrid connectivity impact

Best match

ExpressRoute vs Site-to-site VPN

Do you need private provider-backed hybrid connectivity, or is encrypted internet VPN connectivity enough?

ExpressRoute provides private connectivity through a connectivity provider; site-to-site VPN uses encrypted tunnels over the public internet.

When to use ExpressRoute

Use ExpressRoute when requirements call for private connectivity, predictable performance, provider integration, or enterprise hybrid network architecture.

When to use Site-to-site VPN

Use site-to-site VPN when encrypted connectivity over the internet is acceptable, the environment is smaller, or VPN is needed as a backup path.

Key differences

ExpressRoute is private provider connectivity, not encrypted internet tunneling by default.
VPN is usually faster to start but may have lower predictability than private circuit connectivity.
Many enterprise designs use both: ExpressRoute primary, VPN backup.

Common mistake

Treating ExpressRoute as automatically encrypted end-to-end or ignoring VPN backup and route failover behavior.

Safe next steps

Document latency, bandwidth, encryption, and failover requirements.
Check route propagation and gateway SKU choices before deployment.
Use network discovery before changing hybrid routing.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataAgentic architecture

Best match

Foundry agent vs chat completion

Does the solution need tool-using, stateful task orchestration or a direct model response to a prompt?

Agents coordinate tools, memory, orchestration, and multi-step tasks; chat completions produce model responses for direct prompt-and-response interactions.

When to use Foundry agent

Use an agent when the scenario requires tools, actions, retrieval, multi-turn task state, or workflow-like behavior.

When to use Chat completion

Use chat completion when the scenario is a direct prompt, system message, grounding payload, and model response without managed agent orchestration.

Key differences

Agents add orchestration concepts that must be secured and monitored beyond the model call.
Chat completion remains simpler and easier to reason about for stateless prompt/response flows.
Agent scenarios usually require more attention to tool permissions, data boundaries, and human approval points.

Common mistake

Using an agent for every generative AI scenario. On exams, the simpler model call is often enough when no tool orchestration is required.

Safe next steps

List tools, data sources, and actions the agent can call.
Separate model output risk from tool-execution risk.
Check identity and approval boundaries before allowing an agent to change data or call APIs.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingEdge and regional protection impact

Best match

Front Door WAF vs Application Gateway WAF

Should web protection happen at the global edge or at a regional application gateway?

Front Door WAF protects global edge traffic; Application Gateway WAF protects regional application gateway traffic inside a regional design.

When to use Front Door WAF

Use Front Door WAF when traffic enters through global edge routing and needs edge-level policy before reaching origins.

When to use Application Gateway WAF

Use Application Gateway WAF when traffic is regional, VNet-oriented, or protected at the application gateway layer.

Key differences

Front Door WAF applies at the global edge in front of Front Door routes.
Application Gateway WAF applies regionally on Application Gateway listeners.
Policy mode, exclusions, managed rules, custom rules, and origin/back-end exposure must be checked in context.

Common mistake

Applying WAF in one layer while the public origin or alternate route bypasses that WAF entirely.

Safe next steps

Trace DNS and client path to confirm which WAF actually receives traffic.
Review WAF mode, managed rule sets, exclusions, and custom rules before switching to prevention.
Check origin exposure so users cannot bypass the protected entry point.

Related terms

Open the next path

Related runbook Command path Search all

App hostingScale / cold-start impact

Best match

Functions Consumption vs Premium

Can the function tolerate cold start and pay-per-execution scaling, or does it need warm capacity and stronger network/runtime control?

Consumption is pure serverless usage billing; Premium adds prewarmed capacity and advanced networking capabilities.

When to use Functions Consumption

Use Consumption when executions are bursty, short, event-driven, and cold start is acceptable.

When to use Premium

Use Premium when the app needs pre-warmed instances, VNet integration, higher scale control, or workloads that do not fit basic Consumption behavior.

Key differences

Consumption emphasizes automatic scale and pay-per-execution economics.
Premium adds always-ready capacity, richer networking, and stronger runtime control at a baseline cost.
Cold start, execution duration, trigger pressure, and dependency latency should drive the choice.

Common mistake

Optimizing for apparent idle cost while ignoring cold-start, VNet, and throughput requirements that force a later plan migration.

Safe next steps

Check trigger volume, execution duration, memory use, and dependency latency.
Inspect current plan, app settings, and scale behavior before changing hosting plan.
Test cold-start and peak-load behavior with representative events.

Related terms

Open the next path

Related runbook Command path Search all

App hostingImage deployment impact

Best match

Generalized vs specialized VM image

Are you creating a reusable clean VM image, or preserving one machine exactly as-is?

Generalized images remove machine identity for reusable provisioning; specialized images preserve state from the source VM.

When to use Generalized

Use a generalized image when you want reusable VM instances without carrying the source machine identity.

When to use specialized VM image

Use a specialized image when you intentionally need to preserve OS state, machine identity, users, and installed configuration.

Key differences

Generalized images are prepared to create new machines from a clean identity state.
Specialized images keep the source VM state and are closer to cloning that machine.
The wrong image type can cause identity duplication, domain issues, or unusable provisioning expectations.

Common mistake

Capturing a specialized image when the team expected a reusable base image for scaled deployments.

Safe next steps

Confirm whether the VM should be sysprepped/generalized before capture.
Record extensions, disks, network assumptions, and identity dependencies.
Test a new VM from the image before using it in production scale sets or deployments.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceAccess management impact

Best match

Group-based RBAC vs Direct user role assignment

Should access be managed through membership and lifecycle processes, or assigned directly to one person?

Group-based RBAC assigns roles to an Entra group; direct assignment grants a role to an individual user at scope.

When to use Group-based RBAC

Use group-based RBAC when access should follow a team, job role, approval workflow, or lifecycle-managed membership.

When to use Direct user assignment

Use direct user assignment only for exceptional, temporary, or highly specific cases where a group model is not appropriate.

Key differences

Group-based access is easier to review and remove at scale.
Direct assignments accumulate quickly and are harder to govern.
Privileged groups still need owner, membership, and PIM review.

Common mistake

Cleaning up direct assignments without checking inherited scope or group membership, which can leave hidden access paths behind.

Safe next steps

List role assignments at the target and parent scopes.
Separate direct assignments from group-based assignments.
Review owners, eligible assignments, and privileged roles before cleanup.

Related terms

Open the next path

Related runbook Command path Search all

CostCost impact

Best match

Hot vs Cool vs Archive Blob Tiers

How often will the data be read, and how quickly must it be restored?

Access tiers trade read/write frequency, retrieval latency, and storage cost.

When to use Hot

Use Hot for data that is actively read, updated, or needed quickly.

When to use Cool / Archive Blob Tiers

Use Cool or Archive when data is rarely accessed and lower storage cost matters more than retrieval speed or transaction cost.

Key differences

Hot usually favors frequent access with higher storage cost and lower access friction.
Cool lowers storage cost for infrequent access but can increase access and early deletion considerations.
Archive is for long-term retention and requires rehydration before normal reads.

Common mistake

Moving data to Archive to save money without checking restore-time expectations or downstream jobs that still read it.

Safe next steps

Review access patterns, lifecycle rules, retention requirements, and restore-time objectives.
Check current blob tier distribution and last modified/access evidence before lifecycle changes.
Test rehydration or restore expectations with the data owner before applying broad policies.

Related terms

Open the next path

Related runbook Command path Search all

App hostingDeployment impact

Best match

Incremental vs Complete Deployment Mode

Should the deployment update only declared resources, or enforce that the template is the full desired state at that scope?

Incremental mode updates declared resources without deleting omitted ones; complete mode can remove resources not declared at the target scope.

When to use Incremental

Use incremental mode for most deployments where resources not in the template should remain untouched.

When to use Complete Deployment Mode

Use complete mode only when the template deliberately represents the entire target scope and deletion impact is approved.

Key differences

Incremental mode creates or updates resources in the template without deleting unrelated resources.
Complete mode can remove resources at the deployment scope that are not represented in the template.
Scope is everything: resource group, subscription, and nested deployments change the blast radius.

Common mistake

Running complete mode against a shared resource group and deleting resources that were never meant to be managed by that template.

Safe next steps

Run what-if and inspect delete operations before any complete-mode deployment.
Confirm the target scope and ownership of every resource that would be removed.
Keep a rollback and recovery plan for deleted resources before applying complete mode.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataSearch indexing architecture

Best match

Integrated vectorization vs custom embedding pipeline

Should Azure AI Search own vectorization or should the application/data pipeline produce embeddings explicitly?

Integrated vectorization lets Azure AI Search handle chunking/vectorization flow; custom pipelines give teams deeper control over preprocessing, embeddings, and storage.

When to use Integrated vectorization

Use integrated vectorization when the goal is simpler AI Search indexing with managed vectorization steps and less custom pipeline code.

When to use Custom embedding pipeline

Use a custom embedding pipeline when you need exact preprocessing control, external data movement, custom scheduling, or nonstandard chunking/evaluation.

Key differences

Integrated vectorization reduces custom code but still needs service identity, indexer, skillset, and data-source permissions.
Custom pipelines can fit mature data engineering flows but create more moving parts to secure and monitor.
Both patterns need evaluation for chunk quality, relevance, cost, and stale index behavior.

Common mistake

Choosing vectorization mechanics before defining retrieval quality, permissions, refresh cadence, and cost limits.

Safe next steps

Check data source, index, vector fields, and skillset permissions.
Measure retrieval quality with expected queries and citations.
Review indexer failures and service scale before production.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceSecret / cryptography impact

Best match

Key Vault key vs Secret vs Certificate

Are you storing a cryptographic key, an application secret, or certificate material that has lifecycle and renewal needs?

Key Vault keys are cryptographic keys, secrets are sensitive values, and certificates manage certificate material and lifecycle.

When to use Key Vault key

Use keys when the workload needs cryptographic operations such as encryption, signing, wrapping, or key rotation.

When to use Secret / Certificate

Use secrets for sensitive values and certificates for certificate objects that need issuance, renewal, or binding workflows.

Key differences

Keys, secrets, and certificates have different permissions and operations.
Certificate objects often create backing keys/secrets that must be understood.
Rotation and access review differ by object type.

Common mistake

Granting broad secret access when the workload only needs a certificate binding or key operation.

Safe next steps

List vault access model and object permissions before changing access.
Separate app runtime access from human operator access.
Collect evidence before rotation or purge-protection changes.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceSecret access impact

Best match

Key Vault RBAC vs Access Policy

Should Key Vault data access be governed through Azure RBAC or vault-local access policies?

Key Vault RBAC uses Azure role assignments; access policies use the older vault-level policy model for data-plane permissions.

When to use Key Vault RBAC

Use Key Vault RBAC when you want centralized role assignment, scope inheritance, and Azure RBAC governance for vault data operations.

When to use Key Vault access policy

Use access policies when the vault or migration state still depends on the older vault-local permission model.

Key differences

RBAC uses Azure role assignments and scope inheritance.
Access policies are configured directly on the vault and can be easier to inspect locally but less aligned with central RBAC governance.
Switching models can break secret, key, or certificate access if assignments are not prepared.

Common mistake

Changing the vault access model before proving every application identity has equivalent data-plane access.

Safe next steps

List current vault access model, access policies, and role assignments before changes.
Map each application identity to required secret/key/certificate operations.
Test read-only secret access with the target identity before switching models.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingTraffic routing impact

Best match

Load Balancer vs Application Gateway

Do you need Layer 4 load distribution, or HTTP-aware Layer 7 routing?

Load Balancer operates at transport layers; Application Gateway handles HTTP routing, listeners, backend settings, and WAF integration.

When to use Load Balancer

Use Load Balancer for TCP/UDP load distribution, inbound NAT, and network-level balancing without HTTP routing logic.

When to use Application Gateway

Use Application Gateway for HTTP/S routing, TLS termination, path/host rules, cookies, probes, and WAF options.

Key differences

Load Balancer works at Layer 4 and does not understand HTTP paths or headers.
Application Gateway works at Layer 7 and can route based on HTTP properties.
Health probes, backend pools, SNAT/outbound behavior, and TLS ownership differ.

Common mistake

Using Load Balancer for an HTTP app that needs host/path routing or WAF, then trying to recreate Layer 7 behavior elsewhere.

Safe next steps

Identify protocol, TLS termination point, routing rules, and health probe requirements.
Inspect current backend health and probe configuration before changing traffic flow.
Check whether outbound/SNAT behavior is part of the design.

Related terms

Open the next path

Related runbook Command path Search all

MonitoringObservability model impact

Best match

Log Analytics workspace vs Application Insights

Are you centralizing operational logs, or instrumenting application behavior and user-facing performance?

Log Analytics workspace is the central query and retention store for logs; Application Insights focuses on application performance, failures, dependencies, availability, and user behavior telemetry.

When to use Log Analytics workspace

Use Log Analytics workspace as the query and retention home for operational logs across resources and services.

When to use Application Insights

Use Application Insights when the focus is application telemetry such as requests, dependencies, exceptions, availability, and traces.

Key differences

Log Analytics is the broader workspace and KQL query surface for operational data.
Application Insights adds application performance monitoring concepts and SDK/agent telemetry.
Many App Insights resources are workspace-based, so workspace retention and cost still matter.

Common mistake

Treating Application Insights as just another log bucket and missing sampling, dependency tracking, availability tests, and app-level signals.

Safe next steps

Map which apps and resources send data to which workspace.
Check ingestion, retention, sampling, and table cost before expanding telemetry.
Use App Insights for application symptoms and workspace queries for cross-resource evidence.

Related terms

Open the next path

Related runbook Command path Search all

IntegrationMessaging / integration impact

Best match

Logic Apps Standard vs Consumption

Do workflows need single-tenant runtime control, or simple multi-tenant pay-per-action execution?

Standard runs workflows in a single-tenant model; Consumption runs workflows in a multi-tenant serverless model.

When to use Logic Apps Standard

Use Logic Apps Standard when workflows need stronger runtime isolation, local project structure, VNet options, or multiple workflows in one app.

When to use Consumption

Use Logic Apps Consumption for simpler serverless workflows where per-action billing and multi-tenant managed runtime fit.

Key differences

Standard runs on a single-tenant app model with hosting and networking choices.
Consumption is simpler to start and bills per action/trigger execution.
Connector availability, networking, deployment model, and cost behavior should be checked before migration.

Common mistake

Migrating to Standard for control without planning hosting cost, deployment workflow, or connector differences.

Safe next steps

List triggers, actions, connectors, integration accounts, and network dependencies.
Compare execution volume and hosting cost before switching plan type.
Test workflow runs and connector authentication in a non-production environment.

Related terms

Open the next path

Related runbook Command path Search all

StorageData access impact

Best match

LRS vs ZRS vs GRS

Do you need local, zone-level, or regional redundancy for the data and recovery objective?

LRS keeps replicas within one physical location, ZRS spreads data across zones, and GRS copies data to a paired secondary region.

When to use LRS

Use LRS when local durability is enough and the workload does not require zone or regional redundancy.

When to use ZRS or GRS

Use ZRS or GRS when availability-zone resilience or cross-region disaster recovery is part of the requirement.

Key differences

LRS keeps replicas in one region location model; ZRS spreads across availability zones where supported.
GRS adds cross-region replication for regional disaster recovery scenarios.
Redundancy choice affects cost, failover behavior, read access options, and compliance expectations.

Common mistake

Choosing the cheapest redundancy tier without confirming recovery-time and recovery-point expectations with the data owner.

Safe next steps

Confirm RTO/RPO, region requirements, read-access needs, and compliance constraints.
Check current account redundancy and failover readiness before changing production storage.
Review backup, soft delete, versioning, and restore evidence separately from redundancy.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceAccess / credential impact

Best match

Managed Identity vs Service Principal

Can Azure manage the identity lifecycle for the workload, or do you need a standalone app identity with credentials?

Managed identity is lifecycle-managed by Azure for a resource; a service principal is an application identity that often requires explicit credential and permission management.

When to use Managed Identity

Use managed identity for Azure-hosted workloads that can authenticate without storing client secrets or certificates.

When to use Service Principal

Use a service principal when the identity must exist outside a single Azure resource lifecycle, cross environments, or integrate with systems that need app credentials.

Key differences

Managed identity removes credential storage and is tied to Azure resource or user-assigned identity lifecycle.
Service principals require explicit credential/certificate rotation and ownership tracking.
Both still need least-privilege role assignments at the correct scope.

Common mistake

Creating service principals with long-lived secrets for Azure workloads that could use managed identity instead.

Safe next steps

List existing identities, credentials, owners, and role assignments before changing authentication.
Prefer managed identity when the workload platform supports it and scope is clear.
If service principal remains required, document rotation owner and expiry monitoring.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceGovernance scope impact

Best match

Management group vs Subscription

Is the decision about organizing governance across subscriptions, or operating resources inside one subscription boundary?

Management groups organize and govern subscriptions; subscriptions are billing, quota, and resource-management containers.

When to use Management group

Use management groups to apply governance, policy, and RBAC structure across multiple subscriptions.

When to use Subscription

Use subscriptions as the billing, quota, access, and resource-management boundary for workloads or environments.

Key differences

Management groups are hierarchy and governance containers, not resource deployment locations.
Subscriptions contain resource groups and resources and carry billing/quota/provider registration concerns.
Inheritance from management groups can surprise teams working only at subscription level.

Common mistake

Troubleshooting a policy or role assignment only inside the subscription while the effective assignment is inherited from a management group.

Safe next steps

Map the management group hierarchy and subscription placement before changing governance.
Check inherited policy and role assignments before creating local exceptions.
Record the exact scope string for every assignment or exemption.

Related terms

Open the next path

Related runbook Command path Search all

MonitoringAlerting impact

Best match

Metric alert vs Log alert

Can a metric threshold detect the condition, or does the alert require a KQL query?

Metric alerts evaluate Azure platform metrics; log alerts run queries over Log Analytics data and can express richer conditions at higher query and ingestion complexity.

When to use Metric alert

Use a metric alert for fast threshold alerts on platform metrics with predictable dimensions.

When to use Log alert

Use a log alert when the condition requires KQL, joins, text matching, custom tables, or correlated evidence.

Key differences

Metric alerts are usually simpler, faster, and less dependent on log ingestion delays.
Log alerts are more flexible but depend on table schema, ingestion latency, query quality, and cost.
Alert noise is a design problem: severity, frequency, window, dimensions, and action groups all matter.

Common mistake

Writing a log alert for a simple metric condition, then debugging ingestion delay and noisy KQL instead of using the metric directly.

Safe next steps

Check whether the needed signal exists as a metric with useful dimensions.
If using logs, test the KQL over the incident window before creating the rule.
Review action groups, frequency, severity, and suppression strategy before enabling.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataModel lifecycle

Best match

Model catalog vs model deployment

Is the scenario about choosing a model or running an application against a configured model endpoint?

A model catalog helps users discover/select models; a deployment is the configured runtime endpoint/capacity that applications call.

When to use Model catalog

Use the model catalog when the task is discovering, comparing, or selecting foundation models and capabilities.

When to use Model deployment

Use a model deployment when the task is runtime access, quota, capacity, monitoring, endpoint names, or application integration.

Key differences

Catalog selection is design-time; deployments are runtime resources with cost, quotas, and access patterns.
Applications usually need deployment names/endpoints rather than catalog entries.
Operational questions focus on metrics, throttling, capacity, and deployment health.

Common mistake

Assuming selecting a model in a catalog means the application is ready to call it in production.

Safe next steps

Identify model, deployment name, region, quota, and endpoint.
Check tokens, throttling, metrics, and content safety before release.
Keep official model availability and pricing checks outside the static exam notes.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingOutbound connectivity impact

Best match

NAT Gateway vs Load Balancer outbound rules

Should outbound SNAT be handled at the subnet level, or through load-balancer-tied outbound rules?

NAT Gateway provides managed outbound SNAT for subnets; Load Balancer outbound rules can provide outbound connectivity tied to a load balancer frontend.

When to use NAT Gateway

Use NAT Gateway when a subnet needs predictable outbound internet SNAT with dedicated public IPs or prefixes.

When to use Load Balancer outbound rules

Use Load Balancer outbound rules when outbound behavior is intentionally tied to a Standard Load Balancer backend pool design.

Key differences

NAT Gateway attaches to subnets and becomes the preferred outbound path for resources in those subnets.
Load Balancer outbound rules depend on backend pool membership and load balancer configuration.
SNAT exhaustion, public IP ownership, routes, and subnet association drive troubleshooting.

Common mistake

Adding NAT Gateway without checking existing load balancer outbound rules, public IP dependencies, and route tables.

Safe next steps

List subnet NAT Gateway associations, load balancer outbound rules, public IPs, and route tables.
Check SNAT metrics or connection failure evidence before changing egress.
Test outbound path from a representative VM or workload after the change.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataModel serving impact

Best match

Online endpoint vs Batch endpoint

Does inference need to respond now, or can it run asynchronously over a batch of inputs?

Online endpoints serve real-time inference; batch endpoints process asynchronous inference jobs over larger inputs. Also covers: ML online endpoint vs batch endpoint; Online Endpoint vs Batch Endpoint.

When to use Online endpoint

Use an online endpoint for real-time scoring where callers expect a request/response API and latency matters.

When to use Batch endpoint

Use a batch endpoint when jobs can run asynchronously over files, datasets, or large input sets and callers can wait for output.

Key differences

Online endpoints need traffic routing, authentication, scale, and live availability planning.
Batch endpoints need job scheduling, input/output storage, retry behavior, and throughput planning.
Online cost is tied to serving capacity; batch cost is tied to job execution and data movement.

Common mistake

Using an online endpoint for large offline jobs, then fighting timeout, cost, and scaling issues that batch inference is designed to absorb.

Safe next steps

Confirm latency, throughput, authentication, and failure-retry expectations with the consuming app owner.
Inspect existing endpoint deployments and traffic before changing production inference routes.
Run a representative test payload and record latency, error rate, and cost before committing.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governancePrivilege impact

Best match

Owner vs Contributor vs Reader role

Does the identity need to manage access, change resources, or only view configuration?

Owner can manage access and resources; Contributor can manage resources but not access; Reader can view resources without making changes.

When to use Owner

Use Owner only when the identity must manage role assignments or fully administer the scope.

When to use Contributor / Reader

Use Contributor or Reader when the identity should modify resources without granting access, or only inspect state.

Key differences

Owner can manage access; Contributor can change resources but not grant access; Reader can view configuration.
Broad Owner at subscription or management-group scope is high-risk.
Least privilege often means custom roles or narrower scope, not just choosing among the three common roles.

Common mistake

Granting Owner because a deployment failed, when the real missing permission is narrower and temporary.

Safe next steps

List existing assignments, inherited scope, and recent role changes before granting anything.
Choose the narrowest scope and role that satisfies the task.
Set review or removal evidence for temporary elevated access.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceGovernance / compliance impact

Best match

Policy definition vs Policy initiative

Are you enforcing one governance rule, or grouping several rules into a compliance baseline?

A policy definition describes one rule; a policy initiative groups multiple policy definitions into a larger compliance objective.

When to use Policy definition

Use a policy definition for one specific rule, effect, and parameterized evaluation.

When to use Policy initiative

Use a policy initiative when multiple related definitions should be assigned, tracked, and reported together.

Key differences

Definitions are individual rules; initiatives are grouped rule sets.
Initiatives simplify assignment and compliance reporting for baselines.
Parameters can exist at both definition and initiative assignment layers.

Common mistake

Assigning many separate policy definitions when an initiative would make compliance state and exceptions easier to manage.

Safe next steps

Review existing initiatives before creating a new standalone rule.
Check assignment scope, parameters, exemptions, and enforcement mode.
Use policy compliance runbooks before remediating resources.

Related terms

Open the next path

Related runbook Command path Search all

Exam readinessStudy method choice

Best match

Practice assessment vs scenario lab

Are you trying to measure readiness or build the experience needed to reason through scenario questions?

A practice assessment checks exam-style recall and reasoning; a scenario lab teaches how Azure decisions behave in a real portal/CLI context.

When to use Practice assessment

Use practice assessments to find weak objective areas and get used to exam wording.

When to use Scenario lab

Use scenario labs when you need hands-on context for why a service, command, or configuration choice is correct.

Key differences

Practice questions expose gaps; labs create durable mental models.
Labs should be read-only first and low-cost where possible.
The strongest study loop is Learn objective → Compare decision → command/resource discovery → runbook/lab → practice question.

Common mistake

Only memorizing answers without building the operational context behind scenario questions.

Safe next steps

Use Microsoft Learn as the official objective source.
Map every missed question to a concept, Compare card, and lab action.
Use Resource Graph and read-only CLI commands before running any cost-impacting lab.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataDocument AI model impact

Best match

Prebuilt vs Custom Document Intelligence model

Can a prebuilt extraction model handle the document type, or do your forms require training on your own examples?

Prebuilt models cover common document types; custom models are trained on examples for organization-specific document extraction.

When to use Prebuilt

Use a prebuilt Document Intelligence model for common document types where the built-in fields match what you need.

When to use Custom Document Intelligence model

Use a custom model when layouts, field names, language, or business rules are specific to your organization.

Key differences

Prebuilt models reduce training effort but limit field control.
Custom models require labeled examples and validation but can fit your document shape.
Custom accuracy depends on representative samples, versioning, and ongoing review after document templates change.

Common mistake

Training a custom model before proving that a prebuilt model fails on real samples, or trusting a prebuilt model without testing edge cases.

Safe next steps

Run a sample set through the prebuilt model and record fields, confidence scores, and misses.
If custom is needed, separate training, validation, and holdout samples before production use.
Check data privacy, region, and cost before sending production documents through any model.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingName resolution impact

Best match

Private DNS zone vs Public DNS zone

Should this name resolve only inside Azure/private networks, or should it be publicly resolvable?

Private DNS zones resolve names inside linked virtual networks; public DNS zones publish names for public resolution on the internet.

When to use Private DNS zone

Use Private DNS zones for private endpoints, internal service names, and VNet-linked name resolution that should not be public.

When to use Public DNS zone

Use public DNS zones when internet clients must resolve the name through public DNS.

Key differences

Private DNS depends on VNet links and private records.
Public DNS is externally resolvable and must be managed with public exposure in mind.
Private endpoint failures are often DNS failures, not endpoint failures.

Common mistake

Creating a private endpoint but leaving clients resolving the old public name or missing the required private DNS zone link.

Safe next steps

Check which resolver path clients use.
List private DNS zone links and records before cutover.
Validate CNAME/A records from the network where the app actually runs.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataAI security boundary

Best match

Private endpoint vs managed identity for AI security

Is the scenario about network reachability or identity-based authorization?

Private endpoints restrict network path; managed identity controls who or what can authenticate and authorize requests.

When to use Private endpoint

Use private endpoints when the requirement is to remove public network exposure and route service access over private network paths.

When to use Managed identity

Use managed identity when the requirement is to avoid secrets and grant scoped identity-based access.

Key differences

Private endpoint answers “where can traffic come from?”
Managed identity answers “who is allowed to call or manage this?”
Strong designs often need both, plus DNS, diagnostics, and least-privilege roles.

Common mistake

Adding a private endpoint and assuming identity/role assignments are no longer needed.

Safe next steps

Separate network controls from identity controls in the scenario.
Validate private DNS resolution before disabling public access.
Review role assignments and keys before release.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingReachability impact

Best match

Private Endpoint vs Service Endpoint

Do you need a private IP connection to a specific resource, or VNet-restricted access to a service endpoint?

Private Endpoint gives a private IP for a specific resource; Service Endpoint extends VNet identity to a public service endpoint without creating a private IP.

When to use Private Endpoint

Use Private Endpoint when the service should be reached through a private IP in your VNet and public exposure should be minimized.

When to use Service Endpoint

Use Service Endpoint when the service can remain on its public endpoint but access should be restricted to selected virtual networks/subnets.

Key differences

Private Endpoint creates a network interface with a private IP for a specific resource.
Service Endpoint extends subnet identity to supported PaaS services while the service endpoint remains public-facing.
Private DNS, approval state, network policies, and public network access settings are common failure points.

Common mistake

Creating a private endpoint and forgetting private DNS, causing clients to keep resolving the public endpoint.

Safe next steps

Check public network access, private endpoint connection state, DNS zone links, and client resolution before changes.
Map which subnet and workload must reach the service.
Test name resolution and connection from the consuming network before disabling public access.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataAI workflow design

Best match

Prompt flow vs code-first orchestration

Do you need a managed AI workflow/evaluation surface or application code that owns the orchestration path?

Prompt flow helps design, evaluate, and iterate AI workflows; code-first orchestration gives app teams full control over runtime behavior, deployment, and integration logic.

When to use Prompt flow

Use prompt flow when the focus is prompt-chain design, evaluation, traceability, and fast iteration with AI workflow assets.

When to use Code-first orchestration

Use code-first orchestration when the app must own branching, retries, authorization, deployment, telemetry, and integration behavior directly.

Key differences

Prompt flow helps with experimentation and evaluation; code owns production-grade control and app-specific lifecycle.
Code-first paths may integrate better with existing CI/CD and testing standards.
Prompt flow can clarify model and prompt behavior before teams harden the production implementation.

Common mistake

Treating prompt flow as a complete production architecture without checking identity, observability, deployment, and rollback requirements.

Safe next steps

Map the flow steps to the app runtime path.
Check how secrets, identities, and telemetry are handled.
Use a deployment runbook before promoting prompt or code changes.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataGenerative AI grounding

Best match

RAG with vector search vs fine-tuning

Is the problem missing knowledge/evidence or does the model need different learned behavior?

RAG grounds model responses with retrieved content; fine-tuning changes model behavior for patterns, style, or task adaptation.

When to use RAG with vector search

Use RAG when answers must cite or use fresh, permissioned, domain content without retraining a model.

When to use Fine-tuning

Use fine-tuning when the model needs to adapt behavior, format, or task patterns that prompts and retrieval cannot solve.

Key differences

RAG keeps knowledge in data sources and indexes; fine-tuning changes the model artifact or deployment behavior.
RAG needs retrieval quality, chunking, indexing, and access control.
Fine-tuning needs training data quality, evaluation, cost, deployment governance, and rollback planning.

Common mistake

Fine-tuning a model just because the model does not know private company documents. That is usually a retrieval/grounding problem first.

Safe next steps

Identify whether the exam scenario asks for current facts, citations, or private data.
Test retrieval precision and permission filtering before changing model behavior.
Compare token, index, and training costs before production.

Related terms

Open the next path

Related runbook Command path Search all

Backup & resilienceRestore / data-loss impact

Best match

Recovery point vs Snapshot

Are you using managed backup restore history, or a direct point-in-time copy of a storage/disk object?

A recovery point is managed by backup/recovery tooling; a snapshot is a point-in-time copy of a disk, blob, or file share object.

When to use Recovery point

Use recovery points when restore needs to follow backup policy, retention, vault protection, and workload-aware recovery workflows.

When to use Snapshot

Use snapshots for storage or disk point-in-time copy scenarios where lifecycle, consistency, and retention are managed separately.

Key differences

Recovery points are tied to backup policy and vault behavior.
Snapshots may not provide the same application consistency or retention governance.
Both can create cost and data-exposure concerns if left unmanaged.

Common mistake

Assuming a snapshot is a full backup strategy without checking retention, consistency, restore process, or ownership.

Safe next steps

Check protected items, backup policy, and recent job status.
List snapshots and confirm age, owner, and delete safety.
Test restore path before relying on it for exam scenarios or production.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceScope / lifecycle impact

Best match

Resource group vs Resource

Are you managing a lifecycle boundary for related assets, or one specific Azure object?

A resource group is a management container; a resource is the individual service instance being managed.

When to use Resource group

Use a resource group as the management, deployment, tagging, RBAC, and lifecycle container for related resources.

When to use Resource

Use resource-level operations when the change should affect only one Azure resource and not the whole group.

Key differences

Resource-group actions can affect every contained resource.
Resource-level actions are narrower but may still affect dependencies.
RBAC, locks, policy, tags, and deployment history can exist at both levels.

Common mistake

Deleting or locking at resource-group level when the request was really about one resource.

Safe next steps

Inventory all resources in the group before any group-level change.
Check locks, policy, role assignments, and dependencies at both group and resource scope.
Use resource-level action when blast radius should be narrow.

Related terms

Open the next path

Related runbook Command path Search all

GovernanceGovernance control choice

Best match

Resource lock vs Azure Policy deny

Should the control protect existing resources from accidental changes or prevent noncompliant configurations from being created?

Resource locks block delete or write actions at a scope; Policy deny prevents noncompliant resource create/update requests based on rules.

When to use Resource lock

Use a resource lock when the goal is to protect a critical resource or scope from accidental delete/write actions.

When to use Policy deny

Use Policy deny when the goal is to enforce configuration rules before resources are created or updated.

Key differences

Locks are blunt protection on a resource/scope; policy deny is rule-based governance.
Locks can surprise operators during legitimate maintenance.
Policy deny needs assignment scope, parameters, exemptions, and compliance review.

Common mistake

Using locks as a substitute for governance policy or using deny policy without testing exemptions and deployment impact.

Safe next steps

Identify whether the risk is accidental change or noncompliant configuration.
Review lock inheritance and policy assignments before remediation.
Test policy in audit mode when possible before deny.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingTraffic flow / security impact

Best match

Route table vs Network Security Group

Are you troubleshooting path selection, or traffic allow/deny rules?

Route tables influence where packets go; Network Security Groups decide whether allowed traffic can pass.

When to use Route table

Use route table analysis when traffic is taking the wrong next hop, missing a firewall, or bypassing expected routing.

When to use Network Security Group

Use NSG analysis when traffic reaches the right path but is allowed or blocked by security rules.

Key differences

Routing and filtering are separate layers.
Effective routes and effective NSG rules should both be checked for VM traffic.
A firewall path can fail because of either routing or rules.

Common mistake

Changing NSG rules when the real problem is a route table or next-hop issue.

Safe next steps

Check effective routes and effective NSG rules from the NIC/subnet perspective.
Verify route association and NSG association at subnet/NIC scope.
Use Network Watcher before changing production networking.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataSearch enrichment impact

Best match

Search indexer vs Skillset

Are you moving data into a search index, enriching it during indexing, or both?

An indexer reads and loads data; a skillset enriches content during indexing before it lands in the search index.

When to use Search indexer

Use an indexer when the job is connecting to a supported data source, detecting changes, and loading documents into an index.

When to use Skillset

Use a skillset when content needs enrichment such as extraction, chunking, OCR, or AI transformation before it lands in the index.

Key differences

An indexer controls data source reads, schedules, field mappings, and indexing runs.
A skillset controls enrichment steps that transform or add fields during indexing.
A failed indexer run may be caused by data access, mapping, or skill execution, so inspect both when troubleshooting.

Common mistake

Adding a skillset to fix a data-loading problem. If the indexer cannot reach or map the source, enrichment will not solve it.

Safe next steps

List the data source, index, indexer, and skillset before editing any one of them.
Run or inspect indexer status in a non-production window if reindexing can change visible search results.
Capture failing field names, skill errors, and document counts before changing mappings.

Related terms

Open the next path

Related runbook Command path Search all

IntegrationMessaging / integration impact

Best match

Service Bus Queue vs Topic

Should each message go to one consumer path, or be published to multiple subscriptions?

A queue has point-to-point consumers; a topic publishes messages to subscriptions with filters.

When to use Service Bus Queue

Use a Service Bus queue for point-to-point work where one receiver should process each message.

When to use Topic

Use a Service Bus topic when publishers should send once and multiple subscriptions receive filtered copies.

Key differences

Queues model competing consumers for one work stream.
Topics add subscriptions and filters for pub/sub fan-out.
Dead-letter handling, sessions, duplicate detection, and lock duration still matter in both.

Common mistake

Building fan-out by adding multiple consumers to one queue, then discovering only one consumer gets each message.

Safe next steps

Confirm whether the business event has one owner or multiple independent subscribers.
Inspect subscription filters and dead-letter counts before routing changes.
Test with duplicate, failed, and delayed messages before production rollout.

Related terms

Open the next path

Related runbook Command path Search all

Identity + securityCredential strategy

Best match

Service principal secret vs federated credential

Should automation store a secret or use workload federation to avoid long-lived credentials?

A client secret is a stored credential; a federated credential lets an external identity provider exchange tokens without a long-lived secret.

When to use Service principal secret

Use a service principal secret only when federation is unavailable and rotation, storage, and access controls are strong.

When to use Federated credential

Use a federated credential when CI/CD or workload identity can trust an external identity provider without storing a client secret.

Key differences

Secrets must be stored, rotated, and protected.
Federation reduces secret sprawl but requires correct issuer, subject, and audience configuration.
Exam scenarios often reward eliminating stored secrets when a supported federation path exists.

Common mistake

Treating a secret in a pipeline variable as equivalent to identity-based federation.

Safe next steps

Inventory app credentials before rotation.
Confirm federated issuer/subject/audience values.
Use least-privilege role assignments at the narrowest useful scope.

Related terms

Open the next path

Related runbook Command path Search all

StorageData access impact

Best match

Service SAS vs User Delegation SAS

Should access be signed with storage account authority, or delegated from an Entra-authenticated user?

A service SAS is signed with storage account credentials; a user delegation SAS is secured with Microsoft Entra credentials.

When to use Service SAS

Use a service SAS when you intentionally need a SAS signed with account key authority for supported storage resources.

When to use User Delegation SAS

Use a user delegation SAS for Blob access when Entra-based delegation is available and stronger identity governance is preferred.

Key differences

Service SAS relies on storage account key material and can be broad if not scoped carefully.
User delegation SAS is tied to Entra authentication and is preferred for many Blob scenarios.
Both still require tight expiry, permissions, IP/protocol restrictions, and evidence of who issued the token.

Common mistake

Issuing a long-lived broad SAS because it is quick, then losing track of who can access the data.

Safe next steps

Confirm resource type, required permissions, expiry, IP range, and protocol before issuing SAS.
Prefer user delegation where supported and operationally practical.
Record the issuing identity, scope, expiry, and revocation/rotation path.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataSpeech workload choice

Best match

Speech to text vs Speech translation

Does the app need transcription only, or transcription plus translation?

Speech to text transcribes spoken audio; speech translation transcribes and translates speech across languages.

When to use Speech to text

Use speech to text when the output should preserve the source language as text.

When to use Speech translation

Use speech translation when the scenario requires translated text or multilingual speech workflows.

Key differences

Speech to text is the base transcription workload.
Speech translation adds language translation requirements and related quality considerations.
Both require attention to language, audio quality, region, latency, and privacy requirements.

Common mistake

Choosing translation when the scenario only asks for transcription or captions in the original language.

Safe next steps

Identify source language, target language, and latency requirement.
Test representative audio, accents, and noise conditions.
Check endpoint and key handling before release.

Related terms

Open the next path

Related runbook Command path Search all

StorageData access impact

Best match

Storage account key vs SAS

Does the consumer need full account-level key access, or a scoped, time-bound token?

Storage account keys grant broad shared-key access; SAS tokens delegate scoped access for a time, permission set, and resource scope. Also covers: SAS vs storage account access key.

When to use Storage account key

Use storage account keys only when account-level shared-key access is explicitly required and tightly controlled.

When to use SAS

Use SAS when the need can be scoped to specific resources, permissions, time windows, IP ranges, or protocols.

Key differences

Account keys can grant broad access across the storage account.
SAS can narrow access but must be generated and governed carefully.
Disabling shared key, rotating keys, or revoking SAS can break dependent apps if not inventoried first.

Common mistake

Sharing an account key for a narrow upload/download task that should have used a scoped SAS or managed identity.

Safe next steps

Inventory apps using account keys, connection strings, and SAS before rotation or disabling shared key.
Prefer the narrowest access method that satisfies the consumer.
Capture expiry, permissions, and owner for every SAS issued.

Related terms

Open the next path

Related runbook Command path Search all

StorageData access impact

Best match

Storage Queue vs Service Bus Queue

Is this a simple storage-backed queue, or enterprise messaging with broker features?

Storage Queue is simple queue storage; Service Bus Queue is an enterprise broker with sessions, dead-lettering, and advanced messaging. Also covers: Queue Storage vs Service Bus.

When to use Storage Queue

Use Storage Queue for simple asynchronous work items where basic queue semantics are enough.

When to use Service Bus Queue

Use Service Bus Queue when the workload needs richer broker features such as sessions, dead-lettering, duplicate detection, transactions, or advanced retry behavior.

Key differences

Storage Queue is simpler and tied to storage account patterns.
Service Bus Queue is built for messaging semantics and operational controls.
Poison-message handling, ordering, lock duration, and delivery guarantees are the deciding factors.

Common mistake

Starting with Storage Queue for a business process that later needs dead-letter inspection, sessions, or stronger delivery semantics.

Safe next steps

Define ordering, retry, duplicate, poison-message, and dead-letter requirements.
Inspect queue length, age, failures, and consumer behavior before migration.
Test failure and replay behavior before production cutover.

Related terms

Open the next path

Related runbook Command path Search all

Identity & governanceIdentity lifecycle impact

Best match

System-assigned vs User-assigned managed identity

Should the identity live and die with one resource, or be reusable across resources?

A system-assigned identity is tied to one Azure resource lifecycle; a user-assigned identity is a standalone reusable identity that can be attached to multiple resources.

When to use System-assigned managed identity

Use a system-assigned managed identity when one Azure resource needs its own identity tied to that resource lifecycle.

When to use User-assigned managed identity

Use a user-assigned managed identity when multiple resources need the same identity or when identity lifecycle must be managed separately.

Key differences

System-assigned identity is created and deleted with the resource.
User-assigned identity is a standalone Azure resource that can be attached to multiple workloads.
Role assignments and cleanup differ; deleting a workload may not remove user-assigned identity access.

Common mistake

Using a user-assigned identity for convenience and forgetting that its role assignments survive after workloads are deleted.

Safe next steps

List attached resources and role assignments for the identity before changing it.
Choose system-assigned for simple one-resource access; user-assigned for reuse or stable identity needs.
Audit stale user-assigned identities and remove unused role assignments.

Related terms

Open the next path

Related runbook Command path Search all

Core platformScope and identity

Best match

Tenant vs subscription

Is the exam scenario asking about identity/directory scope or resource/billing/deployment scope?

A tenant is the Microsoft Entra identity boundary; a subscription is an Azure billing and resource-management boundary.

When to use Tenant

Use tenant when the scenario is users, groups, app registrations, identities, consent, and directory-wide controls.

When to use Subscription

Use subscription when the scenario is resources, quotas, policies, role assignments, deployments, and billing/cost scope.

Key differences

Tenant and subscription are related but not interchangeable.
RBAC role assignments can be scoped under subscriptions, but identities come from a tenant.
Many mistakes come from changing the wrong account, tenant, or subscription in CLI context.

Common mistake

Answering a subscription question with a tenant control, or running commands in the wrong subscription after switching directories.

Safe next steps

Run az account show before lab commands.
Confirm tenant ID and subscription ID separately.
Use runbooks for RBAC and subscription-scope changes.

Related terms

Open the next path

Related runbook Command path Search all

StorageData access impact

Best match

User delegation SAS vs Account SAS

Can the SAS be based on Entra delegation, or must it be signed with account key authority?

User delegation SAS uses Microsoft Entra credentials for Blob access; account SAS relies on storage account keys and can authorize broader account operations.

When to use User delegation SAS

Use user delegation SAS for Blob scenarios where Entra-authenticated delegation is supported and preferred.

When to use Account SAS

Use account SAS only when the required resource type or operation requires account-key signing and the risk is accepted.

Key differences

User delegation SAS avoids direct account-key use and ties issuance to Entra authorization.
Account SAS can cover broader service/resource types but inherits account-key risk.
Both need narrow permissions, short expiry, and owner-recorded evidence.

Common mistake

Using account SAS by habit even when user delegation SAS would provide a better identity-governed path.

Safe next steps

Confirm the storage service, operation, and client support before choosing SAS type.
Use short expiry and minimal permissions; record who issued the token and why.
Review shared-key settings, key rotation impact, and active SAS dependencies.

Related terms

Open the next path

Related runbook Command path Search all

AI + dataSearch relevance impact

Best match

Vector Search vs Semantic Ranker

Do users need similarity over embeddings, language-aware reranking of text results, or a hybrid of both?

Vector search finds embedding similarity; semantic ranker reranks text results with deeper language understanding. Also covers: Semantic ranking vs Vector search; Vector search vs semantic ranker.

When to use Vector Search

Use vector search when meaning similarity matters, such as finding related passages even when users do not use exact keywords.

When to use Semantic Ranker

Use semantic ranker when text results need language-aware reranking, captions, or answer-style improvements after retrieval.

Key differences

Vector search depends on embeddings, vector fields, chunking strategy, and distance similarity.
Semantic ranker works over text results and semantic configuration; it does not replace an embedding strategy.
Hybrid retrieval can combine keyword, vector, and semantic ranking, but each layer adds cost and tuning complexity.

Common mistake

Turning on every relevance feature at once and then not knowing whether vector quality, keyword matching, or semantic ranking caused the result.

Safe next steps

Create a small relevance test set with expected documents before changing ranking behavior.
Check index fields, vector dimensions, semantic configuration, and query mode before production changes.
Measure quality and latency separately for keyword, vector, semantic, and hybrid queries.

Related terms

Open the next path

Related runbook Command path Search all

ArchitectureCompute scale / availability impact

Best match

Virtual Machine Scale Set vs Availability set

Do you need a managed scalable fleet, or fixed VM placement for availability of individually managed VMs?

A VM Scale Set manages a group of VM instances as a scalable fleet; an availability set spreads individual VMs across fault and update domains.

When to use Virtual Machine Scale Set

Use VM Scale Sets when you need autoscale, uniform/flexible orchestration, rolling upgrades, or a fleet of similar VM instances.

When to use Availability set

Use availability sets when you have individually managed VMs that need fault/update-domain placement without fleet scaling.

Key differences

Scale sets are about fleet management and scaling.
Availability sets are about VM placement across fault/update domains.
Scale sets can integrate with load balancers and autoscale rules; availability sets do not provide that fleet lifecycle by themselves.

Common mistake

Using an availability set when the real requirement is autoscale and rolling instance management.

Safe next steps

Clarify whether the workload needs instance individuality or fleet behavior.
Review load balancing, image, upgrade, and autoscale requirements.
Check VM/disks/network dependencies before migration.

Related terms

Open the next path

Related runbook Command path Search all

NetworkingConnectivity / routing impact

Best match

VNet peering vs VPN Gateway

Are you connecting Azure VNets directly, or do you need encrypted tunnel connectivity across on-premises, branches, or separate network boundaries?

VNet peering connects Azure virtual networks through Microsoft backbone routing; VPN Gateway connects networks through encrypted tunnels, often across on-premises or separate environments.

When to use VNet peering

Use VNet peering for low-latency private connectivity between Azure virtual networks when address spaces do not overlap and transitive routing is not assumed.

When to use VPN Gateway

Use VPN Gateway when you need IPsec/IKE tunnel connectivity, hybrid connectivity, point-to-site users, or network boundaries that require gateway mediation.

Key differences

Peering is not a VPN tunnel and does not automatically provide transitive routing.
VPN Gateway introduces gateway SKUs, tunnel configuration, routing, and availability design.
Peering is usually simpler for same-cloud VNet-to-VNet connectivity; gateways are required for many hybrid scenarios.

Common mistake

Building a hub-and-spoke design with peering but forgetting that traffic does not transit unless routing and gateway transit are explicitly designed.

Safe next steps

Inventory address spaces and route tables before connecting networks.
Check DNS resolution and private endpoint dependencies before cutover.
Use Resource Graph and network runbooks before changing routes.

Related terms

Open the next path

Related runbook Command path Search all

Make the Azure decision before you build, expose, scale, or change.

Pick the kind of Azure decision you are making.

ACR admin user vs managed identity pull

ACR tag vs digest

Action group vs Alert rule

AI Foundry project vs Azure AI services resource

AI Search indexer vs manual ingestion

AKS system vs user node pool

AKS Workload Identity vs Managed Identity

API key vs managed identity for AI service access

API Management vs Application Gateway

App Service plan vs Functions hosting plan

App Service slot swap vs direct production deployment

App Service vs Azure Functions

App Service vs Static Web Apps

Application Gateway vs Front Door

Application Security Group vs Network Security Group

Availability zone vs Availability set

Azure Advisor recommendation vs Policy compliance

Azure AI Content Safety vs Prompt Shields

Azure AI Search vs Azure OpenAI

Azure AI Vision OCR vs Document Intelligence

Azure budget vs Cost alert

Azure Firewall vs Network Security Group

Azure Monitor metrics vs logs

Azure Policy vs Management Locks

Azure RBAC vs Azure Policy

Azure Resource Graph vs Log Analytics KQL

Azure SQL Database vs SQL Managed Instance

Azure SQL vs Cosmos DB

Backup vault vs Recovery Services vault

Bicep vs ARM Template

Blob Storage vs Azure Files

Blob Storage vs Data Lake Storage Gen2

Container Apps vs AKS

Container Apps vs Container Instances

Control Plane vs Data Plane

Conversational Language Understanding vs custom text classification

Cosmos DB database vs container

Cosmos DB logical vs physical partition

Cosmos DB manual vs autoscale throughput

Cosmos DB serverless vs Provisioned throughput

Data Factory vs Synapse Pipelines

Dedicated SQL Pool vs Serverless SQL Pool

Delegated vs Application Permission

Diagnostic setting vs Activity Log

Diagnostic settings vs data collection rule

Document Intelligence layout model vs prebuilt model

DTU vs vCore

Event Grid vs Event Hubs

Event Hubs vs Service Bus

Event Hubs vs Service Bus vs Event Grid

ExpressRoute vs Site-to-site VPN

Foundry agent vs chat completion

Front Door WAF vs Application Gateway WAF

Functions Consumption vs Premium

Generalized vs specialized VM image

Group-based RBAC vs Direct user role assignment

Hot vs Cool vs Archive Blob Tiers

Incremental vs Complete Deployment Mode

Integrated vectorization vs custom embedding pipeline

Key Vault key vs Secret vs Certificate

Key Vault RBAC vs Access Policy

Load Balancer vs Application Gateway

Log Analytics workspace vs Application Insights

Logic Apps Standard vs Consumption

LRS vs ZRS vs GRS

Managed Identity vs Service Principal

Management group vs Subscription

Metric alert vs Log alert

Model catalog vs model deployment

NAT Gateway vs Load Balancer outbound rules

Online endpoint vs Batch endpoint

Owner vs Contributor vs Reader role

Policy definition vs Policy initiative

Practice assessment vs scenario lab

Prebuilt vs Custom Document Intelligence model

Private DNS zone vs Public DNS zone

Private endpoint vs managed identity for AI security

Private Endpoint vs Service Endpoint