Monitoring and Observability Alerts premium

Action group

An Azure Monitor action group defines who gets notified and which actions run when an alert fires. It can include notification receivers such as email, SMS, and push, plus automation actions such as webhooks, secure webhooks, Azure Functions, Logic Apps, runbooks, ITSM, or Event Hubs.

Back to glossary browser Open Microsoft Learn source

Aliases: Azure Monitor action group, alert action group, monitor action group
Difficulty: fundamentals
CLI mappings: 5
Last verified: 2026-05-08

Microsoft Learn

Microsoft Learn: Create and manage action groups in Azure Monitor2026-05-08

Technical context

In Azure architecture, this term sits in the observability control plane. Alert rules detect conditions through metrics, logs, activity events, Service Health, Advisor, or other monitoring signals, but action routing determines who responds and what automation starts. The object is usually deployed in a resource group, referenced by alert rules, and inspected with Azure Monitor commands or templates. A strong design separates detection logic from response ownership so operations can reuse notification targets, rotate responders, and audit automated actions without rewriting every alert rule. For Action group, the important fields are receivers, action types, short name, resource group, region or global setting, and the alert rules that reference the group. Multiple alert rules can reuse one group, and action execution is concurrent rather than ordered.

Why it matters

This matters because an alert without reliable routing is only stored noise. If the wrong team is notified, if automation fails silently, if old receivers stay configured, or if several alert rules duplicate contact lists, incidents take longer and engineers lose trust in monitoring. The practical habit is to treat alert response as architecture: define owner, severity, receiver, automation endpoint, evidence, test method, privacy region, and fallback before relying on an alert in production. For Action group, the most useful evidence is not the product label; it is the concrete output that shows current state, ownership, scope, and the next safe action.

Where you see it

Signals, screens, and Azure surfaces where this term usually becomes operational.

Signal 01

Azure Monitor alert rules

Signal 02

Service Health alerts

Signal 03

Advisor alerts

Signal 04

incident routing

Signal 05

az monitor action-group commands

When this becomes relevant

Specific situations where this term helps solve real Azure design, operations, migration, security, reliability, cost, or governance problems.

Route alerts to people, teams, and automation endpoints.
Reuse one notification target across multiple alert rules.
Trigger diagnostic or remediation workflows when alerts fire.
Separate warning, critical, platform, and application response paths.

Real-world case studies

Different enterprise-style examples that show the term being used to hit measurable objectives.

Case study 01

Action group in action

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

Stonegate Finance had alert rules across SQL, AKS, and App Service, but notifications went to old distribution lists and no one owned automation responses.

Business/Technical Objectives

Create reusable alert response targets for production services.
Route critical alerts to on-call engineers and automation runbooks.
Reduce missed alerts during weekend incidents.
Standardize evidence for alert routing reviews.

Solution Using Action group

The observability team created Azure Monitor action groups for platform, database, and application operations. Each action group defined who was notified and what automation ran when alerts fired, including email, SMS, secure webhook, and Azure Function actions. Metric alerts and log search alerts were updated to reference the appropriate reusable action group instead of one-off contacts. The team documented that multiple alert rules could use the same action group and that actions execute concurrently without a guaranteed order, so automation steps could not depend on sequential execution. The action group inventory was reviewed monthly through Azure CLI output.

Results & Business Impact

Missed critical alerts dropped from 6 per quarter to 1.
Alert rule creation time fell 35% because teams reused approved groups.
Weekend mean time to acknowledge improved from 42 minutes to 11 minutes.
Obsolete notification contacts were removed from 83 alert rules.

Key Takeaway for Glossary Readers

An action group is the operational bridge between an alert firing and the right people or automation responding.

Case study 02

Action group in action

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

Greenfield Clinics needed Azure Service Health and database alerts to reach different teams without copying contact settings into every alert rule.

Business/Technical Objectives

Separate clinical application alerts from infrastructure alerts.
Use reusable notification and automation targets.
Meet a 15-minute acknowledgment objective for critical incidents.
Keep privacy-sensitive alert routing within approved regions where required.

Solution Using Action group

Cloud operations designed action groups around responder ownership. Clinical application alerts used an action group that notified the application on-call rotation and opened an ITSM incident. Database performance alerts used a separate action group that notified DBAs and triggered a Logic App to collect metrics. Service Health alerts used a global action group because public-cloud Service Health alerting required that configuration. For privacy-sensitive workloads, regional action groups were selected where the organization required regional processing. The team kept action group names, short names, receivers, and automation endpoints in source-controlled deployment templates and verified them with Azure CLI. The runbook also recorded the owner, scope, verification command, rollback contact, and evidence to save after the change so operators could repeat the pattern safely.

Results & Business Impact

Critical alert acknowledgment met the 15-minute target in 96% of incidents.
Duplicate receiver configuration was removed from 41 alert rules.
DBA triage packets were generated automatically for high-session alerts.
Service Health routing reached the enterprise incident bridge reliably.

Key Takeaway for Glossary Readers

Action groups make alert response reusable, reviewable, and aligned to real ownership.

Case study 03

Action group in action

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

Nimbus Retail saw too many low-value alert emails, so engineers ignored notifications until customer-impacting issues escalated.

Business/Technical Objectives

Separate warning notifications from critical incident automation.
Reduce noisy email-only alerts by at least 50%.
Trigger remediation only for alerts with clear runbook ownership.
Measure response improvement after routing changes.

Solution Using Action group

The monitoring team rebuilt alert routing around action groups. Warning-level metric alerts used an email and Teams webhook action group for awareness. Critical alerts used a different group that paged on-call engineers, created an ITSM incident, and invoked an Azure Function to collect diagnostics. Alert rules were limited to the action groups they truly needed because a single alert rule can attach only a small number of action groups. Operators verified action group memberships with `az monitor action-group list` and used alert history to confirm which notifications fired. The design avoided chaining actions that depended on execution order. The runbook also recorded the owner, scope, verification command, rollback contact, and evidence to save after the change so operators could repeat the pattern safely.

Results & Business Impact

Low-value alert emails decreased 64% in the first month.
Critical incident acknowledgment improved from 28 minutes to 8 minutes.
Automated diagnostic collection saved about 20 minutes per major incident.
Alert routing review became a quarterly evidence export instead of manual screenshots.

Key Takeaway for Glossary Readers

An action group is powerful because it separates alert detection from disciplined response design.

Why use Azure CLI for this?

Azure CLI is useful for Action group because it turns portal-visible configuration into repeatable evidence. Operators can list the target, inspect IDs and state, confirm whether the command is read-only or mutating, and save sanitized output for release or incident records. For this term, CLI work should start with context checks such as tenant, subscription, resource group, registry, database, role scope, or resource ID. Use JSON output when tags, digests, receiver lists, token permissions, assignment scopes, metrics, or replica links matter. Treat commands that delete manifests, disable tokens, update scope maps, create alerts, or fail over databases as approval-required changes, not casual inspection.

CLI use cases

Inventory the current Action group configuration before a release, access change, cleanup, alert update, or database operation so the operator can prove the target is correct.
Troubleshoot production behavior by comparing live Azure output for Action group with the expected architecture, owner, scope, tag, permission, metric, or replica state.
Automate a repeatable verification check around Action group in a pipeline or runbook so drift is caught before users, workloads, or auditors discover it.
Create sanitized post-change evidence for Action group, keeping IDs, state, timestamps, digests, run IDs, metrics, or links while redacting secrets and sensitive endpoints.

Before you run CLI

Confirm the active tenant and subscription with `az account show`; many Action group mistakes are wrong-context mistakes that look like product failures.
Identify the exact resource boundary before running commands: registry, repository, scope map, token, task, webhook, action group, role scope, database, or replica link.
Classify the command as read-only, security-impacting, cost-impacting, destructive, or availability-impacting; approval is required before mutating Action group in production.
Decide how sensitive output will be handled before the command runs, especially token passwords, webhook URIs, object IDs, connection details, and registry metadata.

What output tells you

The output confirms whether Azure returned the intended Action group target by name, resource ID, subscription, region, repository, database, role scope, or alert group.
State fields show what Azure will actually use: tags, digests, retention status, repository actions, token status, task triggers, receivers, session metrics, or replica links.
Nested JSON often contains the important evidence; table output can hide permissions, timestamps, action lists, credential status, endpoint scope, or metric dimensions.
The output also tells you whether the next action should be inspection, approval, rollback, scale, cleanup, failover, credential rotation, or a safer design review.

Mapped Azure CLI commands

Action group commands

direct

az monitor action-group list --resource-group <resource-group> --output table

az monitor action-groupdiscoverMonitoring and Observability

az monitor action-group show --name <action-group> --resource-group <resource-group>

az monitor action-groupdiscoverMonitoring and Observability

az monitor action-group create --name <action-group> --resource-group <resource-group> --short-name <short-name>

az monitor action-groupprovisionMonitoring and Observability

az monitor metrics alert create --name <alert-name> --resource-group <resource-group> --scopes <resource-id> --condition '<condition>' --action-group <action-group-id>

az monitor metrics alertprovisionMonitoring and Observability

az monitor action-group delete --name <action-group> --resource-group <resource-group>

az monitor action-groupremoveMonitoring and Observability

Architecture context

Design action groups around responder ownership and severity. Keep warning notifications separate from critical paging, use secure automation endpoints, document privacy-region decisions, and test routing after receiver changes. Do not let obsolete email addresses become the hidden reason alerts fail.

Security

Security depends on who can change alert routing and what automation an alert can trigger. Action groups can call webhooks, functions, runbooks, ITSM systems, and event hubs, so a weak receiver or stale endpoint can become a privileged automation path. For Action group, restrict update rights, secure webhook endpoints, avoid exposing secrets in payloads, and review receiver lists so sensitive incidents are routed to approved teams without leaking operational details.

Cost

Cost impact is indirect but important. Good action groups reduce wasted incident time, duplicate alert configuration, and manual diagnostic collection. Poorly designed automation can also generate downstream costs by triggering functions, runbooks, tickets, or event ingestion too often. For Action group, cost review should ask whether low-value alerts are routed differently from critical alerts and whether automation is scoped to signals with a clear operational owner.

Reliability

Reliability depends on alert response working when the platform is already under stress. If an action group points to an old mailbox, a disabled phone number, or a failing automation endpoint, detection does not become response. For Action group, test receivers, keep fallback notification paths, avoid depending on execution order, and verify global or regional behavior against privacy and resilience requirements before trusting it for production incidents.

Performance

Performance is about response speed, not application latency. Action group design affects how quickly an alert reaches the right person or workflow and how much diagnostic context is collected before engineers join. For Action group, test notification latency, endpoint response, receiver freshness, and action concurrency. Do not build remediation steps that assume actions run sequentially unless the workflow itself enforces ordering.

Operations

Operational excellence means alert routing is reusable, documented, and easy to audit. Use Azure CLI and templates to list action groups, inspect receivers, attach them to alert rules, and confirm which rules depend on each group. For Action group, the runbook should state owner, severity, receivers, automation target, escalation path, last test date, and what evidence proves notifications fired during an incident or drill.

Common mistakes

Treating Action group as a label instead of a boundary with owner, scope, evidence, and rollback consequences.
Running mutating commands from the wrong subscription, resource group, registry, role scope, database, or region because the Azure CLI context was not checked first.
Saving raw secrets, token passwords, webhook URLs, or sensitive identity details in tickets instead of sanitized operational evidence.
Assuming a successful command proves the design is correct; it only proves Azure accepted the request or returned the current state.

Operator quick checks

Can you name the owner of Action group and the production workload, user group, pipeline, or database that depends on it?
Do you know which command is read-only and which command could change security, cost, artifact availability, alert routing, or failover behavior?
Have you captured the expected output fields before changing anything, including IDs, tags, digests, actions, receivers, metrics, status, or replica links?
Is there a rollback plan that another operator could execute without guessing from the portal or rebuilding context during an incident?

Questions to ask

What boundary does Action group control: artifact, identity, alert response, metric signal, or database recovery?
Who owns approvals for changing this setting, and who will be paged if the change breaks production?
What would prove the current state is wrong: a failed pull, broad credential, stale privilege, missed alert, high sessions, or failed failover drill?
Which output fields are safe to save, and which fields must be redacted because they expose credentials, endpoints, identities, or sensitive workload details?
Does the Microsoft Learn source still match the command surface and behavior in this tenant, region, SKU, and subscription?

Related terms

No related terms mapped yet.

Graph connections

Graph edges are queued for this term.

Learning paths

Learn next

Use related terms, graph links, command groups, and comparison cards to keep moving through Azure without losing context.

Open relationship graph