Databases Azure SQL cost and compute complete field-manual-complete template-specs-five-use-cases-three-case-studies

SQL serverless auto-pause

SQL serverless auto-pause is the feature that lets an Azure SQL Database stop charging for compute after it sits idle long enough. It is best for databases that are used in bursts, such as development tools, training labs, demos, seasonal apps, or internal systems with long quiet periods. When activity returns, the database resumes automatically, but users may feel a warm-up delay. It is not a fit for workloads that need instant response all day or background jobs that keep waking the database.

Back to glossary browser Open Microsoft Learn source

Aliases: Azure SQL auto-pause, serverless database auto-pause, auto-pause delay, SQL Database serverless pause
Difficulty: intermediate
CLI mappings: 5
Last verified: 2026-05-25

Microsoft Learn

Microsoft Learn describes Azure SQL Database serverless auto-pause as a serverless compute behavior that pauses an inactive database after a configured delay. While paused, compute billing stops and only storage is billed; the database automatically resumes when logins or other activity return, where supported.

Microsoft Learn: Serverless compute tier for Azure SQL Database2026-05-25

Technical context

In Azure architecture, SQL serverless auto-pause belongs to the Azure SQL Database serverless compute model. The database has a configured vCore range and an auto-pause delay. When eligible inactivity conditions are met, compute pauses and storage remains allocated. The next login, query, or supported activity resumes compute. Auto-pause is configured at the database level, interacts with service tier support, monitoring, connection pooling, scheduled jobs, alerts, and application retry behavior. It is a compute-cost control, not a backup, retention, or database deletion mechanism.

Why it matters

This term matters because idle database compute is one of the easiest costs to miss in dev, test, demo, and intermittent production systems. Serverless auto-pause can turn quiet hours into real savings, but it changes the user experience. The first connection after a pause may wait for resume, and background monitoring, health checks, or jobs can prevent pausing if they keep the database active. Teams need to decide whether the savings are worth the resume delay. When configured thoughtfully, auto-pause lets small teams keep real databases available without paying continuously for compute they rarely use. Those decisions should be visible in cost reviews and support runbooks. Measure resumed-user experience, too.

Where you see it

Signals, screens, and Azure surfaces where this term usually becomes operational.

Signal 01

In Azure SQL database compute settings, where serverless databases show compute model, min vCores, max vCores, configured capacity range, and auto-pause delay minutes for cost reviews.

Signal 02

In Azure CLI output from az sql db show, where computeModel, autoPauseDelay, minCapacity, maxSizeBytes, sku, and status confirm whether auto-pause applies during release pipeline audits.

Signal 03

In cost and metric reports, where compute charges stop during paused periods while storage charges continue and first-request latency may increase after resume events during operations reviews.

When this becomes relevant

Specific situations where this term helps solve real Azure design, operations, migration, security, reliability, cost, or governance problems.

Reduce dev and test database compute spend when teams use realistic databases only during business hours.
Run training labs and demos that must remain available by name but do not need continuous compute.
Support internal tools with unpredictable usage while accepting a documented first-request warm-up delay.
Identify background jobs or health probes that prevent serverless databases from pausing and erase expected savings.
Standardize auto-pause delay values by environment so production, staging, and sandbox databases behave intentionally.

Real-world case studies

Different enterprise-style examples that show the term being used to hit measurable objectives.

Case study 01

Training provider cuts lab database waste

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

A cloud training provider kept hundreds of hands-on lab databases available between classes, even though most students used them only during scheduled workshops.

Business/Technical Objectives

Reduce idle compute cost without deleting student lab databases.
Keep lab start instructions simple by preserving stable database names.
Limit morning cold-start delays to an acceptable instructor window.
Give FinOps proof that auto-pause produced real savings.

Solution Using SQL serverless auto-pause

The platform team moved lab databases to the Azure SQL serverless compute tier and configured SQL serverless auto-pause with a delay long enough to survive short class breaks but short enough to pause overnight. Instructors ran a warm-up script thirty minutes before class to resume expected databases. Azure CLI listed computeModel, autoPauseDelay, minCapacity, and tags for every lab database, while cost reports compared paused hours by cohort. Health probes that touched each database every ten minutes were removed because they prevented inactivity. Finance reviewed paused hours monthly, not just the configured setting. The support desk received the approved wake-time guidance.

Results & Business Impact

Monthly lab compute spend dropped by 61 percent while storage costs remained predictable.
Average class startup delay stayed under nine minutes after the instructor warm-up script.
Paused-hour reports identified 37 databases that should have been deleted after old cohorts.
Support tickets for missing lab databases did not increase after the change.

Key Takeaway for Glossary Readers

SQL serverless auto-pause works best when idle periods are real, measurable, and paired with a simple warm-up plan.

Case study 02

Architecture team fixes slow mornings for an internal tool

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

An internal design-review application used an auto-paused database, and employees complained that the first morning submission often failed with a timeout.

Business/Technical Objectives

Keep the cost savings from overnight auto-pause.
Stop first-user failures during the morning review window.
Avoid moving the tool back to always-on provisioned compute unnecessarily.
Identify whether connection timeout, retry, or capacity settings caused the issue.

Solution Using SQL serverless auto-pause

Operators inspected the database with Azure CLI and found a short auto-pause delay, low minimum capacity, and an application connection timeout that was shorter than typical resume time. They increased the auto-pause delay to avoid pausing during the lunch window, adjusted minimum capacity for the first burst, and updated application retry policy so the first connection after idle time retried cleanly. A scheduled warm-up call was added before the daily review meeting. Metrics were tracked for resume time, first-request failures, and active compute cost. Operators also captured the current setting before every scheduled change window. Change evidence was saved in the runbook.

Results & Business Impact

Morning first-submit failures fell from 18 per week to 1 per week.
The database still paused for an average of 11 overnight hours.
Monthly compute cost increased only 8 percent compared with the broken auto-pause setup.
The team avoided a 46 percent cost increase from returning to provisioned compute.

Key Takeaway for Glossary Readers

Auto-pause is not a binary cost switch; delay, capacity, and retry settings decide whether users experience savings or frustration.

Case study 03

Nonprofit keeps seasonal campaign database affordable

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

A nonprofit used a donation-matching application heavily during quarterly campaigns, but the database sat almost unused for weeks between events.

Business/Technical Objectives

Keep the campaign database available for donor support between events.
Avoid paying full compute cost during long inactive periods.
Ensure campaign launch traffic did not hit an unexpected cold-start problem.
Create a repeatable checklist for future campaigns.

Solution Using SQL serverless auto-pause

The nonprofit moved the database to serverless compute and enabled SQL serverless auto-pause outside campaign windows. During inactive periods, the delay allowed occasional support lookups but paused compute overnight and on quiet days. One week before each campaign, automation disabled auto-pause and increased capacity so launch traffic stayed warm. After the campaign, Azure CLI restored the lower serverless settings and exported proof of the database configuration. Donor-support scripts were updated to expect a possible resume delay outside campaign periods. The owner tag was required before the database could keep the exception. Finance reviewed the exception during monthly chargeback.

Results & Business Impact

Between campaigns, compute charges dropped by 73 percent compared with the previous provisioned tier.
Campaign launch checkout latency stayed under the 250-millisecond database target after auto-pause was disabled.
Support users accepted a documented resume delay during quiet periods.
The checklist was reused for three later campaigns without emergency database changes.

Key Takeaway for Glossary Readers

For seasonal systems, SQL serverless auto-pause can be part of a calendar-aware operating model rather than a permanent setting.

Why use Azure CLI for this?

With ten years of Azure engineering experience, I use Azure CLI for SQL serverless auto-pause because cost controls need fleet visibility. A portal screen can show one database delay; CLI can find every serverless database, min and max vCores, autoPauseDelay, SKU, tags, and recent operations across environments. That is how I catch dev databases that never pause, production databases accidentally set to pause, and health probes that defeat the savings plan. CLI also fits automation: I can set standard delays for lab databases, disable auto-pause for critical workloads, and export before-and-after evidence for FinOps. It also catches databases that were copied from production with unsuitable defaults. Across subscriptions, drift becomes visible.

CLI use cases

List serverless databases and auto-pause delay settings across a subscription for FinOps review.
Update a dev database to use serverless compute with approved min capacity, max capacity, and auto-pause delay.
Disable auto-pause on a workload that proved unable to tolerate resume latency.
Export metrics and current database settings after a user reports slow first access in the morning.
Compare tags and compute model settings to ensure only approved workload classes use auto-pause.

Before you run CLI

Confirm the database is eligible for serverless auto-pause and that the service tier supports the behavior you expect.
Check subscription, resource group, server, database, environment tag, owner, and production criticality before changing delay values.
Understand cost and reliability risk: changing compute model or auto-pause settings can affect billing and first-request latency.
Use secure output handling because database names and tags may reveal sensitive application or environment information.
Coordinate with application owners to verify retries, connection timeouts, jobs, and monitoring probes before enabling auto-pause.

What output tells you

computeModel indicates whether the database is serverless or provisioned, which determines whether auto-pause can apply.
autoPauseDelay shows the inactivity period before compute pauses, or whether pausing is disabled.
minCapacity and capacity describe the active compute range and likely cost floor while the database is not paused.
Status and operation history help distinguish an intentional pause from scaling, update, or connectivity problems.
Metrics and cost data show whether the database actually spends meaningful time paused between active bursts.

Mapped Azure CLI commands

SQL serverless auto-pause CLI evidence

direct

az sql db show --resource-group <resource-group> --server <sql-server> --name <database> --query "{name:name,computeModel:computeModel,autoPauseDelay:autoPauseDelay,minCapacity:minCapacity,capacity:capacity,status:status}" --output json

az sql dbdiscoverDatabases

az sql db update --resource-group <resource-group> --server <sql-server> --name <database> --edition GeneralPurpose --compute-model Serverless --family Gen5 --min-capacity 1 --capacity 4 --auto-pause-delay 1440 --output json

az sql dbconfigureDatabases

az sql db update --resource-group <resource-group> --server <sql-server> --name <database> --auto-pause-delay -1 --output json

az sql dbconfigureDatabases

az monitor metrics list --resource <database-resource-id> --metric cpu_percent --interval PT15M --output json

az monitor metricsdiscoverDatabases

az sql db list --resource-group <resource-group> --server <sql-server> --query "[].{name:name,computeModel:computeModel,autoPauseDelay:autoPauseDelay,sku:sku.name,tags:tags}" --output table

az sql dbdiscoverDatabases

Architecture context

Architecturally, SQL serverless auto-pause is a workload classification decision. I treat it as appropriate for intermittent databases where cost savings matter more than first-request latency. It should not be enabled blindly on customer-facing systems with strict availability expectations, queue processors that expect instant database access, or services with constant telemetry pings. The design includes min and max vCores, auto-pause delay, retry policy, connection pooling, monitoring rules, and user communication. In landing zones, I usually use policy, tags, and pipeline defaults to distinguish dev/test serverless databases from always-on production databases. The architecture also needs a way to prove whether savings actually happen, because one chatty health check can keep compute awake indefinitely.

Security

Security impact is indirect. Auto-pause does not change database permissions, encryption, firewall rules, private endpoints, or authentication. The risk comes from operational shortcuts around paused databases. Teams may create broad firewall rules or admin accounts while troubleshooting resume delays, or they may disable monitoring because a paused database looks quiet. Paused does not mean deleted or declassified; stored data, backups, secrets, and access paths still matter. Operators should keep normal security controls in place, monitor configuration changes, and ensure that resume activity from unknown clients is investigated like any other database connection pattern. Treat pause troubleshooting as operational diagnostics, not permission relaxation. Keep identity reviews separate from compute-state troubleshooting.

Cost

Cost impact is direct. When the database is paused, compute cost goes to zero and storage cost continues. Savings depend on idle duration, configured minimum capacity, resume frequency, and whether background activity prevents pausing. A database that wakes every few minutes may save little while still frustrating users. A lab or demo database used a few hours per week can save a lot. FinOps teams should measure actual paused hours, not just whether auto-pause is enabled. Tags should identify workload criticality, owner, and expected schedule so teams can tune delay values instead of applying one default everywhere. Track paused hours, not just configured delay, when claiming savings. Use measured pause hours.

Reliability

Reliability impact is significant for user experience. A paused database is healthy, but it is not instantly ready. The first request after inactivity can see resume latency, connection timeouts, or retry storms if the application is not designed for warm-up. Scheduled jobs can also fail if they expect immediate access. Reliable use of auto-pause requires appropriate delay settings, client retry logic, longer first-connection timeouts, clear health-check design, and alerts that distinguish paused state from an outage. Critical workloads that cannot tolerate warm-up should disable auto-pause or use provisioned compute instead. Support teams should know the expected wake time before declaring downtime. Document who can override pause behavior during business events.

Performance

Performance impact is mostly about warm-up and scaling behavior. During active periods, serverless compute can scale within the configured vCore range, but after auto-pause the first request must wait for resume. That delay can look like poor application performance unless users and retries are prepared. Auto-pause also interacts with connection pooling: stale pooled connections may fail and need to reconnect cleanly. Once resumed, query performance still depends on compute range, indexes, workload shape, and storage. Teams should measure cold-start time, active query latency, and whether minimum capacity is too low for the first burst after resume. Warm-up automation should be deliberate, measured, and tied to business schedules. Validate with realistic concurrency.

Operations

Operators work with SQL serverless auto-pause during cost reviews, environment provisioning, performance complaints, and application onboarding. They inspect compute model, min and max capacity, auto-pause delay, current status, and metrics that show whether the database actually paused. They also identify keep-alive patterns from agents, jobs, dashboards, or probes that wake the database. Runbooks should tell help desk teams that a first query delay after idle time may be expected, while repeated resumes or failed connections need investigation. Change records should capture why a database can or cannot pause. That evidence separates expected savings behavior from misconfiguration or user impact. Capture warm-up timing, application retry behavior, and owner acceptance after tests. Include service-desk messaging for planned wake-up latency.

Common mistakes

Enabling auto-pause on a user-facing workload that cannot tolerate cold-start or resume delay.
Leaving monitoring probes, scheduled jobs, or connection pools configured so they constantly wake the database.
Assuming paused means free, while storage, backups, logs, and related resources continue to cost money.
Setting minimum capacity too low and blaming auto-pause for slow performance after the database resumes.
Using one auto-pause delay for every environment instead of matching delay to usage and reliability expectations.

Operator quick checks

Confirm the database compute model is serverless and the auto-pause delay is not disabled unintentionally.
Review recent metrics or billing data to see whether compute actually paused during expected idle windows.
Identify health checks, jobs, dashboards, or agents that may keep the database active all night.
Test first-request behavior after a pause from the same network path real users or services use.
Document which workloads can accept resume delay and which must remain provisioned or always active.

Questions to ask

How long can users or jobs wait for the database to resume after idle time?
What background activity might prevent this database from ever becoming inactive?
Are savings measured by actual paused hours or assumed from configuration alone?
Who approves disabling auto-pause if resume latency creates production impact?
Should this workload use serverless auto-pause, provisioned compute, or an elastic pool instead?

Related terms

No related terms mapped yet.

Graph connections

Graph edges are queued for this term.

Learn next

Use related terms, graph links, command groups, and comparison cards to keep moving through Azure without losing context.

Open relationship graph