Databases Azure SQL performance and availability complete field-manual-complete template-specs-five-use-cases-three-case-studies

SQL read scale-out

SQL read scale-out lets an application send read-only database work to a replica instead of competing with writes on the primary database. It is useful when dashboards, reports, exports, or analytics queries are heavy enough to slow customer transactions. The application still talks to the same Azure SQL workload, but the connection string asks for a read-only path. It is not a magic copy of every tier; availability depends on the service tier and replica model, and data can lag slightly behind the primary.

Back to glossary browser Open Microsoft Learn source

Aliases: Azure SQL read scale-out, read-only replica routing, ApplicationIntent ReadOnly, SQL Database read replicas
Difficulty: intermediate
CLI mappings: 5
Last verified: 2026-05-25

Microsoft Learn

Microsoft Learn describes SQL read scale-out as a feature that routes read-only workloads to read-only replicas instead of the primary read-write replica. It is available for Premium, Business Critical, and certain Hyperscale configurations, using ApplicationIntent=ReadOnly to separate reporting or analytics reads from writes.

Microsoft Learn: Use read-only replicas to offload read-only query workloads2026-05-25

Technical context

In Azure SQL architecture, SQL read scale-out sits in the database connection and high-availability layer. Premium and Business Critical databases have local read-only replicas behind the service, while Hyperscale can use high availability replicas or named replicas for more isolated read patterns. Clients signal read intent through the connection string, usually with ApplicationIntent=ReadOnly. The database engine routes eligible sessions to a readable secondary, where DMVs, Query Store behavior, tempdb usage, and replica latency must be interpreted differently from the primary.

Why it matters

This term matters because read-heavy work often hides behind normal application traffic until a report, export, or dashboard blocks the primary workload. SQL read scale-out gives architects a way to separate read demand without immediately creating a separate analytics platform. That can protect checkout flows, API calls, and write-heavy business transactions from noisy reporting users. It also forces better thinking about consistency. A read-only replica may be seconds behind, so it fits operational dashboards and trend analysis better than workflows requiring immediate read-after-write certainty. Used well, it improves capacity use, user experience, and incident isolation without adding another database to maintain.

Where you see it

Signals, screens, and Azure surfaces where this term usually becomes operational.

Signal 01

In Azure CLI and database properties, where the readScale setting and service objective reveal whether read-only routing is enabled, supported, or unavailable for the database before rollout.

Signal 02

In application connection strings and app settings, ApplicationIntent=ReadOnly signals that eligible sessions should route to a readable secondary rather than the primary write replica after deployment.

Signal 03

In performance dashboards and Query Store reviews, primary CPU, IO, and query duration drop for transactions while reporting sessions appear as read-only workload activity during validation.

When this becomes relevant

Specific situations where this term helps solve real Azure design, operations, migration, security, reliability, cost, or governance problems.

Move executive dashboards and exports away from the primary database during peak order-entry or API write windows.
Let reporting tools query a Business Critical database without blocking customer transactions that must commit quickly.
Use Hyperscale replicas to isolate tenant analytics workloads that need more read capacity than one primary can provide.
Test whether read-only routing can delay a costly service-tier upgrade caused mainly by reporting demand.
Separate troubleshooting of slow reports from primary database incidents by proving which replica handled the workload.

Real-world case studies

Different enterprise-style examples that show the term being used to hit measurable objectives.

Case study 01

Marketplace protects checkout from dashboard traffic

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

A marketplace analytics team refreshed seller dashboards every fifteen minutes, and those queries began competing with checkout writes during holiday flash sales.

Business/Technical Objectives

Keep checkout P95 database time below 180 milliseconds during dashboard refreshes.
Avoid buying a larger primary tier for a workload that was read-only.
Keep dashboard freshness within five seconds of primary data for seller operations.
Prove the routing change before the holiday release freeze.

Solution Using SQL read scale-out

The platform team kept the order database in the Business Critical tier and separated dashboard connections from write-path API connections. Dashboard services were given connection strings with ApplicationIntent=ReadOnly, while checkout and inventory services continued using the primary read-write path. Engineers verified the route by checking database updateability from the dashboard host and comparing primary CPU before and after refresh windows. Azure CLI inventory captured the database SKU, readScale state, and resource ID for change approval. Query Store and Azure Monitor were reviewed daily during the first week to confirm that seller queries were no longer stealing capacity from checkout writes.

Results & Business Impact

Checkout P95 database time fell from 310 milliseconds to 142 milliseconds during flash-sale refresh windows.
Primary CPU peaks dropped by 27 percent without changing the primary service objective.
Seller dashboard freshness stayed between two and four seconds, within the agreed tolerance.
The team avoided an estimated 38 percent monthly compute increase for the holiday period.

Key Takeaway for Glossary Readers

SQL read scale-out is valuable when read-only traffic needs capacity isolation but can tolerate a small amount of replica lag.

Case study 02

Insurance actuaries run model extracts without blocking claims

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

An insurance carrier ran large actuarial extracts from the same database that claims adjusters used during regional storm events.

Business/Technical Objectives

Let actuaries run hourly portfolio extracts during active claims surges.
Reduce blocking complaints from adjusters working high-priority claims.
Keep regulatory data exports inside the existing Azure SQL security boundary.
Document whether replica freshness was acceptable for actuarial calculations.

Solution Using SQL read scale-out

Database engineers moved the actuarial extract service to a read-only connection path using SQL read scale-out on the Business Critical claims database. They kept adjuster applications on the primary and used separate database roles for the extract identity so it could read approved tables without broad administrative rights. The team tested representative extracts during a planned storm-readiness exercise, measured replica latency, and confirmed that actuarial calculations were not affected by a few seconds of delay. Azure CLI show commands were added to the evidence packet so each regional environment could be checked for the same service tier and readScale setting.

Results & Business Impact

Adjuster blocking tickets dropped from 31 in the prior storm drill to 4 in the next drill.
Hourly extract completion time improved from 42 minutes to 24 minutes.
No firewall exception or separate data copy was required for the actuarial team.
Replica freshness stayed under six seconds during the highest tested write burst.

Key Takeaway for Glossary Readers

The best read scale-out use cases are not generic reports; they are specific read-heavy jobs whose consistency tolerance is known before routing changes.

Case study 03

Travel platform isolates tenant analytics on Hyperscale replicas

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

A travel booking platform offered premium tenants near-real-time booking analytics, but one tenant repeatedly overloaded shared reporting capacity before Monday revenue reviews.

Business/Technical Objectives

Protect booking writes from tenant analytics spikes.
Give the largest tenant enough read capacity without penalizing smaller tenants.
Keep analytics access governed through existing Azure SQL identities.
Create a repeatable capacity model for future premium tenants.

Solution Using SQL read scale-out

The architecture team moved the booking database to a Hyperscale design that used read scale-out for normal reporting and a named replica for the largest tenant analytics workload. Standard BI jobs used ApplicationIntent=ReadOnly, while the premium tenant analytics service connected to the named replica with separately monitored compute. Operators used CLI output to record the database resource, replica names, service objectives, and tags, then added alerts for replica CPU and analytics latency. They also documented which dashboards could tolerate delayed reads and which operational screens had to remain on the primary. Operators also confirmed the dashboard identity had no write permissions.

Results & Business Impact

Primary booking-write latency during Monday reviews fell from 620 milliseconds to 190 milliseconds.
The premium tenant dashboard refreshed in 11 minutes instead of 36 minutes.
Smaller tenant reports stopped timing out during the largest tenant review window.
Chargeback reports assigned replica cost to the premium analytics product line.

Key Takeaway for Glossary Readers

In Hyperscale designs, SQL read scale-out can evolve from simple read routing into a deliberate tenant-isolation and cost-allocation strategy.

Why use Azure CLI for this?

With ten years of Azure engineering experience, I use Azure CLI for SQL read scale-out because connection behavior and service-tier eligibility are easy to misread in the portal. CLI lets me inventory database SKUs, readScale settings, replica counts, failover groups, and server names across subscriptions in repeatable JSON. That matters when a reporting outage is caused by one database left in General Purpose while another moved to Business Critical. CLI also fits change control: I can export current settings, compare environments, update read scale flags where supported, and hand application teams exact evidence for connection-string changes. It also creates a clean evidence trail for review boards and post-incident analysis. That is invaluable during review.

CLI use cases

Inventory database tiers and readScale settings across subscriptions before moving reporting traffic to read-only routing.
Export database resource IDs, SKUs, and failover group membership for a change ticket that updates application connection strings.
Compare read scale configuration in dev, test, and production to catch environment drift before a release.
Check recent database operations and metrics after enabling read scale-out or changing service tiers.
Collect read-only evidence for an incident review, including server, database, SKU, and diagnostic configuration.

Before you run CLI

Confirm the subscription, resource group, SQL server, database name, and whether the database tier actually supports read scale-out.
Use read-only show and list commands first; changing service tier or readScale settings can alter cost and connection behavior.
Know which application owners will change connection strings, because CLI cannot prove every client uses ApplicationIntent=ReadOnly.
Check permissions for SQL database read and update actions, and export JSON output for review before modifying production.
Confirm whether the workload needs immediate consistency; if it does, do not route it to a read-only replica.

What output tells you

The SKU and edition show whether the database is in a tier that can use local readable replicas.
The readScale setting tells you whether Azure SQL Database routing is enabled or disabled where the setting is configurable.
Resource IDs and tags identify the application owner, environment, cost center, and change-management scope.
Operation history can connect a routing problem to a recent scale change, failover group update, or database migration.
Metric output shows whether primary CPU or IO pressure improved after read-only traffic moved away.

Mapped Azure CLI commands

SQL read scale-out CLI evidence

direct

az sql db show --resource-group <resource-group> --server <sql-server> --name <database> --query "{name:name,edition:edition,sku:sku.name,readScale:readScale,status:status}" --output json

az sql dbdiscoverDatabases

az sql db list --resource-group <resource-group> --server <sql-server> --query "[].{name:name,sku:sku.name,readScale:readScale,status:status}" --output table

az sql dbdiscoverDatabases

az sql db update --resource-group <resource-group> --server <sql-server> --name <database> --read-scale Enabled --output json

az sql dbconfigureDatabases

az sql failover-group list --resource-group <resource-group> --server <sql-server> --output table

az sql failover-groupdiscoverDatabases

az monitor metrics list --resource <database-resource-id> --metric cpu_percent,dtu_consumption_percent --interval PT5M --output json

az monitor metricsdiscoverDatabases

Architecture context

Architecturally, SQL read scale-out is a workload isolation pattern inside the Azure SQL data platform. It is not the same as sharding, caching, or creating a separate warehouse. The write replica remains the system of record, and read-only sessions are routed to secondary capacity that is part of the service-tier design. I usually design it by classifying reads into immediate-consistency, near-real-time, and analytics groups. Only the second and third groups should use read intent. The pattern interacts with connection pooling, retry logic, Query Store on replicas, failover design, reporting tools, and alerting for replica lag. It is strongest when application owners can choose the read path per workload instead of applying one global connection string everywhere.

Security

Security impact is indirect but still important. SQL read scale-out does not grant new database permissions by itself; users and applications still authenticate and authorize through normal SQL or Microsoft Entra paths. The risk appears when teams create separate reporting identities, connection strings, or firewall rules for read-only traffic and accidentally over-grant access. Read-only does not mean non-sensitive: replicas contain the same data as the primary. Operators should keep least-privilege database roles, protect connection strings, audit read activity, and ensure BI tools using ApplicationIntent=ReadOnly do not bypass private networking, Conditional Access expectations, or data classification rules. That review should be part of every reporting-client onboarding checklist. Confirm least privilege during onboarding.

Cost

Cost impact depends on the tier. In Premium and Business Critical, read scale-out can use existing included secondary capacity, which may avoid buying a larger primary just to satisfy reports. In Hyperscale, adding named replicas or extra high availability replicas can create explicit additional cost, so the isolation benefit needs a budget owner. There is also operational cost: bad routing can create duplicate troubleshooting, stale reports, or unnecessary scale-ups. FinOps teams should compare avoided primary scale, replica cost, BI workload value, and idle reporting patterns before treating read scale-out as free capacity. Document whether future report growth will require a different scaling pattern. Compare cache, replica, and analytics alternatives before scaling.

Reliability

Reliability impact is practical because heavy reads can destabilize write workloads if they all hit the primary database. SQL read scale-out can reduce that blast radius by moving suitable reads to secondary capacity. It also introduces replica-awareness. Data propagation latency can vary, a readable replica can be unavailable, and failover can reconnect a session to a replica at a different point in time. Reliable design uses retry logic, clear consistency expectations, monitoring for lag-sensitive workflows, and fallback rules for critical reports. Operators should test failover behavior and confirm that read-only connections never run required write operations. Test both normal routing and failover routing before declaring the pattern production-ready. Name rollback owners before rollout.

Performance

Performance is the main reason to use SQL read scale-out, but the improvement is workload-specific. Long reports, exports, and analytical reads can run away from write-heavy primary traffic, reducing CPU, IO, blocking, and query pressure on the primary. The read replica still has finite compute, and large queries can create their own waits, tempdb pressure, or version cleanup side effects. Data can also be slightly stale. Teams should measure primary latency, replica query duration, resource use, and user-visible report freshness before and after routing. A read-only path helps only when the workload is truly read-only and consistency tolerance is understood. Review waits and query plans if the readable replica becomes the new bottleneck.

Operations

Operators deal with SQL read scale-out during performance triage, release reviews, reporting onboarding, and service-tier changes. They inspect database SKU, read scale setting, replica availability, failover group design, and connection strings used by applications or BI tools. They also compare primary workload metrics with read-only query behavior, because a slow report on a replica should not automatically trigger primary database scaling. Runbooks should explain how to verify a session is connected to a read-only replica, how to identify replica lag symptoms, and when to send a workload back to the primary for consistency reasons. Without that evidence, teams cannot tell whether the change helped. Add validation evidence to release notes.

Common mistakes

Assuming ApplicationIntent=ReadOnly works on every Azure SQL tier, including tiers without readable secondary capacity.
Routing workflows that require immediate read-after-write consistency to a replica and then treating normal lag as data loss.
Forgetting that reporting identities still need least-privilege permissions and can expose sensitive data through BI tools.
Scaling the primary database for a slow report without checking whether the report is actually using the read-only path.
Ignoring failover and retry behavior, which can reconnect applications to a different replica state than expected.

Operator quick checks

Verify the database edition, service objective, and readScale value before changing application connection strings.
Confirm the client connection string includes ApplicationIntent=ReadOnly only for workloads that never write.
Run a database updateability check from the client host to prove the session reached a read-only replica.
Compare primary CPU, waits, and report latency before and after routing changes.
Document acceptable data freshness for every report or dashboard using read scale-out.

Questions to ask

Which reports or queries are safe to run against a replica that may lag behind the primary?
Who owns the connection-string change, and how will they prove the workload is read-only?
What user-facing metric should improve if read scale-out is working correctly?
What happens during failover if the reporting connection is dropped or routed to another replica?
Would a cache, warehouse, named replica, or service-tier change be a better isolation pattern?

Related terms

No related terms mapped yet.

Graph connections

Graph edges are queued for this term.

Learn next

Use related terms, graph links, command groups, and comparison cards to keep moving through Azure without losing context.

Open relationship graph