Databases Cache and in-memory data verified

Redis cache SKU

A Redis cache SKU is the size and capability package you buy for the cache. It answers questions like how much memory the cache has, what performance level it can provide, which high-availability or scaling features are available, and how much it will cost. For Azure teams, SKU is not just a billing label. It is a design decision that affects latency, resilience, migration options, monitoring expectations, and whether the cache can support the application when traffic grows.

Aliases
Redis SKU, cache SKU, Redis size
Difficulty
fundamentals
CLI mappings
4
Last verified
2026-05-21

Microsoft Learn

A Redis cache SKU is the purchasable resource shape for a Redis cache on Azure. It combines service family, tier, memory size, compute capability, and feature availability so teams can choose the right balance of performance, resilience, and cost for a workload.

Microsoft Learn: Scale an Azure Managed Redis instance2026-05-21

Technical context

In Azure architecture, the Redis cache SKU sits at the resource planning layer for the cache service. Azure Cache for Redis historically used Basic, Standard, Premium, Enterprise, and Enterprise Flash style choices, while Azure Managed Redis uses memory sizes and performance tiers such as Balanced, Memory Optimized, Compute Optimized, and Flash Optimized. The selected SKU influences capacity, high availability, persistence, geo-replication, clustering behavior, network options, and cost model. Operators see it in ARM properties, portal Overview, CLI output, cost reports, and capacity reviews.

Why it matters

Redis cache SKU matters because many cache failures are really sizing and feature-selection failures. A development SKU may work during testing but lack the availability, memory, throughput, persistence, or network posture needed in production. An oversized SKU can quietly waste budget every hour. A SKU that cannot support required migration, security, or recovery features can force a redesign late in the project. Architects use SKU selection to align workload criticality with capability; operators use it to explain limits; FinOps teams use it to challenge spend. The right SKU gives the cache enough headroom without hiding waste. That discipline prevents both fragile launches and quiet waste.

Where you see it

Signals, screens, and Azure surfaces where this term usually becomes operational.

Signal 01

The Azure portal Overview blade shows cache SKU, size, region, provisioning state, and pricing-relevant configuration during design review or support triage for the resource owner.

Signal 02

ARM, Bicep, Terraform, and CLI output expose SKU values that determine repeatable deployment behavior across dev, test, staging, and production environments during release validation cycles.

Signal 03

Cost Management and Azure Monitor reveal whether the selected SKU matches actual memory, throughput, availability, and latency patterns after production traffic arrives for a workload.

When this becomes relevant

Specific situations where this term helps solve real Azure design, operations, migration, security, reliability, cost, or governance problems.

  • Choose a production Redis shape that supports required memory, performance, high availability, and security controls before launch.
  • Right-size an overbuilt cache after metrics show low memory use, light operations, and no business need for the current SKU.
  • Compare Azure Cache for Redis and Azure Managed Redis SKU models during migration planning and retirement-risk review.
  • Explain why a nonproduction cache should use a smaller or non-HA option instead of mirroring production cost.
  • Investigate latency, evictions, or timeouts where the selected SKU is below the application’s actual peak workload.

Real-world case studies

Different enterprise-style examples that show the term being used to hit measurable objectives.

Case study 01

Citizen services portal replaces guesswork with SKU evidence

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

A city government portal cached permit status, appointment slots, and multilingual content. The team had copied a small development Redis SKU into production and saw timeouts during tax-season traffic.

Business/Technical Objectives
  • Select a Redis SKU based on measured peak traffic instead of habit.
  • Keep portal reads responsive during public filing deadlines.
  • Reduce emergency database scaling caused by cache evictions.
  • Document the cost and reliability tradeoff for procurement review.
Solution Using Redis cache SKU

The platform team used Azure CLI to inventory the existing cache, capture SKU and capacity, and export metrics for used memory, server load, operations, and evictions. They tested two larger Redis cache SKU options in a staging subscription with production-like traffic replay. The final design used a production-appropriate SKU with private access, TLS, diagnostic settings, and alerts for memory pressure. The application still treated the permit database as authoritative, but Redis carried high-volume reads for content and recent appointment availability. Procurement received a comparison showing the higher Redis SKU cost versus the database and incident labor it avoided.

Results & Business Impact
  • P95 portal lookup latency improved from 410 milliseconds to 74 milliseconds during the filing window.
  • Database read spikes fell by 42 percent when cache evictions stopped during peak traffic.
  • The annual SKU decision was documented with metrics instead of a portal screenshot.
  • The city avoided two planned emergency database scale-ups during the busiest month.
Key Takeaway for Glossary Readers

A Redis cache SKU should be selected from workload evidence, not copied from the first environment that happened to work.

Case study 02

B2B analytics vendor maps legacy caches to Managed Redis SKUs

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

A SaaS analytics vendor prepared to migrate several Azure Cache for Redis instances to Azure Managed Redis. Product teams used inconsistent SKU names, and nobody could explain which caches were mission-critical.

Business/Technical Objectives
  • Create a migration inventory showing current SKU, memory, and feature use.
  • Map each workload to a Managed Redis size and performance tier.
  • Avoid paying for production-grade capability in sandbox tenants.
  • Complete migration testing without changing application cache semantics.
Solution Using Redis cache SKU

The cloud engineering team scripted Azure CLI discovery across subscriptions and grouped caches by environment, memory pressure, operations per second, private access, and persistence requirements. For each workload, architects reviewed whether the cache held user sessions, report fragments, or disposable dashboard hints. Critical customer-report caches moved to a Managed Redis SKU with enough memory and performance tier headroom, while sandbox caches used smaller nonproduction choices. Key Vault references and connection settings were rotated during migration testing. Azure Monitor dashboards compared cache hit ratio and backend query load before and after each cutover.

Results & Business Impact
  • The inventory found 18 stale caches and removed about 11 percent of monthly Redis spend.
  • Migration test cycles dropped from six weeks to three weeks because SKU mapping was repeatable.
  • No customer-report cache exceeded 70 percent used memory after cutover.
  • Support tickets about unexplained cache tier differences fell sharply after the decision table was published.
Key Takeaway for Glossary Readers

SKU mapping is a migration design task, especially when older Redis service families and newer Managed Redis choices use different naming models.

Case study 03

Route optimization platform right-sizes an overbuilt cache estate

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

A logistics optimization platform used Redis to cache route fragments and depot constraints. Several large caches stayed online after a proof-of-concept ended, quietly consuming budget.

Business/Technical Objectives
  • Identify Redis SKUs that were oversized for their current traffic.
  • Protect peak planning jobs from latency regression while lowering monthly spend.
  • Keep production and test environments clearly separated.
  • Create a repeatable monthly review for cache cost ownership.
Solution Using Redis cache SKU

FinOps and platform engineering joined forces to export Redis cache SKU, tags, region, and metric data through CLI. They reviewed used memory, operations, hit ratio, and server load for each cache over a full planning cycle. Two production caches stayed on their current SKU because route planning had short, intense bursts. Three test caches were resized to smaller SKUs, and four abandoned proof-of-concept resources were deleted after owner approval. Dashboards now show cache cost beside performance indicators, so teams can see whether the SKU is justified. IaC templates were updated to prevent the portal changes from drifting back.

Results & Business Impact
  • Monthly Redis spend dropped by 23 percent without increasing planning-job runtime.
  • All production route caches stayed below the agreed latency budget during peak planning windows.
  • Stale proof-of-concept resources were removed from two subscriptions.
  • Monthly SKU review became a standard FinOps control with named owners.
Key Takeaway for Glossary Readers

The right Redis SKU is not always larger; sometimes the best engineering decision is to prove a smaller SKU is enough.

Why use Azure CLI for this?

Azure CLI is useful for Redis cache SKU work because SKU mistakes repeat across environments when infrastructure is copied. After many production reviews, I prefer CLI for inventory, drift checks, and evidence: list caches, show SKU and capacity, compare resource groups, and export JSON for cost or architecture review. The portal is fine for one cache, but CLI finds every cache a team owns. It also helps validate that deployment automation created the intended SKU and that a migration target uses the planned Managed Redis size and performance tier before application teams cut over. That repeatable proof matters when cost owners challenge the cache bill.

CLI use cases

  • List Redis caches and identify their SKU, capacity, region, ownership tags, and service family.
  • Show a cache before resizing so the change record captures the current SKU and provisioning state.
  • Create a test cache with the intended SKU to validate performance and client settings before production.
  • Compare Azure Cache for Redis and Azure Managed Redis resources during migration discovery.
  • Export SKU evidence for FinOps review, rightsizing, or decommissioning stale environments.

Before you run CLI

  • Confirm tenant, subscription, resource group, cache name, region, and whether you are using az redis or az redisenterprise commands.
  • Use read-only list and show commands before running create, scale, delete, key, firewall, or force-reboot operations.
  • Check provider registration, permissions, naming policy, tags, and output format before automating SKU inventory across subscriptions.
  • Estimate hourly cost, replica behavior, persistence, network, and migration implications before creating or resizing a SKU.
  • Validate that the SKU is available in the target region and matches the documented workload criticality.

What output tells you

  • SKU name, capacity, and family show the purchased capability envelope for memory, performance, feature availability, and cost.
  • Provisioning state shows whether the cache is ready, updating, failed, or still unsafe for production validation.
  • Location and resource ID connect the SKU decision to a subscription, region, resource group, and cost owner.
  • Network and authentication fields show whether the chosen cache is reachable and secure enough for the intended workload.
  • Metric output reveals whether the SKU is overbuilt, under pressure, or appropriate for current production usage.

Mapped Azure CLI commands

Redis SKU inventory and selection

direct
az redis list --resource-group <resource-group>
az redisdiscoverDatabases
az redis show --name <cache-name> --resource-group <resource-group>
az redisdiscoverDatabases
az redis create --name <cache-name> --resource-group <resource-group> --location <region> --sku Standard --vm-size C1
az redisprovisionDatabases
az redisenterprise list --resource-group <resource-group>
az redisenterprisediscoverDatabases

Architecture context

An Azure architect treats Redis SKU selection like choosing a database service objective. I start with workload purpose: session store, hot-read cache, rate limiter, semantic cache, leaderboard, or messaging-style buffer. Then I compare memory size, expected operations, high availability, persistence, network isolation, authentication, regional strategy, and migration plans. The SKU should match both the steady state and the incident state, because cache misses can punish the backend during failover or maintenance. For 2026 designs, I also check whether Azure Managed Redis is the better target because it changes the SKU model and supported capabilities. Every exception should name the business risk, expected traffic pattern, and rollback owner.

Security

Security impact is partly indirect because SKU does not grant access by itself. The SKU still matters because different Redis service families and tiers can expose different options for private access, TLS posture, authentication, persistence, and operational controls. A team that chooses a smaller or older SKU for cost reasons may lose features needed for regulated workloads or migration. Security review should confirm network isolation, Microsoft Entra or key-based authentication choices, secret storage, diagnostic export, and data retention expectations. SKU decisions should not be made only by application teams without platform and compliance review. Include SKU assumptions in threat models and exception reviews.

Cost

Cost impact is direct because SKU is one of the strongest billing drivers for Redis on Azure. Higher memory, performance, replicas, clustering, persistence, geo-replication, and enterprise capabilities increase hourly cost. The cheapest SKU can still be expensive if it causes database scale-out, outages, emergency engineering, or failed migrations. FinOps should compare cache spend with avoided backend load, latency targets, and business criticality. Right-sizing should review used memory, operations, server load, hit ratio, and growth trend. Nonproduction caches deserve schedules, tags, and smaller SKUs so test environments do not become permanent production-sized bills. Budget alerts should be tied to the resource owner.

Reliability

Reliability impact is direct because the SKU determines much of the cache capability envelope. Some choices provide high availability, replicas, zone redundancy, persistence, geo-replication, or better scaling options; others are appropriate only for development or low-risk workloads. A cache SKU that is too small can cause evictions, timeouts, reconnect storms, and backend overload. A production SKU should be selected from measured memory growth, server load, peak traffic, failover tolerance, and recovery objectives. Operators should document which failure modes the SKU covers and which ones the application must absorb through fallback logic. This prevents teams from treating availability as an afterthought.

Performance

Performance impact is direct because the SKU defines the practical ceiling for memory, throughput, CPU headroom, connection volume, and supported scale patterns. A cache can have excellent code and still perform badly when the SKU cannot absorb peak operations or large values. Larger or more capable SKUs may reduce latency, but only if key design, value size, connection pooling, and backend fallback are also healthy. Operators should test with realistic payloads and traffic bursts rather than relying on SKU names. Performance reviews should include cache hit ratio, server load, evictions, client timeouts, and database pressure. Baselines should be refreshed after every meaningful SKU change.

Operations

Operators use Redis cache SKU as a starting point for every support conversation. It tells them which commands, scaling options, metrics, and failure patterns are realistic. During incidents they check SKU, region, capacity, provisioning state, private endpoint, authentication mode, and recent scaling changes before blaming application code. During planning they compare current metrics with SKU limits and migration guidance. SKU reviews also support tagging, cost ownership, upgrade planning, and cleanup of forgotten caches. A strong operations practice keeps SKU selection documented in the architecture decision record, not only in the portal. Support notes should also name the approved resize path.

Common mistakes

  • Choosing the cheapest SKU for production and discovering too late that availability or scale features are missing.
  • Keeping a large temporary SKU after load testing, migration, or a one-time campaign has ended.
  • Comparing SKU names without checking the different models used by Azure Cache for Redis and Azure Managed Redis.
  • Resizing from the portal without updating IaC, runbooks, cost forecasts, and monitoring thresholds.
  • Ignoring regional availability or feature differences until a disaster recovery or migration exercise fails.