Analytics Data integration compute premium

Azure integration runtime

Azure integration runtime is the compute bridge that Data Factory and Synapse pipelines use to move data, dispatch activities, and reach cloud or private data stores. In plain English, it gives teams clear separation between pipeline logic and the compute path that actually moves or accesses data across. You usually see it when data pipelines need cloud copy, self-hosted access to private sources, managed virtual network isolation, or controlled data movement. It still needs ownership, monitoring, and change control. The practical question is whether it lets operators deploy, inspect, govern, or troubleshoot the workload clearly.

Aliases
Integration Runtime, IR, ADF integration runtime
Difficulty
intermediate
CLI mappings
5
Last verified
2026-05-11

Microsoft Learn

Azure integration runtime is the compute infrastructure used by Azure Data Factory and Synapse pipelines to move data, dispatch activities, and connect across network environments. Microsoft Learn places it in Integration runtime in Azure Data Factory and Synapse; operators confirm scope, configuration, dependencies, and production impact.

Microsoft Learn: Integration runtime in Azure Data Factory and Synapse2026-05-11

Technical context

Technically, Azure integration runtime is configured through integration runtime type, region, data integration units, and self-hosted nodes. Operators verify it with datafactory integration-runtime CLI output, monitoring data, node health, and activity run logs. It integrates with Azure Data Factory, Azure Synapse pipelines, Azure Relay, and Key Vault. Key settings include IR type, region, DIUs, and self-hosted node count. Capture desired state, compare it to live Azure state, and keep evidence for releases, incidents, and audits. These details matter because control-plane mistakes can affect many resources quickly.

Why it matters

Azure integration runtime matters because it turns a broad platform capability into something teams can design, review, and operate. Without a clear understanding of it, teams often make weak assumptions about ownership, limits, dependencies, or failure behavior. Used well, it helps architects choose the right boundary, gives operators observable signals, and gives security and finance teams evidence they can review. The value is not the label alone; the value is the repeatable operating model around it. For governed enterprise data movement, that operating model reduces surprises during releases, audits, incidents, and scale events. That clarity keeps small design choices from becoming hidden production risks.

Where you see it

Signals, screens, and Azure surfaces where this term usually becomes operational.

Signal 01

You see Azure integration runtime in Data Factory manage pages, integration runtime blades, self-hosted node status, linked service, where engineers confirm the design matches current resource state.

Signal 02

You see Azure integration runtime in pipeline support runbooks where operators diagnose stuck copy activities, node outages, credential, where operators connect evidence to ownership, recent changes, and incident response.

Signal 03

You see Azure integration runtime in architecture reviews covering data movement regions, private network paths, managed virtual networks, where architects, security, operations, and finance teams keep one shared decision record.

When this becomes relevant

Specific situations where this term helps solve real Azure design, operations, migration, security, reliability, cost, or governance problems.

  • Use Azure integration runtime for governed enterprise data movement when the workload needs repeatable governance.
  • Use it during production readiness reviews to confirm configuration, owners, and evidence.
  • Use it in incident response when operators need a shared technical reference.
  • Use it in automation when portal-only steps would create drift.

Real-world case studies

Different enterprise-style examples that show the term being used to hit measurable objectives.

Case study 01

Hospital private data copy

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

Westhaven Medical Network, a healthcare organization, needed nightly data copies from on-premises clinical systems into Azure without exposing private databases to the internet.

Business/Technical Objectives
  • Move 2.5 TB of clinical extracts nightly.
  • Keep data movement within approved network paths.
  • Alert on self-hosted node failure.
  • Reduce failed copy reruns by 40 percent.
Solution Using Azure integration runtime

Data engineers deployed a self-hosted Azure integration runtime on managed Windows servers inside the hospital network. Linked services used Key Vault-backed credentials, and Data Factory copy activities referenced the self-hosted IR for private sources. Two nodes provided resilience, while monitoring data tracked queue time, throughput, and node status. Firewall rules allowed required outbound connectivity only. Operators used CLI checks to verify IR nodes and monitoring output before every clinical data release. The team also assigned named owners, saved acceptance evidence, and reviewed rollout notes with support staff responsible for hospital private data copy. A final readiness check compared design assumptions, operator permissions, monitoring signals, and rollback steps before the production milestone.

Results & Business Impact
  • Nightly copies completed within the 4-hour operating window.
  • No source database required public exposure.
  • Node failure alerts triggered within 7 minutes during testing.
  • Failed copy reruns dropped by 52 percent.
Key Takeaway for Glossary Readers

Azure integration runtime is the control point that makes private data movement observable and governable.

Case study 02

Retail analytics cloud copy scale

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

RiverLane Outfitters, a retail organization, had slow cloud-to-cloud copy jobs that delayed daily sales dashboards for merchandising teams.

Business/Technical Objectives
  • Shorten daily sales load time from 3 hours to 90 minutes.
  • Tune copy throughput without rewriting pipelines.
  • Standardize linked-service evidence.
  • Separate source bottlenecks from IR sizing issues.
Solution Using Azure integration runtime

The analytics team reviewed Azure IR settings, copy activity DIUs, source throughput, and sink partitioning. They increased parallelism for high-volume stores, adjusted DIU settings for larger copy activities, and created dashboards for throughput by source. Integration runtime monitoring data was included in release notes so support teams could see whether failures came from Azure IR configuration, source throttling, or destination capacity. CLI outputs documented IR state and activity evidence during tuning. The team also assigned named owners, saved acceptance evidence, and reviewed rollout notes with support staff responsible for retail analytics cloud copy scale. A final readiness check compared design assumptions, operator permissions, monitoring signals, and rollback steps before the production milestone.

Results & Business Impact
  • Daily sales load time fell to 76 minutes.
  • Copy throughput improved by 61 percent on peak days.
  • Linked-service and IR evidence became part of release gates.
  • Two source throttling issues were separated from IR sizing problems.
Key Takeaway for Glossary Readers

Azure integration runtime helps teams tune data movement when pipeline logic and compute path are measured separately.

Case study 03

Manufacturing supplier gateway

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

IronVista Components, a manufacturing organization, needed to copy supplier quality files from partner SFTP servers into Azure with consistent monitoring and credential handling.

Business/Technical Objectives
  • Onboard 48 supplier feeds.
  • Store credentials outside pipeline code.
  • Detect failed transfers before production shifts.
  • Create a reusable IR pattern for future plants.
Solution Using Azure integration runtime

Data Factory pipelines used Azure integration runtime for cloud-accessible SFTP sources and linked services that pulled credentials from Key Vault. Each supplier feed used a standard naming pattern, retry policy, and failure alert. Operators tracked activity duration, error type, and transferred file count in Log Analytics. The team documented when Azure IR was sufficient and when self-hosted IR was required for private plant systems, making onboarding decisions repeatable across regions. The team also assigned named owners, saved acceptance evidence, and reviewed rollout notes with support staff responsible for manufacturing supplier gateway. A final readiness check compared design assumptions, operator permissions, monitoring signals, and rollback steps before the production milestone. After launch, the runbook kept Azure integration runtime checks tied to the same business objective rather than letting the configuration drift silently.

Results & Business Impact
  • Forty-eight supplier feeds were onboarded in six weeks.
  • No supplier credentials were stored in pipeline JSON.
  • Failed transfer alerts arrived before the morning production shift.
  • The reusable IR decision pattern was adopted by three plants.
Key Takeaway for Glossary Readers

Azure integration runtime gives data teams a practical boundary for secure, repeatable pipeline connectivity.

Why use Azure CLI for this?

Use Azure CLI for Azure integration runtime when you need repeatable inventory, governed changes, deployment checks, migration evidence, or incident proof. CLI output makes scope, identity, configuration, and timing explicit, which is better than relying on screenshots or memory during reviews.

CLI use cases

  • Inventory Azure integration runtime configuration across subscriptions, projects, tenants, or resource groups before a design review.
  • Capture repeatable evidence for incidents, audits, migrations, and release readiness checks.
  • Create or update supported settings through reviewed scripts instead of manual portal-only changes.
  • Compare expected state with actual state after deployment, rollback, migration, or platform upgrade work.

Before you run CLI

  • Confirm the active tenant, subscription, resource group, workspace, cluster, or project before running any command.
  • Check whether the command is read-only, mutating, cost-impacting, security-impacting, or destructive.
  • Use least-privilege identity and store sensitive output in approved locations only.
  • Have rollback notes and owner contacts ready before changing production configuration.

What output tells you

  • The output identifies the current Azure integration runtime resource, setting, relationship, or runtime state being inspected.
  • IDs, regions, SKUs, tags, endpoints, identities, and scopes show whether deployment matches the design.
  • Empty or missing fields often reveal an incomplete configuration, wrong scope, unsupported feature, or stale deployment.
  • Metric and state values help separate Azure configuration issues from application behavior problems.

Mapped Azure CLI commands

Azure integration runtime operations

direct
az datafactory integration-runtime list --factory-name <factory> --resource-group <resource-group>
az datafactory integration-runtimediscoverAnalytics
az datafactory integration-runtime show --factory-name <factory> --resource-group <resource-group> --name <integration-runtime>
az datafactory integration-runtimediscoverAnalytics
az datafactory integration-runtime get-monitoring-data --factory-name <factory> --resource-group <resource-group> --name <integration-runtime>
az datafactory integration-runtimediscoverAnalytics
az datafactory integration-runtime-node list --factory-name <factory> --resource-group <resource-group> --integration-runtime-name <integration-runtime>
az datafactory integration-runtime-nodediscoverAnalytics
az datafactory integration-runtime self-hosted create --factory-name <factory> --resource-group <resource-group> --name <integration-runtime>
az datafactory integration-runtime self-hostedprovisionAnalytics

Architecture context

Technically, Azure integration runtime is configured through integration runtime type, region, data integration units, and self-hosted nodes. Operators verify it with datafactory integration-runtime CLI output, monitoring data, node health, and activity run logs. It integrates with Azure Data Factory, Azure Synapse pipelines, Azure Relay, and Key Vault. Key settings include IR type, region, DIUs, and self-hosted node count. Capture desired state, compare it to live Azure state, and keep evidence for releases, incidents, and audits. These details matter because control-plane mistakes can affect many resources quickly.

Security

Security for Azure integration runtime starts with knowing who can configure it, who can use it, and what data or access path it can influence. The main risk is placing sensitive data movement on an unmanaged node, weak credential storage, blocked outbound connectivity, wrong region, or missing monitoring on self-hosted hosts. Review RBAC assignments, managed identities, keys or credentials, network exposure, diagnostic logs, and any linked resources before production use. Prefer least privilege, private connectivity where appropriate, audited changes, and secret storage outside application code. Also confirm that support teams can prove the current configuration during an incident without relying on screenshots or memory. Document the approved evidence before the first high-risk change and review it during access recertification.

Cost

Cost impact for Azure integration runtime comes from copy activity duration, DIU settings, pipeline retries, self-hosted infrastructure, managed virtual network overhead, data transfer, and idle private integration resources. The common waste pattern is enabling the capability for a pilot, then leaving resources, replicas, logs, or supporting infrastructure running after the original need changes. Estimate costs before rollout, tag resources to a clear owner, and compare steady-state usage with the design assumption. During reviews, look for unused resources, overbuilt tiers, avoidable data movement, and duplicated environments. Cost control works best when finance data is tied back to operational intent. Tie each optimization to an owner, forecast, and retirement date.

Reliability

Reliability depends on whether Azure integration runtime is designed for the workload’s real failure modes. Focus on node availability, queue backlog, activity retries, linked service failures, outbound network dependencies, self-hosted IR updates, and regional pipeline placement. A reliable design documents what should happen during scale-out, regional disruption, credential failure, deployment rollback, and operator error. Monitoring should show both the Azure resource state and the application symptoms users actually feel. Test the runbook before an outage, capture evidence from CLI or portal checks, and decide which failures require manual intervention versus automated recovery. Include dependency maps and health signals so responders know whether the platform or application failed during triage.

Performance

Performance depends on how Azure integration runtime affects latency, throughput, deployment speed, or operator decision time. Focus on DIU sizing, source and sink throughput, parallel copy settings, network latency, self-hosted node CPU, private endpoint routing, and copy activity bottlenecks. Do not assume the default setting is fast enough for production or that a faster tier fixes design problems. Measure before and after important changes, watch for throttling or slow control-plane calls, and test with realistic scale. Performance evidence should include user-facing symptoms, resource metrics, and configuration details so the team can distinguish service limits from application defects. Include baseline measurements so later tuning work has a defensible comparison point for teams.

Operations

Operationally, Azure integration runtime should appear in runbooks, dashboards, release gates, and ownership records. Focus on node patching, IR ownership, linked service evidence, throughput baselines, alerting, credential rotation, change records, and runbooks for stuck activity runs. The team should know which commands are safe for inventory, which changes are mutating, and which outputs prove compliance or readiness. Keep naming, tags, environments, and documentation consistent so support engineers can find the right resource quickly. Review the configuration after major releases, incident retrospectives, platform upgrades, and cost reviews rather than treating it as a one-time setup. Assign a named owner, keep an escalation path, and review stale automation before quarterly platform reviews.

Common mistakes

  • Running commands against the wrong subscription, tenant, workspace, cluster, or environment because context was not checked.
  • Treating a successful create command as proof that security, monitoring, and operations are complete.
  • Copying examples into production without adjusting regions, names, identities, SKUs, and network rules.
  • Ignoring service-specific limits, preview behavior, retirement status, or required extensions before automation rollout.