Kusto data connection - Azure Glossary

Microsoft Learn

Kusto data connection links Azure Data Explorer to event or storage sources such as Event Hubs, Event Grid, or IoT Hub so data can be ingested into a database table. Microsoft Learn places it in Create an Event Hubs data connection - Azure; operators confirm scope, configuration, dependencies, and production impact.

Microsoft Learn: Create an Event Hubs data connection - Azure Data Explorer2026-05-15

Technical context

Technically, Kusto data connection involves data connection resource, Event Hubs, Event Grid, IoT Hub, consumer group. Teams configure or inspect it through Azure portal, Azure CLI data-connection commands, ARM templates, Event Hubs, Event Grid subscriptions and validate it with connection state, source resource ID, consumer group, target table, mapping name. Key dependencies include Kusto database, source event service, network rules, identity permissions, mapping definitions. In production, document scope, identity, network path, telemetry, lifecycle, and rollback. Treat the term as runtime state: portal settings, Kusto commands, CLI output, logs, and policy assignments should agree before release.

Why it matters

Kusto data connection matters because wrong consumer groups, missing permissions, schema drift, or source throttling can silently delay or break streaming analytics pipelines. It also shapes event-driven ingestion, near-real-time dashboards, IoT analytics, storage-triggered ingestion, and source-to-table ownership. When teams treat it as a loose label, they create work that is invisible until a release, audit, incident, or scaling event. Good implementation gives architects a real decision point, operators a measurable signal, security teams a control to review, and finance teams a cost driver to explain. That makes the term a practical checkpoint for design quality, ownership, and production readiness.

Where you see it

Signals, screens, and Azure surfaces where this term usually becomes operational.

Signal 01

In the Azure portal or service blade, Kusto data connection appears around ADX database data connections, Event Hubs, Event Grid subscriptions, target tables, where owners review access, health, and readiness.

Signal 02

In CLI, Kusto command, or deployment output, Kusto data connection shows through connection properties, source IDs, target table, mapping names, giving operators evidence during audits and incidents.

Signal 03

In architecture reviews, Kusto data connection appears when teams debate streaming ingestion path, source permissions, schema mapping, then compare intended design with live state. during reviews, releases, and support handoffs.

When this becomes relevant

Specific situations where this term helps solve real Azure design, operations, migration, security, reliability, cost, or governance problems.

Use Kusto data connection during architecture review to make ownership, dependencies, and risk explicit before production deployment.
Use Kusto data connection in operational runbooks so support teams can verify live Azure or Kusto state without guessing.
Use Kusto data connection in compliance evidence when auditors ask how access, data flow, query behavior, or platform configuration is controlled.
Use Kusto data connection during incident triage to separate application defects from platform configuration or dependency failures.

Real-world case studies

Different enterprise-style examples that show the term being used to hit measurable objectives.

Case study 01

Stabilizing near-real-time manufacturing analytics

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

Contoso Components, a industrial manufacturing organization, needed to solve unreliable plant-floor analytics where late events and hidden ingestion delays caused incorrect production dashboards. The platform team used Kusto data connection to make the design observable, governed, and supportable in production.

Business/Technical Objectives

Improve freshness for critical dashboard data to under five minutes.
Reduce manual reconciliation after ingestion failures by at least 40%.
Expose backlog, schema, and policy evidence to on-call engineers.
Avoid adding permanent compute capacity without measured need.

Solution Using Kusto data connection

Architects defined Kusto data connection as part of the workload runbook and linked it to data connection resource, Event Hubs, Event Grid, IoT Hub, owner tags, diagnostic settings, and the approved deployment path. Operators used az kusto data-connection list --cluster-name <cluster-name> --database-name <database-name> --resource-group <resource-group> for read-only evidence, then compared the result with Kusto management commands, portal state, activity logs, metrics, and change records. Security reviewers checked managed identities, Event Hubs authorization, source RBAC, private endpoints, while reliability engineers validated source availability, consumer group health, mapping correctness, table schema under a realistic pilot workload. The rollout separated discovery from change-controlled steps, stored evidence with resource IDs and database names, and tied rollback to dashboards and support alerts.

Results & Business Impact

Dashboard freshness improved from 18 minutes to four minutes for priority telemetry.
Manual reconciliation work fell by 47% because failed ingestion and schema evidence were visible.
On-call engineers identified backlog sources in under ten minutes during three incidents.
Compute spend stayed within 8% of forecast because scaling decisions were tied to metrics.

Key Takeaway for Glossary Readers

Kusto data connection is valuable when teams convert an Azure concept into verified state, owner accountability, and measurable production behavior.

Case study 02

Reducing telemetry investigation time

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

Northwind Health, a regional healthcare analytics organization, needed to solve slow incident investigations across telemetry stores after a patient portal release increased diagnostic volume. The platform team used Kusto data connection to make the design observable, governed, and supportable in production.

Business/Technical Objectives

Reduce mean time to isolate telemetry issues by at least 35%.
Keep audit evidence for all production diagnostic changes.
Protect sensitive operational and patient-adjacent metadata from broad access.
Give support teams a repeatable recovery checklist for failed changes.

Solution Using Kusto data connection

Architects defined Kusto data connection as part of the workload runbook and linked it to data connection resource, Event Hubs, Event Grid, IoT Hub, owner tags, diagnostic settings, and the approved deployment path. Operators used az kusto data-connection list --cluster-name <cluster-name> --database-name <database-name> --resource-group <resource-group> for read-only evidence, then compared the result with Kusto management commands, portal state, activity logs, metrics, and change records. Security reviewers checked managed identities, Event Hubs authorization, source RBAC, private endpoints, while reliability engineers validated source availability, consumer group health, mapping correctness, table schema under a realistic pilot workload. The rollout separated discovery from change-controlled steps, stored evidence with resource IDs and database names, and tied rollback to dashboards and support alerts.

Results & Business Impact

Mean time to isolate telemetry issues fell by 42% after operators used one approved evidence path.
Audit preparation dropped from three days to six hours because resource IDs, commands, and approvals were stored together.
Security review found no broad reader role expansion after database and resource permissions were separated.
Rollback rehearsals reduced failed-change recovery from 55 minutes to 22 minutes.

Key Takeaway for Glossary Readers

Kusto data connection is valuable when teams convert an Azure concept into verified state, owner accountability, and measurable production behavior.

Case study 03

Hardening analytics governance for regulatory reporting

Scenario, objectives, solution, measured impact, and takeaway.

Scenario

Fabrikam Capital, a financial services organization, needed to solve regulatory reporting queries depended on undocumented analytics settings and inconsistent access between development and production. The platform team used Kusto data connection to make the design observable, governed, and supportable in production.

Business/Technical Objectives

Create traceable evidence for every production analytics configuration.
Lower query-related compliance exceptions by at least 50%.
Preserve performance for month-end reporting dashboards.
Document rollback and approval paths for all mutating operations.

Solution Using Kusto data connection

Architects defined Kusto data connection as part of the workload runbook and linked it to data connection resource, Event Hubs, Event Grid, IoT Hub, owner tags, diagnostic settings, and the approved deployment path. Operators used az kusto data-connection list --cluster-name <cluster-name> --database-name <database-name> --resource-group <resource-group> for read-only evidence, then compared the result with Kusto management commands, portal state, activity logs, metrics, and change records. Security reviewers checked managed identities, Event Hubs authorization, source RBAC, private endpoints, while reliability engineers validated source availability, consumer group health, mapping correctness, table schema under a realistic pilot workload. The rollout separated discovery from change-controlled steps, stored evidence with resource IDs and database names, and tied rollback to dashboards and support alerts.

Results & Business Impact

Compliance exceptions related to analytics configuration fell by 63% in the next audit cycle.
Month-end dashboard latency improved by 28% after query and cache evidence guided tuning.
Every mutating change included an owner, approved scope, and rollback note.
Reviewers reduced signoff time by 38% because live state matched source-controlled records.

Key Takeaway for Glossary Readers

Kusto data connection is valuable when teams convert an Azure concept into verified state, owner accountability, and measurable production behavior.

Why use Azure CLI for this?

Use CLI and Kusto commands for Kusto data connection when you need repeatable evidence instead of a one-off portal screenshot. Start with read-only discovery, compare output with source-controlled intent, and attach the result to the change, incident, or audit record. Mutating commands should run only after the owner, scope, rollback path, and customer-impact window are confirmed.

CLI use cases

Confirm the current Azure or Kusto state for Kusto data connection before approving a deployment or incident change.
Collect repeatable evidence for Kusto data connection during audits, service reviews, and ownership handoffs.
Compare expected configuration for Kusto data connection with live portal, CLI, query, and infrastructure-as-code evidence.
Validate graph-connected dependencies for Kusto data connection before changing production scope or access.

Before you run CLI

Confirm tenant, subscription, resource group, cluster, database, table, app, and environment before trusting command output.
Run list or show commands first, then save evidence before any create, alter, update, delete, export, start, stop, or deploy action.
Check whether output exposes secrets, connection strings, customer data, storage paths, query text, or regulated metadata.
Verify RBAC, database permissions, private network reachability, CLI extension version, and maintenance window before production changes.

What output tells you

It shows whether Kusto data connection exists in the expected scope and whether live state matches the approved design.
It exposes resource IDs, database names, table references, policy values, identities, endpoints, run history, or dependency settings.
It helps reviewers connect incidents to deployments, policy changes, query behavior, ingestion delays, export lag, or access failures.
It gives audit-ready evidence that can be attached to tickets, dashboards, change records, and post-incident timelines.

Mapped Azure CLI commands

Kusto data connection operational checks

direct

az kusto data-connection list --cluster-name <cluster-name> --database-name <database-name> --resource-group <resource-group>

az kusto data-connectiondiscoverAnalytics

az kusto data-connection event-hub show --cluster-name <cluster-name> --database-name <database-name> --resource-group <resource-group> --data-connection-name <connection-name>

az kusto data-connection event-hubdiscoverAnalytics

az kusto data-connection event-hub create --cluster-name <cluster-name> --database-name <database-name> --resource-group <resource-group> --data-connection-name <connection-name> --event-hub-resource-id <event-hub-resource-id> --consumer-group <consumer-group> --table-name <table-name> --mapping-rule-name <mapping-name>

az kusto data-connection event-hubprovisionAnalytics

az eventhubs eventhub show --namespace-name <namespace-name> --resource-group <resource-group> --name <event-hub-name>

az eventhubs eventhubdiscoverAnalytics

az monitor metrics list --resource <cluster-resource-id> --metric <metric-name>

az monitor metricsdiscoverAnalytics

Architecture context

Technically, Kusto data connection involves data connection resource, Event Hubs, Event Grid, IoT Hub, consumer group. Teams configure or inspect it through Azure portal, Azure CLI data-connection commands, ARM templates, Event Hubs, Event Grid subscriptions and validate it with connection state, source resource ID, consumer group, target table, mapping name. Key dependencies include Kusto database, source event service, network rules, identity permissions, mapping definitions. In production, document scope, identity, network path, telemetry, lifecycle, and rollback. Treat the term as runtime state: portal settings, Kusto commands, CLI output, logs, and policy assignments should agree before release.

Security

Security for Kusto data connection starts with managed identities, Event Hubs authorization, source RBAC, private endpoints, database permissions, diagnostic logs, network restrictions. Review who can create, alter, delete, query, export, ingest, publish, or diagnose the related configuration. Prefer Microsoft Entra ID, managed identities, least privilege, private networking, customer-managed keys where supported, diagnostic logs, and policy enforcement. Avoid storing secrets, connection strings, tokens, personal data, or regulated payload samples in scripts, consoles, queries, exported files, or shared tickets. During approval, check tenant boundaries, database roles, resource permissions, network exposure, alerting, and break-glass procedures so a configuration mistake does not become a breach.

Cost

Cost for Kusto data connection is driven by source events, ingestion volume, cluster capacity, monitoring logs, retry storms, Event Hubs throughput, storage events. The trap is assuming the feature is free because it looks like a policy, query, child resource, console, or metadata object. In Azure, the bill may appear through compute, storage, hot cache, query CPU, ingestion, export writes, monitoring ingestion, egress, replicas, reserved capacity, or support time. Tie the term to budgets, tags, alerts, and owner reviews. Also account for weak implementation: outage minutes, manual recovery, compliance exceptions, duplicated environments, and engineers spending hours proving state after an incident.

Reliability

Reliability for Kusto data connection depends on source availability, consumer group health, mapping correctness, table schema, ingestion backlog, retry behavior, event ordering expectations. A resource can exist and still fail the workload if schema, identity resolution, network reachability, quota, regional placement, retention, or dependent services are wrong. Build checks that prove the behavior from the caller's point of view, not only that the object is configured. Use health metrics, synthetic queries, retry-aware automation, backup or rollback plans, and documented ownership. During incidents, compare recent deployments with diagnostics and dependency state so teams can separate platform outage, configuration drift, capacity pressure, and application defects.

Performance

Performance for Kusto data connection depends on event volume, batch size, partitioning, mapping complexity, ingestion latency, source throttling, cluster ingestion capacity. Measure the real workflow instead of assuming the default design is fast enough. Look at latency, throughput, cache behavior, query plan, ingestion backlog, export lag, retry storms, regional distance, throttling, scheduling, and downstream bottlenecks. In many incidents the term is not the only slow component; it is where hidden limits, identity calls, network hops, storage behavior, or query shape become visible. Keep benchmarks tied to production-like data, expected concurrency, and monitoring dashboards so tuning does not weaken security or reliability.

Operations

Operations for Kusto data connection need runbooks covering connection inventory, source permission checks, failed ingestion review, mapping validation, table ownership, event hub monitoring, schema approvals. Operators should know which commands are safe read-only checks, which changes require approval, and which outputs prove state to auditors or incident commanders. Put ownership, environment naming, tagging, dashboards, alerts, and rollback steps beside the deployment pipeline. Do not let the portal become the only source of truth; capture cluster names, database names, table names, resource IDs, diagnostic settings, query text, and change history. Good operations turn the term into a predictable support motion instead of tribal knowledge.

Common mistakes

Treating Kusto data connection as a harmless label instead of checking the exact resource, owner, identity, and dependency path.
Running a mutating command in the wrong subscription, cluster, database, web app, or resource group because active context was not verified.
Assuming a successful deployment proves the feature works without checking logs, metrics, queries, access, and rollback evidence.
Ignoring cost, retention, cache, quota, network exposure, or data classification until an incident forces emergency cleanup.