Analytics Stream Analytics field-manual-complete field-manual-complete field-manual-complete

Stream Analytics job diagram

A Stream Analytics job diagram is a visual map of what the job is doing. Instead of reading only a query file and metric table, you can see inputs, query steps, outputs, and sometimes physical streaming nodes on one canvas. It is especially helpful when a job runs but produces no results, late results, or results that look wrong. The diagram gives developers and operators a faster way to narrow the problem before rewriting the query or scaling blindly.

Aliases
Stream Analytics job diagram, ASA job diagram, Stream Analytics logical diagram, Stream Analytics physical diagram, job diagram preview
Difficulty
intermediate
CLI mappings
4
Last verified
2026-05-26T18:37:22Z

Microsoft Learn

The Stream Analytics job diagram is a portal and developer-tool visualization that shows how a job’s inputs, query steps, streaming nodes, outputs, and metrics relate. It helps troubleshoot no-result, delayed-result, and unexpected-result scenarios by exposing logical query steps or physical processing nodes during monitoring and debugging.

Microsoft Learn: Stream Analytics job diagram in Azure portal2026-05-26T18:37:22Z

Technical context

In Azure architecture, the job diagram is an observability and troubleshooting view for a Stream Analytics job. It does not replace the job, query, input, or output resources; it visualizes their relationships and metrics. Logical diagrams focus on query steps, often using WITH-defined intermediate results. Physical diagrams focus on streaming nodes and processors. The feature appears in the Azure portal monitoring experience and in Stream Analytics developer tools, tying query design, runtime metrics, diagnostics, and local testing into one diagnostic workflow.

Why it matters

Job diagram matters because streaming failures are rarely obvious from a single metric. A job may be running, yet one input is quiet, one query step drops rows, a window waits on late events, or one output rejects records. The diagram helps operators see where events stop flowing and whether the issue is logical or physical. That shortens incident calls, reduces risky trial-and-error query edits, and prevents wasteful scaling when the real problem is schema, timestamp, or output connectivity. For learners, it turns abstract Stream Analytics concepts into a visible pipeline they can inspect and explain. during real incidents. quickly.

Where you see it

Signals, screens, and Azure surfaces where this term usually becomes operational.

Signal 01

In the Azure portal Monitoring area for a Stream Analytics job, the Job diagram preview shows logical query steps or physical streaming nodes with input, output, and metric context.

Signal 02

In Visual Studio Code Stream Analytics tools, the job diagram appears during local testing so developers can inspect intermediate result sets and metrics for specific query steps.

Signal 03

In incident reviews, screenshots of the diagram often sit beside resource logs, SU utilization charts, watermark delay, and output error records to explain where data flow failed.

When this becomes relevant

Specific situations where this term helps solve real Azure design, operations, migration, security, reliability, cost, or governance problems.

  • Find the exact query step where a running job stops producing expected rows.
  • Explain Stream Analytics flow to new operators without forcing them to parse the full query first.
  • Compare logical query behavior with physical streaming-node pressure before scaling streaming units.
  • Validate a complex WITH-based query locally by inspecting intermediate result sets.
  • Collect visual evidence during an incident where inputs are healthy but outputs are stale or empty.

Real-world case studies

Different enterprise-style examples that show the term being used to hit measurable objectives.

Case study 01

Media platform finds dropped ads in one query stage

A live sports media platform inserted ad decision events into a Stream Analytics job, but some regional ads never reached the dashboard. Metrics showed the job was running, so engineers argued over the input, query, and output for hours.

Scenario

A live sports media platform inserted ad decision events into a Stream Analytics job, but some regional ads never reached the dashboard. Metrics showed the job was running, so engineers argued over the input, query, and output for hours.

Business/Technical Objectives
  • Find the first pipeline stage where ad events disappeared.
  • Avoid scaling the job until the actual bottleneck was proven.
  • Give the broadcast support desk a clear incident explanation before the next match.
Solution Using Stream Analytics job diagram

The team opened the Stream Analytics job diagram during the replay window and switched between logical query steps and node metrics. CLI output confirmed the job name, expanded topology, and input/output aliases. The diagram showed events reaching the regional-filter step but not the output branch for one market. Engineers then inspected the matching WITH clause and found a stale region code list in reference data. They corrected the data source, reran the query test, and saved screenshots with CLI JSON in the incident record.

Results & Business Impact
  • Root-cause isolation time fell from three hours to 38 minutes.
  • No extra streaming units were added, avoiding an unnecessary capacity increase for the playoff weekend.
  • Regional ad delivery recovered before the second match, protecting an estimated 92 percent of scheduled impressions.
  • The support desk reused the diagram in the post-incident brief instead of hand-drawing the flow.
Key Takeaway for Glossary Readers

A job diagram turns a vague streaming complaint into a visible place to start proving the failure.

Case study 02

Building integrator diagnoses stale occupancy analytics

A smart-building integrator sent lobby camera counts and badge-reader events through a Stream Analytics job for occupancy reporting. Facilities managers saw stale counts after lunch peaks, while raw sensor dashboards looked healthy.

Scenario

A smart-building integrator sent lobby camera counts and badge-reader events through a Stream Analytics job for occupancy reporting. Facilities managers saw stale counts after lunch peaks, while raw sensor dashboards looked healthy.

Business/Technical Objectives
  • Explain why output lag appeared only during peak occupancy periods.
  • Separate sensor reliability issues from Stream Analytics query behavior.
  • Reduce tenant complaint response time for building operators.
Solution Using Stream Analytics job diagram

Operators used the job diagram to inspect the logical stages for badge correlation, camera-count smoothing, and floor-level aggregation. CLI checks verified the production job, output names, streaming units, and diagnostic workspace. The diagram highlighted a query step with rising watermark delay after a tumbling-window aggregation. Review of the query showed a grouping key that combined floor and tenant in a way that produced skew during lunch hours. The team adjusted the query, retested with sample peak data, and used the diagram to validate smoother flow before deployment.

Results & Business Impact
  • Occupancy dashboard lag dropped from 22 minutes to under four minutes at peak.
  • Facilities support tickets about stale counts fell by 31 percent in the next month.
  • The team avoided blaming sensors and replacing devices that were already reporting correctly.
  • New runbooks include diagram screenshots for expected flow during high-traffic periods.
Key Takeaway for Glossary Readers

The diagram is valuable when the job is running but one query stage quietly slows the business signal.

Case study 03

Food plant shortens night-shift telemetry triage

A packaged-food manufacturer monitored oven temperature and conveyor speed with Stream Analytics. On night shifts, alarms sometimes vanished from the supervisor dashboard, and the only available engineer was usually remote.

Scenario

A packaged-food manufacturer monitored oven temperature and conveyor speed with Stream Analytics. On night shifts, alarms sometimes vanished from the supervisor dashboard, and the only available engineer was usually remote.

Business/Technical Objectives
  • Give night-shift technicians a simple way to describe where data flow stopped.
  • Reduce unnecessary job restarts that caused duplicate dashboard records.
  • Capture evidence that day-shift engineers could review quickly.
Solution Using Stream Analytics job diagram

The operations team added job-diagram steps to the incident runbook. Technicians opened the diagram in the Azure portal, selected the incident time range, and compared input nodes, query stages, and output nodes with CLI-captured job state. In one incident, the diagram showed healthy input and query stages but a blocked dashboard-output path. Logs then confirmed the output destination rejected a schema change introduced by a maintenance script. The team fixed the schema, restarted with the documented output start behavior, and attached the diagram evidence to the shift handoff.

Results & Business Impact
  • Mean night-shift triage time fell from 74 minutes to 19 minutes.
  • Unnecessary job restarts dropped from five per month to one in the next quarter.
  • Duplicate dashboard rows caused by blind restarts were reduced by 86 percent.
  • Day-shift engineers received consistent evidence instead of fragmented chat messages.
Key Takeaway for Glossary Readers

A job diagram gives non-specialists enough context to escalate the right failure without making the outage worse.

Why use Azure CLI for this?

With ten years of Azure operations work, I treat CLI as the evidence layer around the job diagram. The diagram itself is mostly a portal or Visual Studio Code experience, but CLI lets me prove the job, inputs, outputs, transformation, streaming units, diagnostic settings, and current state before I trust what I am seeing. During incidents, I capture CLI JSON and then use the diagram to inspect flow and step metrics. That combination avoids a classic mistake: staring at a diagram from the wrong subscription or after a deployment changed topology. CLI makes the visual diagnosis auditable. for later review.

CLI use cases

  • Show the job and expanded topology before opening the diagram, so the visual view is tied to the correct resource.
  • List inputs and outputs to verify every diagram node represents the expected alias and destination.
  • Capture streaming units, job state, compatibility level, and tags while the diagram is used for incident evidence.
  • Check diagnostic settings and Log Analytics routing before relying on diagram metrics during troubleshooting.
  • Export current job configuration after a diagram-guided investigation to compare against the next release.

Before you run CLI

  • Confirm the tenant, subscription, resource group, and job name because a diagram from the wrong job can mislead an incident bridge.
  • Use read-only show and list commands before starting, stopping, or scaling the job while troubleshooting.
  • Make sure diagnostics and metrics are enabled, otherwise the diagram may have too little evidence to isolate the issue.
  • Record the time range under investigation so CLI output, diagram metrics, and logs refer to the same incident window.

What output tells you

  • Job show output confirms the exact Stream Analytics resource, state, location, identity, compatibility level, and capacity behind the diagram.
  • Input and output list output confirms whether the visible diagram nodes match actual aliases and configured data sources.
  • Diagnostic settings output tells whether resource logs and metrics are available to support diagram-based troubleshooting.
  • Expanded transformation details help correlate diagram query steps with WITH clauses, SELECT statements, and output routing.

Mapped Azure CLI commands

Job diagram evidence checks

validates
az stream-analytics job show --name <job-name> --resource-group <resource-group> --expand inputs,outputs,transformation,functions
az stream-analytics jobdiscoverAnalytics
az stream-analytics input list --job-name <job-name> --resource-group <resource-group> --output table
az stream-analytics inputdiscoverAnalytics
az stream-analytics output list --job-name <job-name> --resource-group <resource-group> --output table
az stream-analytics outputdiscoverAnalytics
az monitor diagnostic-settings list --resource <stream-analytics-job-resource-id> --output table
az monitor diagnostic-settingsdiscoverAnalytics

Architecture context

A job diagram belongs in the operational architecture for any serious streaming workload. Architects should not depend on diagrams as the source of truth, but they should design jobs so the diagram is useful: clear input aliases, named WITH steps, purposeful output names, diagnostics enabled, and a query structure that exposes stages. Logical diagrams help explain transformation flow, while physical diagrams help identify streaming node pressure and processor behavior. For supportability, the diagram should align with runbooks, dashboards, deployment templates, and incident notes. When teams name each stage intentionally, new operators can trace data from ingress to output without reverse-engineering a dense query at 2 a.m.

Security

Security impact is indirect because the diagram is a visualization, not a data-access mechanism. The risk appears in who can open the job, view topology, inspect metrics, and use the information during troubleshooting. A diagram can reveal source names, output destinations, query shape, and operational dependencies that matter to attackers or auditors. Access should follow the same RBAC and least-privilege rules used for the Stream Analytics job and monitoring data. Teams should avoid sharing screenshots that expose customer systems, namespaces, table names, or incident-sensitive topology. Diagnostic logs behind the diagram also need controlled access and retention. during audit reviews consistently.

Cost

The diagram has little direct billing impact; it is not a separate paid processing resource. Its cost value comes from avoiding the wrong fix. Without step-level visibility, teams often add streaming units, widen windows, replay events, or send more records to diagnostic logs before understanding the bottleneck. A clear diagram can show that an output sink, schema mismatch, or query stage is the issue, saving compute and incident time. It also helps justify whether scaling is actually needed. The indirect cost path is labor, overprovisioned capacity, extra replay, and unnecessary downstream writes caused by blind troubleshooting. across engineering teams clearly.

Reliability

Reliability impact is indirect but meaningful. The diagram does not make the job more resilient by itself, yet it helps teams detect where reliability is failing. Operators can distinguish a quiet input from a blocked output, see whether a query step is accumulating delay, and decide whether to rollback, restart, or scale. During a production incident, that context reduces time spent guessing and lowers the chance of a destructive restart. Reliable teams keep diagrams usable by enabling metrics, naming query stages clearly, testing local changes, and preserving runbooks that explain expected flow under normal load. throughout release events during cutovers.

Performance

Performance impact is diagnostic rather than runtime. The diagram does not speed up a Stream Analytics job, but it can expose where performance is lost. Logical views help identify expensive query stages, joins, windows, or filters that stop events from reaching outputs. Physical views help teams reason about streaming nodes and processor behavior under load. Operators should use the diagram alongside SU utilization, input backlog, watermark delay, late events, and output errors. That combination improves tuning speed and reduces the temptation to add capacity before confirming the query can parallelize or the output can keep up. under production pressure quickly.

Operations

Operators use the job diagram during active troubleshooting, release validation, and handover. They open it after a job is running, then inspect inputs, query steps, outputs, node metrics, watermark delay, and intermediate results where available. It is useful when a dashboard stops updating, output rows look wrong, or query changes behave differently from local tests. Operators should pair the diagram with resource logs, CLI job state, output error details, and recent deployment history. The diagram is not a substitute for telemetry, but it accelerates the first question every incident asks: where did the data stop behaving normally? during war rooms.

Common mistakes

  • Treating the diagram as the source of truth while ignoring deployment history, query text, logs, and CLI-confirmed topology.
  • Scaling streaming units because a node looks busy without checking whether the output sink or query step is the real bottleneck.
  • Using vague input and output aliases that make the diagram hard to interpret during a production incident.
  • Sharing screenshots that expose sensitive source names, destinations, or topology details outside the approved support channel.
  • Debugging a stale time range and assuming the current diagram explains an incident that happened hours earlier.