The Microsoft Fabric Blog introduces the Spark Applications Comparison feature, which lets developers and data engineers analyze, debug, and optimize Spark application runs by comparing key performance metrics side by side.

Microsoft Fabric Spark Applications Comparison (Preview)

The Spark Applications Comparison feature is now available in preview on Microsoft Fabric. It helps developers and data engineers:

Analyze, debug, and optimize Apache Spark performance across multiple application runs
Track changes and improvements due to code updates or data variations

What Is Spark Applications Comparison?

This feature lets users select up to four Spark application runs and compare them side by side. It presents key execution and resource metrics for each run, which makes it easier to:

Identify performance regressions or improvements
Spot anomalies and root causes
Visualize metric deltas compared to a baseline run
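
To make the idea of a delta against a baseline run concrete, here is a minimal Python sketch. It is not Fabric's implementation: the metric names (running_duration_s, shuffle_write_mb), the sample values, and the metric_deltas helper are all hypothetical and chosen only for illustration.

```python
from typing import Dict

# Hypothetical metric names and values, used only for illustration.
Metrics = Dict[str, float]


def metric_deltas(baseline: Metrics, other: Metrics) -> Dict[str, Dict[str, float]]:
    """Return the absolute and percentage change of each shared metric versus the baseline."""
    deltas = {}
    for name in baseline.keys() & other.keys():
        base_value = baseline[name]
        diff = other[name] - base_value
        # Percentage change is undefined when the baseline value is zero.
        pct = (diff / base_value * 100.0) if base_value != 0 else float("nan")
        deltas[name] = {"abs": diff, "pct": pct}
    return deltas


baseline_run = {"running_duration_s": 200.0, "shuffle_write_mb": 400.0}
candidate_run = {"running_duration_s": 250.0, "shuffle_write_mb": 300.0}

for name, delta in sorted(metric_deltas(baseline_run, candidate_run).items()):
    print(f"{name}: {delta['abs']:+.1f} ({delta['pct']:+.1f}%)")
# running_duration_s: +50.0 (+25.0%)
# shuffle_write_mb: -100.0 (-25.0%)
```

In this made-up example the candidate run is 25% slower than the baseline but writes 25% less shuffle data, which is the kind of contrast a side-by-side view is meant to surface.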

Key Capabilities

Compare runs from the same artifact (Notebook or Spark Job Definition)
View differences in execution time, data trends, and resource usage
Debug problematic runs by inspecting metric changes

The Compare Panel

The Compare Panel presents a comprehensive breakdown of relevant performance and resource metrics, such as:

Start time
Running duration
Queued/total duration
Executor CPU time and run time
Input/output data size and record counts
Disk and memory spill
Shuffle write volume and record count

For outlier or anomalous runs, root causes and key resource bottlenecks are highlighted to help focus performance investigations.
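
For readers who want to reason about these metrics outside the UI, the list above can be modeled as a simple record per run. The following Python sketch is an assumption-laden illustration: the RunMetrics class, its field names, its units, and the sample values are hypothetical and do not reflect any Fabric API or export format.

```python
from dataclasses import dataclass, asdict
from typing import List


@dataclass
class RunMetrics:
    """Hypothetical record holding a subset of the metrics listed above."""
    run_name: str
    running_duration_s: float
    executor_cpu_time_s: float
    input_bytes: int
    output_bytes: int
    disk_spill_bytes: int
    memory_spill_bytes: int
    shuffle_write_bytes: int


def side_by_side(runs: List[RunMetrics]) -> List[List[str]]:
    """Build rows of [metric, value for run 1, value for run 2, ...]."""
    rows = [["metric"] + [run.run_name for run in runs]]
    # Field order of the dataclass determines the row order.
    metric_names = [name for name in asdict(runs[0]) if name != "run_name"]
    for name in metric_names:
        rows.append([name] + [str(getattr(run, name)) for run in runs])
    return rows


runs = [
    RunMetrics("base", 200.0, 150.0, 1000, 800, 0, 0, 400),
    RunMetrics("run-2", 250.0, 170.0, 1000, 800, 64, 128, 300),
]

for row in side_by_side(runs):
    print(" | ".join(row))
```

Laying the runs out one column per run mirrors the side-by-side layout described above; nonzero spill values in one column but not the other are an example of the kind of difference worth a closer look.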

Deep Dive: Spark Monitoring L2 Integration

Each run in the comparison view links to the Spark L2 Monitoring page, which provides:

Detailed job/stage breakdowns
Logs and configuration history
Fine-grained control for root cause analysis and performance optimization

How To Use It

Go to the Monitor Run Series page in Microsoft Fabric
Select the ‘Compare Runs’ tab
Choose up to four completed Spark runs. One serves as the base run and the others are compared against it; the base run can be reassigned
Review the highlighted changes and metrics in the Compare Panel
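
The selection rules in the steps above (at most four runs, one base run, a reassignable base) can be sketched as a small Python model. This is purely illustrative: the RunSelection class, the run ids, the minimum of two runs, and the choice of the first selected run as the initial base are assumptions, not documented Fabric behavior.

```python
from typing import List

MAX_RUNS = 4  # the feature compares up to four runs
MIN_RUNS = 2  # assumption: a comparison needs at least two runs


class RunSelection:
    """Hypothetical model of a comparison: one base run plus the runs compared against it."""

    def __init__(self, run_ids: List[str]) -> None:
        if not MIN_RUNS <= len(run_ids) <= MAX_RUNS:
            raise ValueError(f"select between {MIN_RUNS} and {MAX_RUNS} runs")
        if len(set(run_ids)) != len(run_ids):
            raise ValueError("run ids must be unique")
        self.run_ids = list(run_ids)
        # Assumption: the first selected run starts out as the base run.
        self.base = self.run_ids[0]

    def set_base(self, run_id: str) -> None:
        """Reassign the base run to another run in the selection."""
        if run_id not in self.run_ids:
            raise ValueError(f"{run_id!r} is not part of the selection")
        self.base = run_id

    def compared_runs(self) -> List[str]:
        """Return every selected run except the current base."""
        return [run_id for run_id in self.run_ids if run_id != self.base]


selection = RunSelection(["run-a", "run-b", "run-c"])
print(selection.base, selection.compared_runs())  # run-a ['run-b', 'run-c']

selection.set_base("run-c")
print(selection.base, selection.compared_runs())  # run-c ['run-a', 'run-b']
```

Reassigning the base changes only the reference point; the set of selected runs stays the same, so every delta is simply recomputed against the new base.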

For further analysis, investigate individual runs on the linked Spark L2 Monitoring page.

Getting Started

To explore this feature, visit the Monitor Run Series page in Microsoft Fabric, select multiple completed runs, and open the Compare Panel. From there you can track how an application's performance changes from run to run and decide where to optimize.

For more details, see the official documentation.

This post appeared first on “Microsoft Fabric Blog”.