Microsoft Fabric Spark Applications Comparison Feature (Preview)
Microsoft Fabric Blog introduces the Spark Applications Comparison feature, allowing developers and data engineers to analyze, debug, and optimize Spark application runs by comparing key performance metrics side by side.
Microsoft Fabric Spark Applications Comparison (Preview)
The Spark Applications Comparison feature is now available in preview on Microsoft Fabric. This tool is designed to empower developers and data engineers to:
- Analyze, debug, and optimize Apache Spark performance across multiple application runs
- Track changes and improvements due to code updates or data variations
What Is Spark Applications Comparison?
This feature allows users to select up to four Spark application runs and compare them side by side. By presenting key execution and resource metrics for each run, it becomes easier to:
- Identify performance regressions or improvements
- Spot anomalies and root causes
- Visualize metric deltas compared to a baseline run
Key Capabilities
- Compare runs from the same artifact (Notebook or Spark Job Definition)
- View differences in execution time, data trends, and resource usage
- Debug problematic runs by inspecting metric changes
The Compare Panel
The Compare Panel presents a comprehensive breakdown of relevant performance and resource metrics, such as:
- Start time
- Running duration
- Queued/total duration
- Executor CPU time and run time
- Input/output data size and record counts
- Disk and memory spill
- Shuffle write volume and record count
For outlier or anomalous runs, root causes and key resource bottlenecks are highlighted to help focus performance investigations.
Deep Dive: Spark Monitoring L2 Integration
Each run in the comparison view links to the Spark L2 Monitoring page, which provides:
- Detailed job/stage breakdowns
- Logs and configuration history
- Fine-grained control for root cause analysis and performance optimization
How To Use It
- Go to the Monitor Run Series page in Microsoft Fabric
- Select ‘Compare Runs’ tab
- Choose up to four completed Spark runs. One serves as the base run, while the rest are compared against it (the base run can be reassigned)
- Review the highlighted changes and metrics in the Compare Panel
For further analysis, investigate individual runs using the linked L2 Monitoring tool.
Getting Started
To explore this feature, visit the Monitor Run Series page in Microsoft Fabric, select multiple completed runs, and access the Compare Panel. This enables you to track application evolution and optimize workflows.
For more details, see the official documentation.
This post appeared first on “Microsoft Fabric Blog”. Read the entire article here