Mayunk_Jain introduces the expanded public preview for Azure SRE Agent, focusing on automation, governance, diagnostics, and incident management for DevOps and SRE teams operating in Azure.

Azure SRE Agent Public Preview Expands: New Features for DevOps and Automation

Author: Mayunk_Jain

The Azure SRE Agent is now in public preview, instantly available to all users without the need for sign-up. This release incorporates feedback from early adopters and delivers a suite of new capabilities for DevOps, site reliability engineering (SRE), and enterprise teams managing services on Azure.

What’s New in Azure SRE Agent

Secure-by-Default Governance

  • Operates with least-privilege access by default
  • Never executes write actions on Azure resources without explicit human approval
  • Leverages role-based access control (RBAC) for read-only or approver roles
  • Provides oversight, traceability, and flexibility—from insights to full automation

Deep Diagnostics and Extensible Automation

  • Supports all Azure services via AZ CLI and kubectl
  • Enhanced diagnostics for PostgreSQL, API Management, Azure Functions, AKS, Azure Container Apps, and Azure App Service
  • Consistent automation and insights for diverse cloud environments, from monoliths to microservices

Automated Incident Management

  • Native integrations with Azure Monitor, PagerDuty, and ServiceNow
  • Ingests alerts and triggers workflows compatible with existing DevOps toolchains
  • Enables streamlined incident detection and automated or human-in-the-loop response

Root Cause Analysis and Developer Integration

  • Integrates with GitHub and Azure DevOps for code-aware root cause analysis (RCA)
  • Traces incidents directly to source code and recent changes
  • Accelerates resolution by connecting operational data to engineering workflows

Closing the DevOps Loop

  • Automatically generates incident summary reports in GitHub and Azure DevOps, complete with diagnostic context
  • Optionally assigns incidents to GitHub Copilot coding agents for automated pull requests and merge workflows, contributing to permanent code fixes

Getting Started

Key Features by Area

  • Governance & Security: Least-privilege operations, explicit approval for write actions, RBAC-enabled roles.
  • Automation: Pluggable into existing Azure, GitHub, ServiceNow, and PagerDuty workflows.
  • Diagnostics: Advanced support for cloud-native and database platforms (AKS, App Service, PostgreSQL, etc.).
  • DevOps Integration: Seamless feedback loops between incident management and code repositories; enables continuous improvement.
  • Extensibility: Reuse of existing runbooks and customizable workflows.

Community & Support

Additional Resources

Updated October 2, 2025 — Version 2.0

This post appeared first on “Microsoft Tech Community”. Read the entire article here