Browse Azure Community (391)
JohnNaguib introduces the core problem RAG solves for LLMs—grounding responses in private or up-to-date business data—and points readers to a deep-dive walkthrough using Azure AI Search as the retrieval layer.
devanshirastogi explains upcoming changes to Azure Firewall explicit proxy and provides a migration walkthrough for PAC file–based setups, including moving PAC retrieval to customer-managed Azure Storage and using Managed Identity with the right RBAC roles. The guide includes portal steps plus PowerShell and Azure CLI examples for configuring Firewall Policy.
WSilveira explains an upcoming Azure Logic Apps hosting-model migration tied to the move toward .NET 10 support, including when the change is automatic and when customers need to update NuGet-based deployment processes to avoid unexpected out-of-proc migration.
varghesejoji introduces the ARM MCP Server and a GitHub catalog of 24 proof-of-concept agents that use Model Context Protocol tools to query Azure Resource Graph and (optionally) deploy ARM templates. The post focuses on safe, repeatable “determinism” patterns for governance, FinOps, and SRE workflows in Azure.
rhack summarizes an Azure Essentials Show episode with Thomas Maurer and Raffaele Garofalo on what typically derails AWS-to-Azure migrations and how to avoid it, focusing on stakeholder alignment, migrating like-for-like before optimizing, and using a blue-green cutover to keep rollback options open.
Nikita Bajaj explains how Azure Migrate integrates with GitHub Copilot Modernize (public preview) to generate portfolio-level code insights across multiple applications and repositories, helping teams assess Azure readiness, review remediation guidance, and plan modernization work using a shared workflow between migration admins and developers.
ShubhraS announces the general availability of Microsoft Signing Transparency (MST), a SCITT-aligned transparency ledger that logs production builds for selected Microsoft cloud services. The post explains how customers can independently verify code integrity using cryptographic proofs and audit trails, and introduces the Ledger Explorer tool for offline verification.
Gaurav Bhardwaj describes a hybrid Azure-based document extraction architecture for construction drawings and project documentation, combining deterministic field extraction with bounded LLM verification. The post breaks down the event-driven pipeline, confidence gating, cost trade-offs, and the security controls needed to run this kind of document intelligence workflow in production.
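Confidence gating of the kind mentioned above can be sketched as follows. This is a minimal illustration, not the architecture from the post: the `ExtractedField` type, the `gate` function, the 0.9 threshold, and the cap on LLM checks are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class ExtractedField:
    name: str
    value: str
    confidence: float  # 0.0-1.0 score from the deterministic extractor (assumed)


def gate(fields, threshold=0.9, max_llm_checks=2):
    """Split fields into accepted, LLM-verified, and human-review buckets.

    Fields at or above the threshold are accepted as-is. Fields below it
    go to LLM verification until the (hypothetical) budget is used up;
    the remainder is routed to human review.
    """
    accepted = [f for f in fields if f.confidence >= threshold]
    uncertain = [f for f in fields if f.confidence < threshold]
    to_llm = uncertain[:max_llm_checks]
    to_human = uncertain[max_llm_checks:]
    return accepted, to_llm, to_human


fields = [
    ExtractedField("drawing_number", "A-101", 0.98),
    ExtractedField("revision", "C", 0.72),
    ExtractedField("scale", "1:50", 0.55),
    ExtractedField("issue_date", "2024-03-01", 0.40),
]
accepted, to_llm, to_human = gate(fields)
print([f.name for f in accepted])
print([f.name for f in to_llm])
print([f.name for f in to_human])
```

Capping the number of LLM checks is one way to keep per-document cost predictable; the right threshold and cap depend on the workload.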
Jay Li explains how to migrate large Azure hub-and-spoke environments away from ExpressRoute hairpin routing through Microsoft Enterprise Edge (MSEE) and toward Azure Virtual Network Manager (AVNM) mesh. The post covers scale limits, enabling High-Scale Private Endpoints (HSPE), rollout/rollback strategy, and how to keep security inspection and segmentation intact.
samcogan explains how to use Azure API Management (APIM) as an MCP gateway to control which tools an agent can see and invoke when connecting to an MCP server. The post covers governance patterns (allowlist vs denylist), plus auth, logging, network isolation, and streaming-related trade-offs.
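The difference between the two governance patterns can be illustrated with a small filter. This is a sketch in plain Python, not APIM policy syntax; `visible_tools` and the tool names are hypothetical.

```python
from fnmatch import fnmatchcase


def visible_tools(tools, allow=None, deny=None):
    """Return the tool names an agent may see.

    allow: glob patterns; when given, only matching tools pass (default-deny).
    deny:  glob patterns; matching tools are removed (default-allow).
    """
    result = []
    for name in tools:
        if allow is not None and not any(fnmatchcase(name, p) for p in allow):
            continue
        if deny is not None and any(fnmatchcase(name, p) for p in deny):
            continue
        result.append(name)
    return result


tools = ["search_docs", "read_file", "delete_file", "run_shell"]
print(visible_tools(tools, allow=["search_*", "read_*"]))
print(visible_tools(tools, deny=["delete_*", "run_*"]))

# A tool added later shows the difference between the two patterns:
# the allowlist hides it by default, the denylist exposes it by default.
tools.append("drop_table")
print(visible_tools(tools, allow=["search_*", "read_*"]))
print(visible_tools(tools, deny=["delete_*", "run_*"]))
```

With an allowlist, newly published tools stay hidden until someone approves them; with a denylist, they are exposed until someone blocks them.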
SamuelFord requests an enhancement to Azure Virtual Desktop so session hosts can be created directly into an Azure Capacity Reservation Group, matching the standard Azure VM creation experience. The post explains how today’s workaround increases quota friction, delays GPU provisioning, and adds manual post-deployment steps like host registration and Intune/Entra configuration.
Sunita_AZ0708 shares a validation study of running Siemens NX 2506 in a multi-user Azure Virtual Desktop setup on GPU-backed Azure VMs, focusing on whether a single NVIDIA RTX PRO 6000 can reliably support 30 concurrent CAD users and how performance changes with multi-host scaling.
malaikanazim announces that Azure StandardV2 NAT Gateway now supports outbound ping using ICMP Echo Request/Reply, enabling basic reachability checks and faster troubleshooting for workloads that egress through NAT Gateway without needing per-VM public IPs or extra configuration.
yairgil explains how the Azure Copilot Observability Agent in Azure Monitor helps teams investigate AKS incidents by correlating metrics, logs, traces, Kubernetes events, and change history into an evidence-backed root-cause narrative with recommended next steps.
Karl-WE breaks down June 2026 changes to Azure Local licensing and the Azure Local Solutions hardware ecosystem, including new host fee tiers (S2D, external storage/SAN, and disconnected operations) and the shift from a 3-tier to 2-tier hardware catalog model. The post also clarifies key acronyms and support implications for existing deployments.
Efrat Ben Porat announces the general availability of dynamic thresholds for Azure Monitor log search alerts, which use machine learning to learn normal behavior from historical query results and automatically adapt alert thresholds over time. The post includes practical examples for AKS pod restart spikes and Azure Resource Graph inventory drift detection.
Efrat Ben Porat announces the general availability of Simple log alerts in Azure Monitor, a new alert type that evaluates each matching log row individually and supports Basic Logs—making it easier to keep lower-cost telemetry plans while still alerting quickly on important events.
azinh17 breaks down how Azure achieved a top MLPerf Training v6.0 result for Llama 3.1 405B, training at extreme scale across 8,192 GPUs. The post focuses on the cluster and network architecture choices—NVLink scale-up domains, Azure’s MRC fabric, and topology-aware parallelism mapping—that kept step time stable as the system scaled.
Anavi Nahar rounds up Azure Databricks announcements and sessions from Databricks Data + AI Summit 2026, focusing on tighter interoperability with Microsoft’s data stack (OneLake, ADLS) and governed access via Unity Catalog, plus new integrations like the Excel add-in, SharePoint ingestion, and OneLake catalog federation.
Jamesdld23 explains how to avoid the 230-second HTTP timeout in Azure Functions by splitting long-running synchronous work into an HTTP “request” function that enqueues a message and a queue-triggered function that performs the job, with practical PowerShell and Azure CLI examples plus Entra ID-based auth hardening.
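The request/worker split described above can be sketched with an in-memory queue. This is only an illustration of the pattern using the Python standard library, not Azure Functions code: `handle_request`, `worker`, and the `results` dictionary are hypothetical stand-ins for the HTTP-triggered function, the queue-triggered function, and a status store.

```python
import queue
import threading
import uuid

jobs = queue.Queue()   # stand-in for a storage queue
results = {}           # stand-in for a job-status store


def handle_request(payload):
    """Stand-in for the HTTP function: enqueue the job and return immediately."""
    job_id = str(uuid.uuid4())
    results[job_id] = "queued"
    jobs.put((job_id, payload))
    return 202, job_id  # 202 Accepted: the work has not run yet


def worker():
    """Stand-in for the queue-triggered function that does the long job."""
    while True:
        item = jobs.get()
        if item is None:  # sentinel used to stop the demo worker
            jobs.task_done()
            break
        job_id, payload = item
        results[job_id] = sum(payload)  # placeholder for the long-running work
        jobs.task_done()


def run_demo():
    t = threading.Thread(target=worker, daemon=True)
    t.start()
    status, job_id = handle_request([1, 2, 3])
    jobs.join()      # wait until the queued job has been processed
    jobs.put(None)   # stop the worker
    t.join()
    return status, results[job_id]


print(run_demo())
```

The caller gets its response before the job runs, so the HTTP timeout no longer bounds the job's duration; the client would poll or be notified using the returned job ID.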
yalavi explains how the Azure Copilot observability agent runs “deep investigations” to troubleshoot incidents by correlating telemetry across application, infrastructure, and platform layers, and by producing an evidence-backed narrative with clear mitigations rather than a single best-guess answer.
GeertVanTeylingen outlines a zero-copy pattern for making enterprise file data usable by modern AI and analytics platforms, using Azure NetApp Files as the system of record and Microsoft OneLake shortcuts to expose that data without migration or duplication.
GeertVanTeylingen explains how to build an enterprise RAG “knowledge pipeline” that can index and retrieve file-based content in place (no copy/migration) using Microsoft OneLake, Azure AI Search, and Azure OpenAI for embeddings and grounded answers with citations.
kinfey shows how to build a cloud-native evaluation harness for Azure AI Foundry skills using Foundry Hosted Agents, combining deterministic validators, an LLM judge that returns structured JSON, and a multi-turn adversarial attacker to catch regressions and compare models side by side.
RohitMadhavKrishnan introduces ArchAngel, an educational AI coding assistant designed to bring a team’s engineering standards directly into the IDE, so junior developers get constructive feedback while they write code. The post outlines the core idea, a reference architecture, and the Microsoft-centric stack used to ground guidance in “golden repos.”
BhaktiRath95 walks through common failure modes when running AI/ML inference workloads on Azure Container Apps, including slow model startup, probe timeouts, OOM kills, and GPU initialization problems. The post provides concrete probe settings, Python/FastAPI patterns, and Log Analytics queries to diagnose and fix issues methodically.
Dirk Brinkmann shows how to turn Azure Savings Plan recommendations into defensible, hour-by-hour data by exporting the underlying PAYG usage series and alternative commitment levels from the Azure Cost Management Benefit Recommendations API, using a companion PowerShell script that outputs CSV, Markdown, and JSON files.
viviandiec announces general availability of OpenTelemetry (OTel) Guest OS metrics for Azure VMs and Arc-enabled Servers, plus an updated Azure Monitor VM experience. The post explains what metrics are available, how OTel compares to Log Analytics-based metrics, and how to use PromQL and Grafana dashboards for troubleshooting at scale.
Sokuma announces the general availability of Service Level Indicators (SLIs) and Service Level Objectives (SLOs) in Azure Monitor, outlining how teams can track customer-experience reliability with SLI authoring, SLO tracking, error budgets, and burn rate–based alerting in a single Azure Monitor workflow.
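For readers new to the terms, the arithmetic behind error budgets and burn rate can be sketched as follows. This uses the commonly cited SRE definitions and is not taken from Azure Monitor's implementation; the function names are hypothetical.

```python
def error_budget(slo_target):
    """Fraction of events allowed to fail, e.g. about 0.001 for a 99.9% SLO."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be between 0 and 1 (exclusive)")
    return 1.0 - slo_target


def burn_rate(bad_events, total_events, slo_target):
    """Observed error rate divided by the error budget.

    A value of 1.0 means the budget is being consumed exactly at the pace
    that would exhaust it at the end of the SLO window; values above 1.0
    mean it will run out early.
    """
    if total_events == 0:
        return 0.0
    observed_error_rate = bad_events / total_events
    return observed_error_rate / error_budget(slo_target)


# 20 failed requests out of 10,000 against a 99.9% SLO:
# the budget is being consumed at roughly twice the sustainable pace.
print(burn_rate(bad_events=20, total_events=10_000, slo_target=0.999))
```

Burn rate-based alerts typically fire when this ratio stays above a chosen multiple over a time window, which catches fast budget consumption without alerting on every individual failure.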
Sokuma announces the general availability of Azure Monitor Metrics Export using data collection rules (DCRs), highlighting how to continuously stream platform metrics to Azure Storage, Event Hubs, or Log Analytics with multidimensional metrics support, metric-name filtering, and typical end-to-end latency of about three minutes.
Sunita_AZ0708 explains how to run Ansys Discovery on Azure using NVads V710 v5 GPU VMs, including a reference architecture, right-sizing guidance for fractional GPUs, and validation results across fluid, thermal, and structural simulation scenarios.
Rafia Aqil explains how to diagnose and respond when Azure Databricks clusters can’t start or scale due to Azure regional VM capacity constraints, including what to send to Microsoft support, which VM families to switch to, and longer-term design choices like instance pools, serverless compute, and multi-region deployments.
ShubhamSachdeva99 explains how to switch built-in connector connections at runtime in Azure Logic Apps Standard by making the service provider action’s connectionName dynamic, enabling a single workflow to route to different SFTP/SQL/Service Bus endpoints per team or environment.
TulikaC introduces new Azure CLI commands for listing and viewing Azure App Service for Linux startup logs, making it easier to diagnose container initialization issues, runtime startup failures, warmup probe problems, and slot-specific startup behavior directly from the command line.
BhaktiRath95 breaks down why Azure Container Apps can feel “slow to start” in production, separating true cold starts from scaling delays and resource throttling. The post includes concrete fixes like minReplicas tuning, KEDA rule adjustments, probe configuration, image-size reduction, and practical .NET and Django startup optimizations backed by Log Analytics and Application Insights queries.
j_folberth explains how to deploy Azure AI Foundry Hosted Agents directly from a source-code ZIP instead of a container image, including the deployment lifecycle, an azd-based workflow, and a reusable GitHub Action that posts to the Foundry data plane and polls until the new agent version becomes active.
Mahesh Sundaram announces a public preview in Azure Monitor that lets platform teams collect Azure resource platform logs at scale using Data Collection Rules (DCRs), replacing per-resource diagnostic settings with a centralized, policy-driven model that supports governance, cost control, and modern identity-based access.
Heather Poulsen shares an optimization playbook for running agentic AI workloads in production on Azure, focusing on keeping multi-agent orchestration reliable while controlling token costs and latency. The playbook highlights practical techniques like inference routing, prompt compression, RAG tuning, caching, and FinOps-style capacity planning.
Heather Poulsen outlines a governance-first blueprint for building scalable agentic AI systems, focusing on how to embed consistent controls and quality checks across user interactions, agent orchestration, integrations, data, and models so systems can scale without losing trust and oversight.
Heather Poulsen shares an event session overview on designing Azure AI Landing Zones as a production-ready foundation for deploying AI applications and AI agents at scale, with guardrails for networking, identity, security, governance, and cost control using Microsoft’s recommended architecture frameworks.