Content by vishnu charan tj (3)

Eliminate LLM Cold starts: Load models up to 6x Faster with Azure Blob Storage and Run:AI Model Streamer

May 19, 2026 by Vishnu Charan TJ

Vishnu Charan TJ explains how streaming LLM weights directly from Azure Blob Storage into GPU memory with Run:AI Model Streamer can cut inference cold-start times by up to ~6x, reducing idle GPU spend and improving autoscaling behavior for vLLM and SGLang deployments.

News

Easily Connect AI Workloads to Azure Blob Storage with adlfs

Oct 15, 2025 by Vishnu Charan TJ

Vishnu Charan TJ explains the latest enhancements in adlfs, empowering data professionals to efficiently connect Python-based AI and ML workloads to Azure Blob and Data Lake Storage, with real-world framework integrations and best practices.

News

Protect Azure Storage Accounts with Network Security Perimeter: General Availability

Sep 8, 2025 by Vishnu Charan TJ

Vishnu Charan TJ details the general availability of network security perimeters for Azure Storage, showing how centralized network controls can secure PaaS resources and prevent data exfiltration.

Community

End of content