Sumit Das

Sumit Das

Principal Software Engineer, Snowflake

Machine Learning

How to Scale Real-Time Model Serving for Low-Latency ML Inference

Learn how to scale real-time model serving with GPU optimization, dynamic batching, and latency tuning to maximize ML inference performance

Goutam Murlidhar|Vivek Alamuri|Sumit Das

FEB 12, 2026|8 min read

MORE POSTSFROM Sumit Das

Machine Learning

Scalable Model Serving: An Architectural Overview

Learn how Snowflake ML enables scalable model serving with low-latency inference, centralized governance, and simple deployment

Pradeep Dorairaj

JAN 20, 2026|11 min read

Snowflake ML Now Supports Expanded MLOps Capabilities for Streamlined Management of Features and Models

Product and Technology

Snowflake ML Now Supports Expanded MLOps Capabilities for Streamlined Management of Features and Models

Learn how Snowflake ML is expediting time to insights.

JUN 11, 2024|6 min read

Accelerate Your Machine Learning Workflows in Snowflake with Snowpark ML

Product and Technology

Accelerate Your Machine Learning Workflows in Snowflake with Snowpark ML

Building and managing ML models is easier and faster

JAN 23, 2024|7 min read