Jeff Rasley

Senior Software Engineer
Jeff Rasley is a Senior Engineer on the Snowflake AI Research Team, specializing in AI systems optimization and infrastructure. Previously, he spent five years at Microsoft, where he co-founded the DeepSpeed training and inference library. As its top contributor, Jeff played a key role in establishing DeepSpeed as an industry-standard library. He earned his Ph.D. in distributed systems from Brown University.

MORE POSTSFROM Jeff Rasley

Arctic Long Sequence Training (ALST): Scalable And Efficient Training For Multi-Million Token Sequences

Snowflake's ALST enables scalable training of long-context models with up to 15 million tokens using Hugging Face and DeepSpeed, all without custom modeling code.
||||||||
JUN 24, 2025|10 min read
Gen AI

Arctic Inference with Shift Parallelism: The Fastest Open Source Inference System for Enterprise AI

Built by Snowflake AI Research, Arctic Inference uses Shift Parallelism, SwiftKV, and speculative decoding to power the fastest open-source enterprise AI.
||||||||
MAY 29, 2025|15 min read
Gen AI

Low-Latency and High-Throughput Inference for Long Context with Sequence Parallelism (aka Arctic Ulysses)

Ulysses, a novel sequence parallelism technique, boosts long-context LLM inference performance with 3.4x lower latency and better GPU efficiency.
|||||
APR 03, 2025|14 min read
Digital illustration of connected lines and dots in a column lined with grids
Product and Technology

SwiftKV from Snowflake AI Research Reduces Inference Costs of Meta Llama LLMs up to 75% on Cortex AI

SwiftKV optimizes Meta Llama LLMs on Snowflake Cortex AI, reducing inference costs by up to 75% while maintaining accuracy for enterprise AI solutions.
||||
JAN 16, 2025|5 min read

Where Data Does More

  • 30-day free trial
  • No credit card required
  • Cancel anytime