LLM Training
Snowflake AI Research Blogs and Publications
We believe in a thriving research community and are committed to sharing our insights as we advance AI research on the tools, systems, and algorithmic optimizations that make LLM training and inference performant yet cost-effective for everyone.
LLM Evaluation
Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA
Pretraining – System & Efficiency
TurboMoE: Enhancing MoE Model Training with Smart Kernel-Fusion and Data Transformation
LLM Deployment
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
LLM Deployment
SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference
LLM Deployment
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
Pretraining – System & Efficiency
SSDTrain: An Activation Offloading Framework to SSDs for Faster Large Language Model Training
Agentic System
ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration