join Snowflake at ACL VIENNA 2025
WHERE DATA DOES MORE.
Shaping the future of Enterprise AI through cutting-edge, open, foundational research.
28-30 july, VIENNA
Engagements at ACL Vienna
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
Explain then Rank: Scale Calibration of Neural Rankers Using Natural Language Explanations from LLMs
In search settings, calibrating the scores during the ranking process to quantities such as click-through rates or relevance levels enhances a system’s usefulness and trustworthiness for downstream users. While previous research has improved this notion of calibration for low complexity learning-to-rank models, the larger data demands and parameter count specific to modern neural text rankers produce unique obstacles that hamper the efficacy of methods intended for the learning-to-rank setting.
This talk proposes exploiting large language models (LLMs) to provide relevance and uncertainty signals for these neural text rankers to produce scale-calibrated scores through Monte Carlo sampling of natural language explanations (NLEs). Our approach transforms the neural ranking task from ranking textual query-document pairs to ranking corresponding synthesized NLEs. Comprehensive experiments on two popular document ranking datasets show that the NLE-based calibration approach consistently outperforms past calibration methods and LLM-based methods for ranking, calibration, and query performance prediction tasks.
In Case You Missed It: ARC ‘Challenge’ Is Not That Challenging
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
The vast portion of workloads employing LLMs involves answering questions grounded on PDF or scan content. We introduce the Arctic-TILT achieving accuracy on par with models 1000× its size on these use cases. It can be fine-tuned and deployed on a single 24GB GPU, lowering operational costs while processing Visually Rich Documents with up to 400k tokens. The model establishes state-of-the-art results on seven diverse Document Understanding benchmarks, as well as provides reliable confidence scores and quick inference, which are essential for processing files in large-scale or time-sensitive enterprise environments.
ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback
Findings of IWSLT 2025
Iceberg & Polaris Lakehouse
RAG App on SEC Filings
MEET THE TEAM

Rafael Massei
Head of Developer Marketing, Snowflake

Lukasz Slabinski
Senior R&D Manager, Snowflake

Lukasz Borchmann
Senior Research Scientist, Snowflake

Mateusz Chilinski
Research Scientist, Snowflake

Mateusz Krubinski
Research Scientist, Snowflake

Rafal Kobiela
Seniore Software Engineer, Snowflake

Dorota Jaworska
Director, EMEA Talent and HR Operations, Snowflake

Karolina Proczek
Site Program Manager, Snowflake

Anupam Datta
Principal Research Scientist, Snowflake

Jaeseong Lee
Research Scientist, Snowflake
BUILD WITH THE AI RESEARCH TEAM
Collaborate with us! Snowflake AI Research conducts open, foundational research to turn big questions into bigger breakthroughs. We build trusted, efficient, and easy-to-use AI that powers innovation across the Snowflake platform, the enterprise AI ecosystem, and open-source community.
GETTING STARTED WITH SNOWFLAKE
EASY. CONNECTED. TRUSTED.