GEN AI

Introducing Data Governance Skills for Cortex Code
Discover Cortex Code governance skills in Snowflake. Use AI to classify data, enforce policies and monitor access with natural language—no SQL required.
MAR 18, 2026|8 min read

Cortex Agent Evaluations: Monitor, Measure and Improve Your AI Agents on Snowflake
A comprehensive solution for validating your agent's behavior and performance using Snowflake's research-backed GPA framework to move from prototype to production with confidence.
MAR 13, 2026|6 min read

Fast and More Accurate Causal Parallel Decoding Using Jacobi Forcing
Jacobi Forcing enables fast and more accurate causal parallel decoding for autoregressive transformers, offering near-AR quality and improved token throughput.
MAR 04, 2026|10 min read

Snowflake’s Blueprint for Parsing Complex, Real-World Documents
Learn how Snowflake enables enterprise-scale document AI with reliable structure extraction, OCR robustness and cost-efficient performance.
FEB 26, 2026|9 min read

Agent World Model (AWM): Infinity Synthetic Environments for Agentic Reinforcement Learning
Open-source Agent World Model generates 1,000 SQL-backed executable environments for agentic RL with benchmark-winning results.
FEB 13, 2026|10 min read

From General-Purpose to Domain Expert: Fine-Tune LLMs Directly in Snowflake
Transform general LLMs into domain experts directly within Snowflake's security boundary. Learn how ArcticTraining and ML Jobs streamline the fine-tuning process by eliminating data movement and infrastructure management.
FEB 02, 2026|7 min read

Paradigm Shifts of the Developer Mindset in the Age of AI
Learn how generative AI is reshaping the developer mindset, accelerating iteration cycles, and changing how software is designed, built and shipped.
JAN 21, 2026|9 min read

Arctic-Extract: Compact, Efficient and State-of-the-Art Vision-Language Processing
Snowflake’s Arctic-Extract powers AI_EXTRACT with a compact 6.6 GiB model that delivers state-of-the-art document understanding for visual, multilingual, and tabular data.
DEC 10, 2025|7 min read

SuffixDecoding at Production Scale with Arctic Inference and vLLM
SuffixDecoding now delivers 1.96x–3.12x end-to-end speedups in vLLM and Arctic Inference with major CPU optimizations for fast, production-ready LLM serving.
DEC 02, 2025|9 min read

Accelerating PyTorch Innovation at Scale: Snowflake at PyTorch Conference 2025
How Snowflake tackles four core AI challenges: scaling deep learning, training thousands of models, accelerating inference, and balancing multilingual performance.
NOV 19, 2025|7 min read

What’s Your Agent’s GPA? A Framework for Evaluating AI Agent Reliability
Learn how the Snowflake AI Research team's Agent GPA framework achieved 95% error detection and 86% localization accuracy in measuring and debugging agent performance.
NOV 04, 2025|8 min read

Optimizing Query Execution in Cortex AISQL
How we made AISQL up to 8x faster and 70x cheaper through AI-aware planning, adaptive model cascading, and semantic join rewriting.
NOV 04, 2025|8 min read