“We wanted to switch to Snowpark for performance reasons and it was so easy to do. Converting our PySpark code to Snowpark was as simple as a change in an import statement.”
Principal Data Engineer
Homegenius
Snowpark is a set of libraries and runtimes that securely deploy and run Python and other programming languages in Snowflake, so you can develop data pipelines, machine learning models, apps, and more.
Single platform with native support for any programming language, including Python.
Elastic scalability without maintenance or overhead.
Consistent, enterprise-grade governance controls & security.
Write queries and transform data using familiar DataFrames.
Python library and underlying infrastructure for model development with the Snowpark ML Modeling API and management with the Snowpark Model Registry (public preview).
Write and execute custom Python, Java and Scala code using user-defined functions and stored procedures. Leverage built-in packages from the Anaconda repo.
Register, deploy and run container images (public preview) in Snowflake-managed infrastructure.
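As a rough illustration of the DataFrame and user-defined function capabilities listed above, here is a minimal sketch using the Snowpark Python library. The table name ORDERS, its columns STATUS, CUSTOMER_ID and AMOUNT, the UDF name CLEAN_EMAIL, and all three helper function names are hypothetical, chosen only for this example. It assumes you already have a Snowpark session connected to your account.

```python
from typing import Any


def clean_email(raw: str) -> str:
    """Plain Python logic that could be registered as a UDF: trim and lowercase."""
    return raw.strip().lower()


def shipped_totals(session: Any, table_name: str = "ORDERS") -> Any:
    """Build a DataFrame query over a hypothetical ORDERS table.

    The query is lazy: nothing executes in Snowflake until an action
    such as .collect() or .show() is called on the result.
    """
    # Imported inside the function so the pure-Python helper above
    # can be used and tested without Snowpark installed.
    from snowflake.snowpark.functions import col, sum as sum_

    return (
        session.table(table_name)
        .filter(col("STATUS") == "SHIPPED")       # hypothetical column
        .group_by("CUSTOMER_ID")                  # hypothetical column
        .agg(sum_(col("AMOUNT")).alias("TOTAL_AMOUNT"))
        .sort(col("TOTAL_AMOUNT").desc())
    )


def register_clean_email(session: Any) -> Any:
    """Register clean_email as a temporary UDF named CLEAN_EMAIL (hypothetical name)."""
    from snowflake.snowpark.types import StringType

    return session.udf.register(
        clean_email,
        return_type=StringType(),
        input_types=[StringType()],
        name="CLEAN_EMAIL",
        replace=True,
    )
```

With an existing session, shipped_totals(session).show() would run the aggregation in Snowflake, and the object returned by register_clean_email(session) could then be applied to a DataFrame column.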
Use Python to transform raw data into modeled formats for data pipelines
Customers see a median of 3.5x faster performance and 34% cost savings with Snowpark over managed Spark.
Transform data connected to your data lake, warehouse, or Iceberg tables in Snowflake.
Build and operationalize end-to-end ML workflows using Snowpark ML
Use Python frameworks, such as Scikit-learn and XGBoost, for preprocessing, feature engineering, and training models that can be deployed and managed in Snowflake without data movement.
Build ML models and Gen AI LLMs in any programming language, package them as container images, and deploy them on configurable CPUs and GPUs for ultimate developer flexibility.
“Being able to run data science tasks, such as feature engineering, directly where the data sits is massive. It’s made our work a lot more efficient and a lot more enjoyable.”
Data Science Lead
EDF
Try Snowflake free for 30 days and experience the Data Cloud that helps eliminate the complexity, cost and constraints inherent in other solutions.