From Data Wrangling to Feature Engineering

Every year, insights from business analytics and machine learning (ML) play a larger role in how organizations solve business problems with data. However, the insights in a business dashboard and the predictions from an ML model are only as good as the data behind them.

Building high-quality data sets is a multi-step process known as data wrangling, which includes cleaning, mapping, and transforming data into a workable format.

These activities commonly involve the following:

Merging multiple data sources into a single data set
Identifying gaps in the data (for example, empty cells in a table) and either filling or deleting them
Deleting data that is unnecessary or irrelevant to the project at hand, such as duplicate records
Identifying extreme outliers in the data
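
As a rough illustration, the four activities above can be sketched in a few lines of plain Python. This is a minimal sketch, not the ebook's method: the sample records, the `wrangle` function, the median fill strategy, and the outlier threshold `k` are all assumptions made for the example.

```python
from statistics import median

# Hypothetical sample data: two sources that share a customer_id key.
orders = [
    {"customer_id": 1, "amount": 20.0},
    {"customer_id": 2, "amount": None},   # gap to fill
    {"customer_id": 3, "amount": 25.0},
    {"customer_id": 3, "amount": 25.0},   # duplicate record
    {"customer_id": 4, "amount": 22.0},
    {"customer_id": 5, "amount": 900.0},  # extreme value
]
regions = {1: "EU", 2: "US", 3: "US", 4: "EU", 5: "APAC"}


def wrangle(orders, regions, k=3.0):
    """Merge, fill gaps, drop duplicates, and flag outliers (illustrative only)."""
    # 1. Merge: attach the region from the second source to each order.
    merged = [{**row, "region": regions.get(row["customer_id"])} for row in orders]

    # 2. Fill gaps: replace missing amounts with the median of the known ones.
    #    (Deleting the incomplete rows would be the other option.)
    fill = median(r["amount"] for r in merged if r["amount"] is not None)
    for r in merged:
        if r["amount"] is None:
            r["amount"] = fill

    # 3. Drop exact duplicate records, keeping the first occurrence.
    seen, deduped = set(), []
    for r in merged:
        key = tuple(sorted(r.items()))
        if key not in seen:
            seen.add(key)
            deduped.append(r)

    # 4. Flag extreme outliers: more than k median absolute deviations
    #    from the median. The threshold k is an assumed, tunable value.
    amounts = [r["amount"] for r in deduped]
    center = median(amounts)
    mad = median(abs(a - center) for a in amounts)
    for r in deduped:
        r["is_outlier"] = abs(r["amount"] - center) > k * mad
    return deduped


for row in wrangle(orders, regions):
    print(row)
```

The sketch flags outliers rather than deleting them, since whether an extreme value is an error or a genuine signal usually depends on the project. At scale, the same steps would typically run in a data platform or a DataFrame library rather than over Python lists.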

This ebook describes how analytics and data science teams can maximize efficiency by leveraging a cloud data platform to unify and govern both data wrangling and feature engineering activities.
