A right question is also an answer.

Aggregation is the process of performing calculations over a set of data, normally to gain its statistical figures. Splitting data into groups is a necessary step if you want to aggregate data by certain criteria, not the whole data set. This article talks about AGGREGATE FUNCTIONS and WINDOW FUNCTIONS, two main approaches to aggregate data in SQL, through an explanation section followed by a practical section with the Pokemon dataset. All technical points are based on PostgreSQL engine.

Photo by Lia Panidara on Unsplash

Ok we have two methods here. So when should we use which?

AGGREGATE FUNCTIONS +…


I always see data manipulation as Lego games — understanding small pieces will help you create your desired models!

Aggregation is a process of collecting/splitting data based on some criteria (1) and then applying a statistical method on them so that each group will be presented as a number (2). While (1) identifies the data scope, (2) does the mathematical stuff. This article is going to explain these steps in Pandas, one of the most popular library to manipulate data in Python.

Photo by Markus Spiske on Unsplash

(1) — It’s all about data

There is no much of work in this step if we want to aggregate the whole data or…

Duyen

An engineer loves writing

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store