Data mining – finding patterns – overview

Methods used for finding patterns in data: Cluster analysis – the algorithm finds groups of similar data points by examining the distance between points, density, ranges, etc. Models for cluster analysis: connectivity – organizes points based on how close they are...
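To make the connectivity idea concrete, here is a minimal sketch in Python. It assumes that "close" means a Euclidean distance no larger than a chosen threshold; the function name `connectivity_clusters` and the parameter `max_gap` are made up for this illustration and do not come from any library. Points that are linked through a chain of close neighbours end up in the same group.

```python
from math import dist  # Euclidean distance, available since Python 3.8


def connectivity_clusters(points, max_gap):
    """Group points so that points linked by steps of at most max_gap share a cluster."""
    parent = list(range(len(points)))  # union-find forest: each point starts alone

    def find(i):
        # Follow parent links to the root, shortening the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Join every pair of points that lie within max_gap of each other.
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if dist(points[i], points[j]) <= max_gap:
                parent[find(i)] = find(j)

    # Collect points by the root of their tree.
    clusters = {}
    for i, point in enumerate(points):
        clusters.setdefault(find(i), []).append(point)
    return sorted(clusters.values())


points = [(0.0, 0.0), (0.5, 0.2), (1.0, 0.1), (8.0, 8.0), (8.3, 7.9)]
groups = connectivity_clusters(points, max_gap=1.0)
for group in groups:
    print(group)
```

The pairwise loop is quadratic in the number of points, which is fine for a small illustration but would need a spatial index or a dedicated library for larger data sets.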

Database server statistics

This is probably the most common example of doing statistics with data. All modern databases compute (or can compute, depending on settings) some basic or more advanced statistics on their data. The main reason is query optimization. Relational databases depend heavily on...
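As a small illustration of statistics gathered for the query planner, the sketch below uses SQLite through Python's standard `sqlite3` module. SQLite is only an example chosen here because it needs no server; the table `orders` and the index `idx_orders_customer` are hypothetical names. Running `ANALYZE` makes SQLite store its findings in the `sqlite_stat1` table, which can then be read like any other table.

```python
import sqlite3

# In-memory database, so the example leaves nothing behind.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER)")
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")

# 1000 rows spread evenly over 10 hypothetical customers.
conn.executemany(
    "INSERT INTO orders (customer_id) VALUES (?)",
    [(i % 10,) for i in range(1000)],
)

# Ask SQLite to collect statistics for its query planner.
conn.execute("ANALYZE")

# Each row holds the table name, the index name and a text summary of the index.
stats = conn.execute("SELECT tbl, idx, stat FROM sqlite_stat1").fetchall()
for tbl, idx, stat in stats:
    print(tbl, idx, stat)
```

Other database servers expose similar information through their own commands and system views, so the details above apply to SQLite only.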

Databases for data science

These days there are many “buzz-words” around and many different ideas about what is the best tool for “my big data”. We can hear and read a lot about how “new or old or sexy” different SQL or NoSQL tools are. Names like...

Cleaning is everything

Some scientists even state that only the discovery of “cleaning”, and of cleaning tools such as the broom and the rag, really allowed civilization to arise. Before that, people had to live “on the move” or keep burning their simple wooden houses from time to time because...