Why Is Visualization Important in Data Science?

Our minds generally comprehend pictures much faster than text or raw numbers. Data visualization serves two purposes: 1- Illustration & Presentation: Minard's visualization of Napoleon's 1812 march, for example, is a cartographic depiction of numerical data: a map that shows, across several dimensions, the disastrous losses Napoleon's army suffered during the Russian campaign. 2- Exploration: Drawing some »

Using Spark For Data Exploration

Spark is actively supported by the Apache open source community and is used in production by many well-known companies. In this post, the focus is on productionizing Apache Spark. I will discuss Spark's use cases and how to enable each of them in a production environment. Currently, Spark has 2 deployment modes (client, cluster) with 3 »

Hive, a must-know tool for any data engineer

Hive is a data warehouse system built on top of Hadoop for querying and managing data sets. Who? Hive was created by Facebook and is now widely adopted by many firms, including Netflix, Facebook and Bookings. Why? Not everyone is fond of writing Java programs for every problem they have, especially data analysts. Hive provides a high level »