Querying in Hive (mmas, Oct 21, 2015): In the previous entry we created some Hive tables and put data into them; here we are going to see how to retrieve, aggregate and filter…
Loading data into Hive (mmas, Oct 15, 2015): Let’s practise with different ways to load data into Apache Hive and optimization concepts.
Data analysis with Apache Hive. A practical introduction (mmas, Oct 2, 2015): Apache Hive is a data warehousing framework for managing large datasets. Hive can be used for data analysis in a SQL-like language called…
Data analysis with Apache Pig. A practical introduction (mmas, Sep 12, 2015): Apache Pig is a platform for analyzing large datasets. With Pig you have a higher level of abstraction than in MapReduce, so you can deal…
Hadoop Streaming. Practical introduction to MapReduce with Python (mmas, Sep 11, 2015): Apache Hadoop is a framework for distributed storage and processing. I’m not going to explain how Hadoop modules work or to describe the…
Freelance invoices manager (mmas, Mar 10, 2015): I’ve started this project because I needed a tool to autogenerate and email invoices to my clients and also to keep control of which one…
Python image processing libraries performance: OpenCV vs Scipy vs Scikit-Image (mmas, Feb 16, 2015): We are going to compare the performance of different methods of image processing using three Python libraries (scipy, opencv and…
Simple web analytics with Python and Pandas (mmas, Feb 13, 2015): We are going to do some analytics with our web visits data. As a simple report we are going to obtain the unique and total visits respect…