Data Science Reading List III

And here I am again! The third week in a row I present you my readings around data science.

The Relevance Vector Machine

Nice paper on a learner I found in RapidMiner. Was kind of suprised that I never heared about it before. Not sure why i did not.

Apache Arrow

Apache Arrow is an initiative to have a binary compatible standard for in-memory processesing of data. It sounds very cool; the issue I see that there is not that much movement at the moment. The only question is — Why?

A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis

Another paper I read this week, this time again from @AYLIEN ‘s Sebastian Ruder. It gets really a habit to read Sebastian’s articles on text mining.

Mapping the communities of Wordpress

Very cool network analysis of wordpress.com blogs. I especially like the vizualisations.

A Web of Alliances

Nice Visualisation of military alliances. I rather like the Venn Diagramm, but it’s cool anyway.

Data Science of Variable Selection: A Review

Oh yeah, feature selection — my old firend. Nice review article by KDNuggets