Homepage
Open in app
Sign in
Get started
Democratizing Data
Think like an amateur, write as a professional
Archive
Web site
Follow
My blog has moved
My blog has moved
This blog has moved to https://chezo.uno/blog Please visit me there.
Aki Ariga
Dec 15, 2019
An easy way to get URL list of your Medium publication
I imported blog posts from own Wordpress but I have to redirect old articles to Medium manually. There is Wordpress plugin which enables…
Aki Ariga
May 2, 2017
sparkavro: Manupilate Apache Avro file with sparklyr
I created a simple sparklyr extension to handle Apache Avro file. It is just a simple wrapper of DataBrick’s spark-avro. It is listed in…
Aki Ariga
Mar 26, 2017
How to connect secure Impala cluster from RStudio on macOS with implyr
How to connect secure Impala cluster from RStudio on macOS with implyr
Impala is very fast SQL-on-Hadoop, and it will enhance your R experience with implyr, a dplyr based interface for Apache Impala…
Aki Ariga
Mar 25, 2017
Visualize your massive data with Impala and Redash
Visualize your massive data with Impala and Redash
Redash is a famous OSS visualization tool, which enables to visualize your data with SQL. It supports Apache Impala (incubating), fast…
Aki Ariga
Feb 11, 2017
tabula-py: Extract table from PDF into Python DataFrame
tabula-py: Extract table from PDF into Python DataFrame
(Note: Oct 7th, 2019) As of Oct. 2019, I launched a documentation site and Google Colab notebook for tabula-py. The FAQ would be good place…
Aki Ariga
Jan 9, 2017
Livy & Jupyter Notebook & Sparkmagic = Powerful & Easy Notebook for Data Scientist
Livy & Jupyter Notebook & Sparkmagic = Powerful & Easy Notebook for Data Scientist
livy is a REST server of Spark. You can see the talk of the Spark Summit 2016, Microsoft uses livy for HDInsight with Jupyter notebook and…
Aki Ariga
Dec 30, 2016
About Democratizing Data
Latest Stories
Archive
About Medium
Terms
Privacy
Teams