Brandon KvardaSparklyr (R interface for Spark) and Kerberos on ClouderaBelow is a way (but probably not the only way) to set up and run Sparklyr on a Cloudera cluster that has Kerberos enabled. This is not, in…Nov 4, 20162Nov 4, 20162
Brandon KvardaIndex Parquet with Morphlines and SolrEvery once in a while I run into something that I know is possible to do but am unable to find evidence that anyone has ever really done…Jun 11, 20164Jun 11, 20164
Brandon KvardaInstalling Spark w/ SparkR on Cloudera/CDHBelow are instructions for installing and playing around with Spark 1.6.x on a Cloudera CDH cluster so that you can play with SparkR. This…Mar 18, 20162Mar 18, 20162
Brandon KvardaBuilding a Custom Flume InterceptorFlume is a powerful tool that can be leveraged as part of a data ingestion pipeline. Reading through the docs, you can get a pretty good…Mar 15, 2016Mar 15, 2016
Brandon KvardaCreating Transient Clusters with Cloudera DirectorCloudera Director is a great tool for spinning up CDH clusters in AWS/GCE — one that I use almost daily, as I’m sure others do as well…Mar 3, 2016Mar 3, 2016
Brandon KvardaEnable Python/Scala Notebooks in HueAs I scavenged the internet for guides on enabling Spark Notebooks in Hue, I realized that there were many great sources but none that I…Jan 25, 20161Jan 25, 20161
Brandon KvardaRapidly Deploy ScaleIO in Azure in 5 ClicksI recently wrote an Azure Resource Group deployment template as well as custom script resource extensions that can be found here. Here’s a…Jun 23, 2015Jun 23, 2015