Cloud Dataflow can autoscale R programs for massively parallel data processing
Lakshmanan V
123

I have not tried it, but installing R packages with the second approach (calling R through rpy2) seems to be only a matter of running one extra CUSTOM_COMMAND (or several ones) that will install the required R packages.

Scripting the installation of R packages on the command is done in rpy2’s Dockerfile. In essence, it is looking like:

R -e ‘install.packages(“r-package-to-install”), repos=”CRAN-mirror”)'
One clap, two clap, three clap, forty?

By clapping more or less, you can signal to us which stories really stand out.