rbahaguejr – Medium

rbahaguejr

rbahaguejr

Building data products without the Mesh

In 2019, distributed data mesh has been offered as a route for companies to leverage data at scale. The next generation enterprise data…

Feb 26, 2022

Building data products without the Mesh

Feb 26, 2022

rbahaguejr

Serve Data Models with MLFlow in Production

Serve data models using MLFlow

Feb 14, 2019

Serve Data Models with MLFlow in Production

Feb 14, 2019

rbahaguejr

Data Science for the 99% Courses

I’m developing an offline course for local activists for data science applications in the context of campaigns on social issues.

Apr 13, 2017

Data Science for the 99% Courses

Apr 13, 2017

rbahaguejr

Threaded Tasks in PySpark Jobs

There are circumstances when tasks (Spark action, e.g. save, count, etc) in a PySpark job can be spawned on separate threads. Doing so…

Apr 6, 2017

Threaded Tasks in PySpark Jobs

Apr 6, 2017

rbahaguejr

Spark in a Box: Making Apache Spark Accessible to Small Enterprises and Academic Institutions

Apache Spark is a powerful tool for data processing. However, it is becoming a restrictive tool available only to Big Enterprises…

Mar 8, 2017

Mar 8, 2017

rbahaguejr

Word cloud on Python

Last year our Vice President resigned from the Cabinet. She was appointed “Housing Czar” but the President is not able to trust her.

Feb 15, 2017

Word cloud on Python

Feb 15, 2017

rbahaguejr

Window Function on PySpark

Here’s how to get the least value of col5 for a group:

Feb 8, 2017

Feb 8, 2017

rbahaguejr

Adding Python Files to PySpark Job

There are varying suggestions on how to do this on SO. However, the pointers are creating more frustrations even for us familiar with…

Jan 26, 2017

Jan 26, 2017

rbahaguejr

Support the Relief and Rehabilitation Efforts in Catanduanes, Philippines Due to Typhoon Nina…

My first post for the year is a call for support on the on-going rehabilitation efforts in areas devastated by Typhoon Nock-Ten/Nina.

Jan 7, 2017

Support the Relief and Rehabilitation Efforts in Catanduanes, Philippines Due to Typhoon Nina…

Jan 7, 2017

rbahaguejr

How-to Declutter Your Data Science Workspace

Working on a data science project is almost always equivalent to an amazing clutter in the working directory. Data scientists would most…

Nov 11, 2016

How-to Declutter Your Data Science Workspace

Nov 11, 2016

rbahaguejr

rbahaguejr

Data Scientist | Free Software Developer and Advocate, Debian & Ubuntu user. Contact: rbahaguejr2 (at) gmail (dot) com

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams