Homepage
Open in app
Sign in
Get started
trivial.io
Data Science, Machine Learning, Statistics, AI.
Follow
Stream Processing in pysparkling
Stream Processing in pysparkling
pysparkling is a native Python implementation of PySpark. Stream processing is considered to be one of the most important features of…
Sven Kreiss
Mar 11, 2017
Databench v0.4
Databench v0.4
Short summary of the Databench v0.4 release.
Sven Kreiss
Jan 16, 2017
word2vec on Databricks
word2vec on Databricks
Example that processes a text corpus with word2vec in Spark on Databricks.
Sven Kreiss
Jan 16, 2017
Parallel Processing with pysparkling
Parallel Processing with pysparkling
Introducing parallel processing in pysparkling and benchmarking speedup.
Sven Kreiss
Jan 16, 2017
pysparkling Talks
pysparkling Talks
Collection of links to talks.
Sven Kreiss
Jan 16, 2017
pysparkling
pysparkling is a native Python implementation of the interface provided by Spark’s RDDs.
Sven Kreiss
Jan 16, 2017
Wildcardians on Twitter
Wildcardians on Twitter
Twitter network graph from few API calls.
Sven Kreiss
Jan 16, 2017
Collaborative Statistical Modeling
Collaborative Statistical Modeling
A poster made for the opening of NYU’s Center for Data Science.
Sven Kreiss
Jan 16, 2017
About trivial.io
Latest Stories
Archive
About Medium
Terms
Privacy
Teams