Kaleidos, a local open source company in Madrid, Spain hosts a #PiWeek — biannual open HackWeek event which I had a chance to attend this year, supported by Máximo Cuadros and a bunch of saved OSDs at source{d}.

One of the teams I know, introduced by Xaviju, is working with in-browser WebSpeech API for building a voice assistant for cooking.

As a person, keen on engineering aspects of applied Machine Learning, I took a quick peek under the hood of 2 major browsers to understand better how does it actually work and what is the implementation of this W3C API…


Applying ML for finding patterns in source code is a cutting-edge research topic in academia and industry known as ML on Code. But there can not be any ML without data and that is where FLOSS software projects present a trove of opportunities.

The goal of applying ML to source code is building better developer tooling that potentially might change the way we write software. That, by necessity, would affect FLOSS projects by applying automation that takes away “boring parts” and reduces the time that maintainers spend on their projects.

That is why it is very important to preserve the…


Good news everyone!

A Call for Participation for the FOSDEM 2019 DevRoom on “Machine Learning on Code” is now officially open! 🎉🎉🎉

Last year at FOSDEM 2018 my colleagues from source{d} and I stepped up and helped with organizing the first (half-day) Source Code Analysis DevRoom track. For a small initiative it was a great success: for half a day, people from multiple programming languages, tooling, parser-generator communities got together to share and discuss all kinds of automation that is used for managing the ever growing complexity and amount of source code in industry and FLOSS projects alike.

If you…


This is a cross-post, article originally published at source{d} blog here.

ML on Code is a rapidly developing field, both in academia and industry, that source{d} was set out to systematically explore throughout the last years.

So far the results published by our Data Retrieval, Machine Learning, and Infrastructure teams who collect and store millions of Git repositories were based on large-scale applications of advanced NLP techniques such as: Identifiers Embedding, Topic Modeling and sequence model for Identifier Splitting. …


A recent paper with empirical research on application of static code analysis tool caught my attention:

Paper announcement tweet, by one of the authors

It’s been a while since I started reading more papers. Following awesome initiatives like Papers We Love, where with Victoria Bondarchuk we’ve created a chapter in Seoul, and The Morning Paper — I always wanted to start experimenting with publishing notes. This will be the first attempt.

“Lessons from Building Static Analysis Tools at Google” by Caitlin Sadowski, Edward Aftandilian, Alex Eagle, Liam Miller-Cushon, Ciera Jaspan presents 2 stories: history of failed attempts of integrating FindBugs


Although last year’s GSoC Mentors Summit, a 2 days un-conference event hosted by Google at San Jose, California happened a while ago, I’d like publish this draft that was written on the plane back to Seoul.

Google Summer of Code mentors summit is an annual event, and it’s been 12th consequent year for hundreds of people who worked as mentors in open source projects during the summer to get together. …


On May 17–19 second annual tech conference J On The Beach took place in Malaga, Spain.

Venue

The conference took place at “La Térmica”, an awesome contemporary space — a former hospital, rebuilt to serve as a home for a civic center, museum, co-working space, library.


This is the second year for us at Apache Zeppelin with Google’s Summer of Code program, under umbrella of Apache Software Foundation. It may not be a lot, but we hope that by reflecting on our experience other communities, new to the program, can learn how to leverage this program more efficiently.

Apache Zeppelin logo

Apache Zeppelin is a young project. It has started in 2012 as internal product at ZeppelinX as GUI for BigData solutions and got open sourced in 2013.

One challenge of building a diverse, meritocratic community of developers is to attract new talents to the project, and early on…


Apache Zeppelin at ApacheCon BigData North America 2016

This May city of Vancouver saw a biggest get-together of Apache Software Foundation communities in North America.

Keynote at ApacheCon BigData NA 2016, Image by Linux Foundation

It is the 5th time that Apache Software Foundation (ASF) throws its annual events together with Linux Foundation and as usually the event was laid out as 2 subsequent conferences “ApacheCon” and “Apache Big Data”, both held at Hyatt Regency Vancouver to the highest possible standards. This year I had an opportunity to join it on behalf of Apache Zeppelin project and below is a short summary with few takeaways from the “Apache BigData” part.


FOSSAISA, the biggest opensource conference in Asia have been held in Singapore over the last weekend, May 18–21 2016.

FOSSASIA’16 participants. Image by Michael Cannon

More than 1000 people from all over the world, representing all biggest opensource communities came to make over 200 talks at the great venue of Singapore Science Center.

As in last year, there were many local enthusiasts and volunteers who made it happen and it was a great pleasure to meet them once again, as well as to notice that the conference, being held in Singapore for only the second year in a row, did substantially grew in size and diversity.

Seoul Engineer

Data Engineer, Apache Zeppelin PMC

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store