HPI Polyglot Programming Seminar
In the summer term 2019, we ran a seminar on Polyglot Programming as part of our graduate program at the Hasso Plattner Institute in Potsdam, Germany. The main goal of the seminar was to explore the domain of polyglot programming using various GraalVM technologies and GraalSqueak, our Squeak/Smalltalk implementation for GraalVM. This helped us to better understand both benefits and problems of polyglot programming. Since Programming Experience is one of our key research topics, we focused on language interoperability and the tools that GraalVM provides. To gain more insights on both, we encouraged our students to discuss how they are using GraalVM and to share their lessons learned throughout the seminar. Seven student teams participated in the seminar and worked on their projects over the course of four months. In this blog post, we summarize the results and highlight some of those projects.
Polyglot Jupyter Kernel
Jupyter notebooks are very popular tools for data analysis and machine learning as they allow documentation of code, text, as well as (intermediate) results in the form of tables, plots, and other rich media in a single document.
In previous work, we built PolyJuS, a polyglot notebook system written in Squeak/Smalltalk. This system can not only make use of GraalVM’s Polyglot API. More importantly, it helps the user to understand the interaction between objects and data from different programming languages.
The goal of the student project was twofold: build a GraalVM polyglot kernel for Jupyter and offer a similar experience to PolyJuS in conventional Jupyter notebooks. Here’s what our students came up with:
To help users keep track of the objects and data shared between languages, our students also forked the Variable Inspector from Jupyter notebook extensions and turned it into the Polyglot Inspector. Similar to PolyJuS’ explorer for polyglot bindings, this inspector can be used to explore, which variables are (automatically) shared and therefore are available in all code cells. We believe this makes it easier to understand what kind of objects are shared and which messages they respond to.
If you would like to give IPolyglot a try, here’s how you can run it on Docker.
Code Editor for Polyglot Programming
Code editors are essential tools for programming. They support developers with useful features such as code completion, syntax highlighting, and sometimes integrations of specific APIs.
As part of this project, the students built a code editor in GraalSqueak, which integrates the Polyglot API. Here is what they came up with:
Switching to another programming language within the same code base means switching to a different syntax and also to different semantics. We decided to use colors to visualize this type of change in the Polyglot Code Editor. In addition, the editor prompts for user input when using a functionality provided by the Polyglot API. Although this API is available in all officially supported GraalVM languages, the actual ways to interact with it are slightly different (due to language constraints for example). The editor, however, always prompts the user in the same way for the same functionality. It then automatically generates the API call in the corresponding language and may even update imports if necessary. It also keeps track of what variables have been exported to GraalVM’s polyglot bindings and lists them when the user requests an import.
The implementation of our editor is polyglot as well: For example, it uses a Ruby library for syntax highlighting and Python for managing line endings.
Helping Developers Find the Right Code
One key idea of polyglot programming is the ability to always use the right language, framework, library, or tool for the job. However, that assumes that the developer already knows the right thing for the job. To address this problem, a student team worked on a tool that helps developers find code more efficiently on StackOverflow. Here is a demo of what that looks like:
The Polyglot Code Finder tool allows the user to search for code snippets written in languages supported by the underlying GraalVM on StackOverflow. It is integrated into the code editor and into our PolyJuS notebook system. In the code editor, the tool automatically creates new code boxes, and new code cells in the notebook system. Similar to the code editor, the code finder tool is polyglot: Its UI is written in Squeak/Smalltalk while the backend uses Python and Ruby for searching, validating, and cleaning code snippets. Imagine having a tool like this in the IDE or editor of your choice with support for StackOverflow, GitHub, and language documentations!
Benchmarking the Oracle Database MLE
As part of our seminar, a student team benchmarked Oracle Database Multilingual Engine (MLE) using two different algorithms taken from real-world applications: Elo is a chess rating system, which calculates scores of players based on their previous score and a random variable. Therefore, this benchmark causes a lot of reads and writes when used in a database setup, but is relatively low on CPU usage. Available-to-promise (ATP) is a business algorithm for determining quantities and delivery dates for order enquiries. For this, the algorithm needs to read all required data once to be able to calculate the result. The calculation, however, is based on backtracking. Therefore, this benchmark has relatively low database usage, but high demands on the CPU.
Here is a summary of what our students measured:
Our students found that MLE is more efficient than the NodeJS setup in the Elo benchmark. We believe this is due to the fact that the overhead introduced by a network connection can be avoided.
In the ATP benchmark, however, MLE performed significantly worse than NodeJS. After contacting the MLE team, they are looking into what appears to be a performance problem in an MLE-specific layer. It is great that our students identified this problem as part of their project and hence made a valuable contribution that helps to evaluate and improve the performance of MLE.
Rendering Squeak/Smalltalk Tools in Java Swing
Polyglot programming enables developers to re-use software libraries and frameworks across languages. In this project, our students experimented with using a different user-interface (UI) framework in GraalSqueak. Traditionally, Squeak/Smalltalk uses ToolBuilder, a core component following the builder pattern to construct tools from a specification. The default UI framework in Squeak/Smalltalk is called Morphic and provides Morphs as basic UI elements similar to div containers in HTML. The MorphicToolBuilder builds most Squeak/Smalltalk tools including workspace, code browser, and debugger using Morphs. As part of our seminar, the students implemented a JavaToolBuilder which takes the same tool specifications, but builds them with Java Swing. Here is a demo of what that looks like:
Just like in the original Smalltalk-80 specification, Morphs are still blitted onto a display bitmap. The rendering machinery therefore does not benefit from today’s GPU acceleration. This is especially a problem on high-resolution displays that can display significantly more pixels than the original Xerox Alto. JavaToolBuilder, on the other hand, allows us to render Squeak/Smalltalk tools in native windows without having to implement a new graphical backend. We believe this is another example of what’s possible with polyglot programming and GraalVM in the future.
As we hoped, our students had lots of fun exploring GraalVM and polyglot programming as part of the seminar. Apart from finding and also fixing interoperability bugs in GraalVM languages, they produced very diverse and interesting results: A GraalVM-based Jupyter kernel, a code editor for polyglot programming, a tool to find code on the web, UI framework experiments, and MLE benchmarks. Moreover, we identified various usability problems (e.g. polyglot dependency management) and developed first ideas for how GraalVM could address them in the future.
At this point, we would like to thank Mario Wolczko and the GraalVM team for their support and feedback throughout the course of the seminar as well as our students for their outstanding work. We look forward to the next steps toward advancing polyglot programming!