CodeX
Published in

CodeX

CODEX

Science Shorts #2: Bengali Character Recognition, Perceptible Colour Maps & Python Newsletters

Scope

The Nvidia Machine Learning (ML) Grandmasters took part in the Kaggle competition for the character recognition of the World’s fifth most popular native language, Bengali. The team address some of the “unwritten” rules of the language in tuning there models. Rather than choose either the default colour scheme from your favourite plotting library or personal preference, a more scientific basis can be found in the Colorcet library. Finally, two Python based newsletter that I recently subscribed to.

Introduction

The following three articles were randomly selected from my Pocket list, which I’ve curated over the past 5 years in the field of Data Science; the motivations and background are discussed in a previous post: Data Science Shorts: An Introduction to my Pocket List.

Bengali Character Recognition

Nvidia article on Bengali Kaggle Challenge | Screenshot by Author | Article and Artwork by Nvidia

Summary

Between December 2019 and March 2020, Kaggle ran the Bengali.AI Handwritten Grapheme Classification challenge. The article describes the challenges with the training set data and subsequent strategies to mitigate the short comings. The highest position for the team was fifth but most members finished in the top 30, which is very impressive. The approach to the learning rate was as critical as the choice of model.

Context

For many people (including me), the majority of contact with Machine Learning begins (and ends with) the excellent sci-kit learn, keras and pyTorch. What this blog shows, is some of the “art” involved with fine tuning existing models for new applications. At the same time, shows some of the privilege that English speakers enjoy in the level of research conducted into our shared language.

Modern Colour Maps for Plots with Colorcet

Colorcet Library | Screenshot by Author | Article and Artwork by Colorcet

Summary

The two images below compare similar colour maps, hot from matplotlib and its alternative from Colorcet called fire using 256 colour gradient:

Hot colormap | Image by Colorcet
Fire colormap | Image by Colorcet

Context

We’ve all been there when that plot doesn’t quite look right, or we’ve spent hours choosing our categorical colours in a line plot or bar chart. If you’ve experimented with colour customisation, then you know it’s an easy productivity trap to fall into. The reason is that’s easy to get caught up in the moment and lose track of the original purpose.

Python Newsletters

Pycoders and Real Python Newsletters | Screenshot by Author | Article and Artwork by Respective Parties

Summary

Recently I’ve come across two newsletters that have introduced me to new concepts in Python, pandas and the related Data Science ecosystem. First of all PyCoders, which is an excellent resource that covers the wide range of Python applications. The second is of course the Real Python newsletter PyTricks, which focuses on Python language snippets.

Context

I enjoy the code snippets from Real Python:

PyTricks Email from Real Python | Screenshot by Author | Content from Real Python

Conclusion

Three varied topics from my Pocket list. The first shows the depth of experience required and the empirical nature of tuning existing Machine Learning models for new but similar applications in the field of handwritten character recognition. The second article shows that using a library that has researched the use of effective colour schemes can potentially enhance any visualisation. Given that for any Data Scientist, communication of the results is a critical element of the role, Colorcet should be the default. It should be noted that many tools within the PyViz ecosystem have already adopted these maps. Finally, two sources for Python news and the trade-off with sponsored material.

--

--

Everything connected with Tech & Code. Follow to join our 1M+ monthly readers

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Ashraf Miah

Data Scientist and Chartered Aeronautical Engineer (MEng CEng EUR ING MRAeS) with over 15 years experience in the Aerospace, Defence and Rail Industry.