Annotating Images & Markdown

Label images and text in their original context

By Juan Miguel Cejuela👐Open Link to this article

A recent update of tagtog, gave support to annotate Markdown files. Therefore now images, nested lists, or code blocks are fully supported. This opens many new possibilities for annotation. Let’s focus on 3.

Image for post
Image for post
Image 1: Annotating News with Images & Markdown on

1. Annotate in Context 👁 (+ it just looks better 💅)

What would you prefer to annotate? this?

NLP has different sources of bias: 1. The selection of the training data. 2. The biases of the annotators. 3. The inductive bias of the model. 4. How the task is designed overall. @eurnlp #eurnlp

or this?

Image for post
Image for post
Image 2: Tweet in its original context; better suitable for annotation

How about this? Note that some of the text’s meaning is lost without is complementing image (Image 3).

Image for post
Image for post
Image 3: Tweet’s text full meaning is lost without the image

For clarification, tagtog doesn't show tweets yet in their original formatting. We will have soon a specific presentation mode for tweets and you can follow our updates on Twitter @tagtog_net🐦. However, nicely formatted news like this post's header (Image 1), for example, are now perfectly possible.

2. Label Images 🖼

Moreover, several other NLP tasks are concerned with matching text to images. Take for instance the NLVR dataset on tagtog (Image 4), from the original NLRV dataset. In this dataset, the task was to determine whether the caption text (e.g. "There is a box with a blue circle, a black circle and a black square”) correctly described, true or false, the presented image. These types of annotations are now supported on tagtog.

Image for post
Image for post
Image 4: Labeling images and text associated to an image.

3. Annotate Markdown Documentation 🅜📝

CommonMark Logo
CommonMark Logo
CommonMark, the de-facto specification for Markdown.

How about annotating security flaws in code shared in StackOverflow? (Image 5).

Image for post
Image for post
Image 5: finding security threats in code posted in StackOverflow.

We will review the many possibilities for annotating code (and verbatim-like-styled text) in a future post.

In the meantime, hope you liked this one!

What would you annotate with markdown? 🤔

Need training data for #NLP? Find & create it for free on: 🍃tagtog

Are you on Twitter? 🐦Follow @tagtog_net🐦

👏👏👏 Clap if you like the post, and want to share it with others! 🧡💚

Written by

The text annotation tool to train #AI. Easy. 🔗

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store