How to annotate video data for object detection with Diffgram

Published in

Diffgram

4 min readOct 4, 2018

Annotating video data for computer vision can be a challenging task. At Diffgram we have Video and Image Object Detection tools to help you create the highest quality AI Data and perpetually update it.

** This article was published in 2018 (~5 years ago — a lot has changed. For the latest updates see Diffgram.com or Docs.)

One challenge can be collaborating with other people. Most visual intelligence projects are the work of teams of people. For example one person may collect data, while many annotate, and others build the machine learning system.

Here we walk through a practical example of how to annotate a video with Diffgram. Here’s the end result:

1. Create a new project

Data is isolated by project in Diffgram. Create a new project to try out using it.

Click “Projects”
Click “New”
Choose a name
Click “Create”

2. Upload a video

For tips on data set collection please see How to create AI ready data for Object Detection.

Click “Upload”
Drag and drop or click in side the box to use file selector
Click Done
Wait for images to process

Supported video formats are:

.mp4, .mov, .avi

Upload your video. Diffgram supports files up to 1 GB.

Once upload is complete files show up in the Media browser, on the Annotate page.

Large videos may take several minutes to process — click refresh to see changes.

3. Create a label

In order to map the meaning onto your data you need to create a label. A label represents the meaning behind what is in the data.

4. Create a video annotation

Click on the label “car” to open the Sequence navigator

Draw a box around all the objects in the image (for example all the cars).

Tip: Use the “speed” control to slow down playback. This is especially useful for high frame rate videos.
Play the video.
Stop the video after a few frames.
Annotate the objects, using the thumbnail images in the Sequence Navigator to help. For example, the car with id #1 is the same car throughout the whole video.

5. Interpolate

Knowing it’s the same car allows for point to point interpolation between frames. This means you annotate key frames, ie every 5th or 10th frame, and get the annotations in between filled in for you.

After you are finished annotating click “Interpolate”.
Now we can go back to the start of the video and review our work.

6. Invite teammates to help you annotate

Annotating often requires subject matter experts, such as Professionals, Engineers and Doctors, etc. You can invite teammates to help you annotate, upload data, and construct machine learning models.