Bag-of-words representation for video channels’ semantic structuring

Sileye Ba
Oct 1, 2018 · 5 min read

Bag-of-words Representation

Examples of bag-of-words representations of a video gaming channel and a hip-hop music channel, displayed as word clouds. The more often a word appears in the metadata of a channel’s videos, the more it stands out.
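As a rough illustration of the idea behind these word clouds, the sketch below counts word occurrences over the metadata of one channel's videos. It is a minimal sketch under stated assumptions, not the pipeline used at Dailymotion: the tokenizer, the tiny stop-word list, the helper names (`tokenize`, `channel_bag_of_words`, `to_vector`), and the toy metadata are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real system would use a fuller, per-language list.
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "to", "for", "on", "with"}


def tokenize(text):
    """Lowercase the text and extract runs of letters as word tokens."""
    return re.findall(r"[^\W\d_]+", text.lower())


def channel_bag_of_words(video_metadata):
    """Count word occurrences over all metadata strings of one channel."""
    counts = Counter()
    for text in video_metadata:
        counts.update(t for t in tokenize(text) if t not in STOP_WORDS)
    return counts


def to_vector(bag, vocabulary):
    """Project a bag of words onto a fixed, ordered vocabulary."""
    return [bag.get(word, 0) for word in vocabulary]


# Toy metadata (titles) for an imaginary gaming channel.
gaming_channel = [
    "Minecraft survival gameplay episode 1",
    "Fortnite gameplay: best moments of the week",
    "Minecraft building tips",
]

bag = channel_bag_of_words(gaming_channel)
vocabulary = sorted(bag)  # here the vocabulary is just this channel's words
vector = to_vector(bag, vocabulary)
```

In a word cloud, the counts in `bag` would drive the font size of each word. To compare channels, all of them would be projected onto one shared vocabulary so that their vectors have the same length.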

Clustering Channels with K-means

1. Initialization: among the samples, randomly select K centroids to represent the K clusters.
2. Iterate until convergence: assign each sample to the cluster of the closest of the K centroids, then update each centroid to be the barycenter of the samples in its cluster.
Examples of cluster centroids displayed as word clouds. The clusters shown are mostly composed of channels about movies (top left), automotive (top right), beauty (bottom left), and cooking (bottom right).
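The two K-means steps listed above can be sketched in a few lines of plain Python. This is an illustrative sketch only: the function names, the convergence test (stop when assignments no longer change), the handling of empty clusters, and the toy two-dimensional samples are assumptions made for the example. Real channel vectors would be high-dimensional bag-of-words counts.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def barycenter(points):
    """Coordinate-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(coords) / n for coords in zip(*points)]


def kmeans(samples, k, max_iter=100, seed=0):
    rng = random.Random(seed)
    # Step 1: initialization, K centroids picked at random among the samples.
    centroids = [list(s) for s in rng.sample(samples, k)]
    assignments = None
    for _ in range(max_iter):
        # Step 2a: assign each sample to the cluster of its closest centroid.
        new_assignments = [
            min(range(k), key=lambda c: squared_distance(s, centroids[c]))
            for s in samples
        ]
        if new_assignments == assignments:
            break  # assignments are stable: converged
        assignments = new_assignments
        # Step 2b: move each centroid to the barycenter of its cluster.
        for c in range(k):
            members = [s for s, a in zip(samples, assignments) if a == c]
            if members:  # assumption: an empty cluster keeps its old centroid
                centroids[c] = barycenter(members)
    return centroids, assignments


# Toy data: two well-separated groups of points.
samples = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]]
centroids, labels = kmeans(samples, k=2)
```

On this toy data the first three samples end up in one cluster and the last three in the other. Which cluster gets index 0 depends on the random initialization.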

Channel Embedding with Neural Auto-Encoders

Two-dimensional t-SNE display of video channels, labeled by category, after neural auto-encoder embedding. Channels from the same category are mapped next to each other in the embedding space. Channels are colored according to their languages: blue for English, green for French, and red for other languages.
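To make the notion of an auto-encoder embedding concrete, here is a minimal sketch of a one-hidden-layer auto-encoder written in plain Python. The architecture used for the figure above is not described here, so everything in this example is an assumption: the tanh encoder, the linear decoder, the absence of bias terms, plain stochastic gradient descent, the hyperparameters, and the toy normalized word-count vectors. The t-SNE projection step is not shown.

```python
import math
import random


def train_autoencoder(samples, hidden_size, epochs=500, lr=0.05, seed=0):
    """Train a tiny auto-encoder; return (encode, loss_before, loss_after)."""
    rng = random.Random(seed)
    d = len(samples[0])
    # Encoder weights (hidden_size x d) and decoder weights (d x hidden_size).
    w_enc = [[rng.uniform(-0.5, 0.5) for _ in range(d)] for _ in range(hidden_size)]
    w_dec = [[rng.uniform(-0.5, 0.5) for _ in range(hidden_size)] for _ in range(d)]

    def encode(x):
        # Low-dimensional embedding of an input vector.
        return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in w_enc]

    def decode(h):
        # Linear reconstruction of the input from its embedding.
        return [sum(w * hj for w, hj in zip(row, h)) for row in w_dec]

    def loss():
        # Mean squared reconstruction error over all samples.
        total = 0.0
        for x in samples:
            y = decode(encode(x))
            total += sum((yi - xi) ** 2 for yi, xi in zip(y, x))
        return total / len(samples)

    loss_before = loss()
    for _ in range(epochs):
        for x in samples:
            h = encode(x)
            y = decode(h)
            err = [yi - xi for yi, xi in zip(y, x)]
            # Back-propagate the error through the decoder and the tanh.
            grad_h = [
                sum(err[i] * w_dec[i][j] for i in range(d)) * (1 - h[j] ** 2)
                for j in range(hidden_size)
            ]
            for i in range(d):
                for j in range(hidden_size):
                    w_dec[i][j] -= lr * err[i] * h[j]
            for j in range(hidden_size):
                for i in range(d):
                    w_enc[j][i] -= lr * grad_h[j] * x[i]
    return encode, loss_before, loss()


# Toy normalized word-count vectors for four imaginary channels.
channels = [
    [1.0, 0.8, 0.0, 0.0],
    [0.9, 1.0, 0.1, 0.0],
    [0.0, 0.1, 1.0, 0.9],
    [0.0, 0.0, 0.8, 1.0],
]
encode, loss_before, loss_after = train_autoencoder(channels, hidden_size=2)
embeddings = [encode(x) for x in channels]
```

Each entry of `embeddings` is a two-dimensional code for one channel. In a realistic setting the hidden layer would have many more units, and a projection method such as t-SNE would then map the codes to two dimensions for display.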

Dailymotion

The home for videos that matter

Thanks to Anne-So.

Written by

Sileye Ba

Senior data scientist at Dailymotion, interested in models allowing machines to infer semantics from videos and their metadata.
