Amazon DSSTNE and SageMaker — Part 11

Rakesh4real

Follow

Published in

Fnplus Club

2 min readJul 29, 2019

--

Deep Scalable Sparse Tensor Neural Engine(DSSTNE)

Amazon open-sourced its recommender engine called DSSTNE, which makes it easy to apply deep neural networks to massive, sparse data sets and produce great recommendations at large scale.

Works excellently well with sparse data
No code required!
Open source

How to use?

Convert data into a specific format
Write config file

There is no code involved! All you have to do is write the config file as shown as above and remember some terminal commands.

In the above config file,

‘Kind’ is set to feed forward auto encoder. Specifically, Sparse Encoders.
Here, sparsity is constraints on hidden layer to force find patterns even if there are less number of hidden nodes
‘SparsenessPenality’ defines above metioned ‘constraints’. More details can be found at Andrew Ng’s CS294A (Sparse Autoencoders)

A point to remember is that here, if user has rated a movie, it is taken as binary value 1. Not matter what the rating is. And if user hasn’t rated any movie, binary value 0 is alloted. This approach works surprisingly well!

You may try the same approach as above in RBMs aswell.

Amazon’s DSSTNE is available in github. Use linux if planning to use GPU for training purpose.
Data is needed in NetCDF format. Arrange data such that every row must have single user followed by list of rating inputs for that user.
First row of the data file is taken as first input node
Amazon gives AWK script