Users’ Context of Use through Clustering Algorithms

  • eliminated outliers;
  • grouped by session and ordered by timestamp;
  • counted main events in the clickstreams (page visited, main CTAs, etc.);
  • ran a kPCA to extract five principal components;
  • run a k-means clustering algorithm with 6 cluster (I will discuss later why the number 6);
  • analyzed results.
  • HOME_PAGE: is the number of visualizations of the home page;
  • PHARMACY_PAGE: is the number of visualizations of the page of details of the vendor;
  • SEARCH_RESULTS: is the number of visualizations of the search results page;
  • PRODUCT_PAGE: is the number of visualizations of the product detail page;
  • MORE_RESULTS_CTA: is the number of time the visitor used the “more results” CTA;
  • ADDCART_CTA: is the number of time the visitor added a product to the cart.
Radar graph representing the mean values of each event in the last five clusters.
Table representing the number of sessions in each cluster and the most frequent starting page.
  • Cluster 5 AKA “Explorative search”: this cluster of visitors has its peak values in PRODUCT_PAGE (bigger than~1 event) and ADDCART_CTA (bigger than~1 event) plus the did few exploration using the internal search engine or the more results cta; they landed on the precise page from an external search engine and then tried to buy the product;
  • Cluster 4 AKA “Search based navigation starting from home”: the visitors in this cluster have landed to the home page (HOME_PAGE ~1 event) and then navigated the site mainly using the search engine (SEARCH_RESULTS ~1 event);
  • Cluster 3 AKA “Direct navigation”: this group is very interesting; they landed on a SEARCH_RESULTS (~2, probably they are recurrent users with saved url) and the navigated the site mainly through the internal search engine (SEARCH_RESULTS ~1 events); few part of the group added product to the cart (ADDCART_CTA ~0.0625 events); the group is the nearest to the “direct” search model;
  • Cluster 2 AKA “Product page visit”: this cluster is similar to cluster 5 but they did not add the product to the cart then they preferably left the site; they have their peak value in PRODUCT_PAGE (~2 events);
  • Cluster 1 AKA “Pharmacy page visit”: this cluster is similar to cluster 1 but they did not add the product to the cart then they preferably left the site; they have their peak value in PRODUCT_PAGE (~2 events);

https://rightright.me

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Embeddings at eCommerce

Faster Deep Learning: Optimal DNN Primitives

Machine Learning Evaluation Metrics

Predicting Credit Card Approvals

Captcha recognition — on cAInvas

Top NLP(Natural Language Processing) Projects Using Python (Includes links to Repository on Github)

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Giuseppe Sorrentino

Giuseppe Sorrentino

https://rightright.me

More from Medium

A/B testing for non-statisticians.

Polls 101: A Kickstart Guide to Knowing Your Customers and Increasing Conversions on Your Website

Framework for conducting A/B Testing.

Centring your service user in data collection

Image from above of a man standing in the middle of the centre circle of an asphalt basketball court