From Visualising high-dimensional datasets using PCA and t-SNE in Python by Luuk Derksen

*“t-Distributed stochastic neighbor embedding (t-SNE) minimizes the divergence between two distributions: a distribution that measures pairwise similarities of the input objects and a distribution that measures pairwise similarities of the corresponding low-dimensional points in the embedding”.*
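
The quantity this quote describes can be sketched in a few lines of NumPy. This is a simplified illustration, not the full t-SNE algorithm: it assumes a single fixed Gaussian bandwidth `sigma` for the input similarities (real t-SNE tunes one bandwidth per point from a perplexity setting), and the function names are made up for this example. It only evaluates the divergence for a given embedding and does not optimize it.

```python
import numpy as np

def pairwise_sq_dists(Z):
    # Squared Euclidean distance between every pair of rows in Z.
    sq = np.sum(Z ** 2, axis=1)
    D = sq[:, None] + sq[None, :] - 2 * Z @ Z.T
    return np.maximum(D, 0.0)  # clip tiny negatives caused by rounding

def input_similarities(X, sigma=1.0):
    # Gaussian similarities between the high-dimensional input objects.
    # Assumption: one global sigma instead of per-point perplexity tuning.
    P = np.exp(-pairwise_sq_dists(X) / (2 * sigma ** 2))
    np.fill_diagonal(P, 0.0)
    return P / P.sum()

def embedding_similarities(Y):
    # Student-t (one degree of freedom) similarities between embedded points.
    Q = 1.0 / (1.0 + pairwise_sq_dists(Y))
    np.fill_diagonal(Q, 0.0)
    return Q / Q.sum()

def kl_divergence(P, Q, eps=1e-12):
    # KL(P || Q): the divergence between the two similarity distributions.
    mask = P > 0
    return float(np.sum(P[mask] * np.log(P[mask] / (Q[mask] + eps))))

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 10))  # 50 objects in 10 dimensions
Y = rng.normal(size=(50, 2))   # a random (unoptimized) 2-D embedding
P = input_similarities(X)
Q = embedding_similarities(Y)
kl = kl_divergence(P, Q)
print(f"KL divergence of the random embedding: {kl:.3f}")
```

An optimizer would move the rows of `Y` to push this value down. In practice a library implementation such as scikit-learn's `sklearn.manifold.TSNE` handles that.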

From Visualising high-dimensional datasets using PCA and t-SNE in Python by Luuk Derksen

…s a technique for reducing the number of dimensions in a dataset whilst retaining most information. It is using the correlation between some dimensions and tries to provide a minimum number of variables that keeps the maximum amount of variation or information about how the original data is distributed. It does not do this using guesswork but using hard mathematics and it uses something known as the eigenvalues and eigenvectors of the data-matrix. These eigenvectors of the covariance matrix have the property that they point along the major direc…
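
The eigenvector idea in this excerpt can be shown with a minimal NumPy sketch. It assumes the standard formulation (centre the data, take the eigen-decomposition of the covariance matrix, keep the directions with the largest eigenvalues). The function name `pca_eig` and the toy data are invented for illustration and do not come from the article.

```python
import numpy as np

def pca_eig(X, n_components):
    # Centre each column so the covariance matrix describes variation around the mean.
    X_centered = X - X.mean(axis=0)
    cov = np.cov(X_centered, rowvar=False)

    # eigh is meant for symmetric matrices; it returns eigenvalues in ascending order.
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]  # largest variance first
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]

    components = eigvecs[:, :n_components]  # directions of greatest variance
    explained = eigvals[:n_components] / eigvals.sum()
    return X_centered @ components, explained

# Toy data: the second column is almost a multiple of the first (strong correlation),
# the third column is independent noise.
rng = np.random.default_rng(0)
t = rng.normal(size=200)
X = np.column_stack([t, 2 * t + 0.1 * rng.normal(size=200), rng.normal(size=200)])

projected, explained = pca_eig(X, n_components=2)
print(projected.shape)  # (200, 2)
print(explained)        # share of total variance kept by each component
```

Because two of the three columns are strongly correlated, two components retain nearly all of the variance in this toy dataset, which is the "fewer variables, most of the information" trade-off the quote describes.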

From Why Selling Data Science Is Hard by Peter Natterer

So how can a data science delivering team constantly bring value to their customer? Each step of a data science delivery process should include at least one deliverable where you show the actual efforts and work that went into this process step. For example, when you do two days of initial data cleansing, write down each step of your methodol…

From BERT Explained: State of the art language model for NLP by Rani Horev

In the field of computer vision, researchers have repeatedly shown the value of transfer learning — pre-training a neural network model on a known task, for instance ImageNet, and then performing fine-tuning — using the trained neural network as the basis of a new purpose-specific model. In recent years, researchers have been showing that a similar technique can be useful in many natur…

From For the love of god, don’t use .npmignore by Jeff Dickey

However, what you probably don’t know is that my little action of adding the npmignore file actually causes npm to now consult that file *instead* of the gitignore files. This is a major issue—**I’ve now leaked all my AWS credentials out to the public just by adding this .npmignore to hide my test directory.**