PinnedPan CretaninTowards Data ScienceResponsible Concurrent Data RetrievalStrategies on how to throttle the data retrieval rate using PubChem chemical safety data as a use case14 min read·Jul 29, 2022--1--1
PinnedPan CretaninTowards Data ScienceMachine Learning On A Large ScaleA demonstration using binomial and multinomial logistic regression in PySpark12 min read·Jun 18, 2022--1--1
PinnedPan CretaninTowards Data SciencePySpark or pandas? Why not both?The whole is greater than the sum of the parts10 min read·May 22, 2022--4--4
PinnedPan CretaninTowards Data ScienceA Primer On PySpark Window FunctionsMany capabilities waiting to be discovered8 min read·May 27, 2022----
PinnedPan CretaninLevel Up CodingA Short Introduction To PySpark Window FunctionsAvoid slow and convoluted code4 min read·May 26, 2022----
Pan CretaninTowards Data ScienceFrom Adaline to Multilayer Neural NetworksSetting the foundations right23 min read·Jan 9, 2024----
Pan CretaninTowards Data ScienceFrom the Perceptron to AdalineSetting the foundations right11 min read·Nov 28, 2023--3--3
Pan CretaninTowards Data ScienceClassification With Rosenblatt’s PerceptronThe “hello-world” of machine learning8 min read·Sep 9, 2023----
Pan CretaninTowards Data ScienceStatistical Experiments With ResamplingBootstrapping and permutation tests14 min read·Aug 2, 2023--2--2
Pan CretaninTowards Data ScienceRecursive Chemical ReactionsAlgorithmic analysis of chemical structures using RDKit8 min read·Mar 7, 2023----