Pinned: Probabilistically determining the number of components for unsupervised learning
An excursion into Bayesian Nonparametrics for changepoint modelling
Oct 11, 2022

Pinned: Weeding out noise with ML
Using XGBoost for improving data preprocessing and semi-automating user data selection
Jul 29, 2022

blechRNN: Denoising and Latent Space Discovery for Multivariate Time Series with Autoregressive…
Time series data is ubiquitous, spanning fields from finance and healthcare to sensor networks and industrial process monitoring. However…
Feb 24

🤖 Leveraging AI Agents for Automated GitHub Issue Response: The Blech GitHub Bot
In the ever-evolving landscape of software development, managing open-source projects effectively has become increasingly challenging. As a…
Jan 20

Offloading the heavy-lifting to AWS Lambda with Docker
In a previous post (link), I talked about creating a personal, searchable database for scanned documents using Optical Character…
Oct 22, 2024

Let AI do the work a.k.a. Rapid deployment using Streamlit apps
In many academic labs, despite our accelerating utilisation of computers, I have seen that a lot of important documentation still happens…
Jun 24, 2024

Automating Data-Processing Pipeline Testing using Prefect 2.0
For open-source contributions to work, someone needs to make sure that new additions to the code-base don’t break anything else. This is…
Mar 15, 2024

Supervised (a.k.a. Task-specific) dimensionality reduction
We often encounter data with more features/dimensions than we know what to do with. Many of them are correlated (even collinear), many of…
Feb 19, 2024

OverlySpecificGPT — Creating your custom LLM Chatbot with Retrieval Augmented Generation
The rapid rise of Large Language Models (LLMs) has been both very exciting and slightly concerning. While it raises questions like “Will…
Jan 12, 2024