Vinayak Shanawad – Medium

Vinayak Shanawad
in
DataDrivenInvestor

How to handle a Million Vector Embeddings in the RAG Applications

Explore the use of PGVector in managing a large dataset for RAG applications and the challenges we faced along the way

Mar 25

Vinayak Shanawad
in
DataDrivenInvestor

Slashing Python Docker build times in half with uv

uv: Python packaging in Rust

Feb 16

Vinayak Shanawad
in
DataDrivenInvestor

Beyond Relational Databases: Taming LLM and Transformer Embeddings with PGVector

From Text to Insights: An End-to-End Guide to PGVector

Feb 13

Vinayak Shanawad

Build an ML Pipeline (Part 2) — Model Registration and Serving with MLflow and KServe

Seamless Model Deployment: MLflow and KServe Collaboration

Dec 4, 2023

Vinayak Shanawad

Build an ML Pipeline (Part 1) — Getting Started with Kubeflow V2 Pipelines

Discovering Kubeflow: Launching into ML Pipelines

Dec 4, 2023

Vinayak Shanawad

Serving Hugging Face Transformers: Optimizing Custom Model Deployment with Seldon Core

Transforming Deployment: Seldon Core Meets Hugging Face

Sep 14, 2023

Vinayak Shanawad

From Dev to Production: Deploying HuggingFace BERT with KServe

The Future of NLP Deployment: BERT Models and KServe in Action

Sep 11, 2023

Vinayak Shanawad

Data Analysis at Warp Speed: Explore the World of Polars

Empowering Data Scientists and Engineers with Lightning-Fast Data Analysis and Transformation Capabilities

Jul 1, 2023

Vinayak Shanawad

Say Goodbye to Costly BERT Inference: Turbocharge with AWS Inferentia2 and Hugging Face…

Achieve 2–3ms inference speed and high throughput for text classification tasks

Jun 1, 2023

Vinayak Shanawad

Monitoring and Saving AWS SageMaker Inference Expenses

Tips and Tools for Effective Monitoring and Savings

Apr 30, 2023

Vinayak Shanawad

Machine Learning Engineer | 3x Kaggle Expert | MLOps | LLMOps | Learning, improving and evolving.
