From the Edge to the Cloud and Back Again

Tim Spann
9 min read · Aug 2, 2024


Milvus, Edge AI, Vector Database, MQTT, Kafka, Zilliz Cluster, Python

Not connected to the cloud at all in this one.

In today’s environment, no one has time to wait for AI and similarity-search results to arrive, especially in potentially network-challenged environments. You need CPU, GPU, RAM, storage, LLM inference, and vector database search instantly ready, because in the real world things happen instantly. Your camera is receiving frames that must be processed, checked, analyzed, and reacted to. If there is an obstacle, we must avoid it now, not in five seconds.

Quick Demo

Remember, over 80% of the data in the world is unstructured, and you need to store, search, and process it. You will have text, documents, images, audio, video, logs, sensor readings, and more.

Why Even Use a Vector DB?

  • High-Performance Search
  • CRUD Operations: Just like traditional databases, vector databases allow you to Create, Read, Update, and Delete data.
  • Data Freshness: Vector databases ensure your data remains up-to-date, reflecting the latest information for accurate searches.
  • Persistence: Your data is securely stored and persists even if the system restarts.
  • Availability: Your data is readily accessible for search and retrieval operations.
  • Scalability: Vector databases can handle growing data volumes efficiently.
  • Data Management: Vector databases provide tools to manage your data effectively, including data ingestion, indexing, and querying.
  • Backup and Migration: Create backups of your data for disaster recovery and easily migrate your data between different systems.
  • Cloud or On-Premise Deployment: Vector databases can be deployed easily on various platforms, including cloud and on-premise environments.
  • Observability: Monitor the health and performance of your vector database to ensure optimal operation.
  • Multi-tenancy: Support multiple users or applications accessing the same database instance securely.
  • So Many Indexes: Support for more than 15 index types, including popular ones like Hierarchical Navigable Small Worlds (HNSW), PQ, Binary, Sparse, DiskANN, and GPU indexes.

Edge AI Use Cases

  • Robots
  • Smart Cities
  • Smart Factories
  • Autonomous Cars
  • Automated Retail
  • Smart Home

Local Search on Edge Devices

  • Proprietary Document Search
  • On-Device Object Detection
  • Milvus Lite on Device

Why Even Use a Vector DB on the Edge?

  • Cloud, Docker, Standalone or On-Premise Deployment: Send vectors and other fields to a local, remote, or cloud Milvus.
  • Instant Local Search: access local unstructured data for fast search and local applications.
  • Secure Local Data
  • No Network Necessary: Especially for autonomous robots and vehicles. Make instant local decisions.
  • Local RAG to Supercharge Edge AI: Enhance local image, audio, video, and text data with local LLMs, for example Ollama on a Raspberry Pi, for generative AI at the edge.
  • Local Live Video
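The "local RAG" bullet above can be sketched as a small prompt-assembly step: take the hits returned by a local Milvus search and stitch their text into a grounded prompt for a local LLM. This is a hypothetical sketch, not code from the demos; the hit shape mirrors what pymilvus returns when searching with `output_fields=["text"]`, and the Ollama call in the comment is illustrative.

```python
# Hypothetical "local RAG" step: stitch Milvus search hits into a
# prompt for a local LLM (e.g., one served by Ollama on the device).

def build_rag_prompt(question: str, hits: list) -> str:
    """Combine retrieved text chunks into a single grounded prompt."""
    # Each hit carries its stored fields under "entity" (pymilvus style).
    context = "\n".join(hit["entity"]["text"] for hit in hits)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )

# On the device, the prompt would then go to a local model, for example:
#   ollama.chat(model="llama3", messages=[{"role": "user", "content": prompt}])
```

Keeping retrieval and generation both on-device is what makes this work with no network at all.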

So let’s build an Edge AI App.

The first step is to pick which edge device we are running on. Hardware is a big decision, based on requirements, budget, and availability.

Since the Olympics are going on, I will show you the Gold, Silver and Bronze level devices you can run (based on TOPS).

NVIDIA AGX Jetson Orin

Gold — NVIDIA Jetson AGX Orin — 275 TOPS, 2048-core, 64 GB RAM

Slack output
Docker Compose with Attu Showing Collection
Orin Output Results from BLiP Image Captioning of Web Camera Image
Milvus on Orin Vector Search with Milvus + Attu Display
Milvus on Zilliz Cloud Data Preview for Orin Edge Output

As you can see, we can run locally with Milvus Lite or Milvus on Docker, or send to the cloud, all by changing just a few parameters.
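Those "few parameters" are essentially the connection URI (plus a token for Zilliz Cloud). A minimal sketch of how an app might switch tiers; the cloud endpoint here is a placeholder you would replace with your own:

```python
def milvus_connection_args(target: str, token: str = "") -> dict:
    """Map a deployment tier to MilvusClient connection kwargs."""
    if target == "lite":
        # A bare file path makes pymilvus use embedded Milvus Lite.
        return {"uri": "edge_demo.db"}
    if target == "docker":
        # Milvus standalone started via Docker Compose on the device.
        return {"uri": "http://localhost:19530"}
    if target == "cloud":
        # Zilliz Cloud: your own endpoint plus an API token.
        return {"uri": "https://YOUR-ENDPOINT.zillizcloud.com", "token": token}
    raise ValueError(f"unknown target: {target}")

# The rest of the application stays the same, e.g.:
#   client = MilvusClient(**milvus_connection_args("lite"))
```

Keeping the choice behind one function means the capture, embedding, and search code never needs to know where the vectors land.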

NVIDIA Jetson Xavier NX

Silver — NVIDIA Jetson Xavier NX, 21 TOPS, 384-core, 8 GB RAM

Despite being a previous-generation NVIDIA edge device, the Xavier NX performs quite well! We run image captioning on web camera images. We have since moved this demo to the new Orin.

Like all of our devices, it can communicate with Milvus Lite, Milvus on Docker, a Milvus cluster in the cloud or on Kubernetes, or a local edge server with ease. We can also upgrade to Zilliz Cloud by just changing the URL and adding a token.

Raspberry Pi 5 + AI Kit for Pose Estimation Demo

Bronze — Raspberry Pi 5, 13 TOPS, 4-core, 8 GB RAM

output to Slack channel
Jupyter Notebook
Zilliz Cloud Display of Collection
Pose Estimation Search Results

We can easily run this on a small inexpensive device and send the results to Slack and Milvus. This makes for easy distributed unstructured data applications.

The source code for all of these applications, and some older ones, is available below.

Here is a sample of what a simple Edge AI application using Milvus Lite and Python can look like.

PYTHON INSTALLATION

We install the Milvus Python SDK and the Milvus Lite package, which also gives us the command-line tool we need for backups.

pip3 install pymilvus
pip3 install milvus-lite

MILVUS-LITE BACKUP / EXPORT

milvus-lite dump -d XavierEdgeAI.db -p /home/nvidia/nvme/AIM-XavierEdgeAI/backup/ -c XavierEdgeAI

Dump collection XavierEdgeAI’s data: 100%|████████████████| 33/33 [00:00<00:00, 188.54it/s]

Dump collection XavierEdgeAI success

Dump collection XavierEdgeAI’s data: 100%|████████████████| 33/33 [00:00<00:00, 127.16it/s]

Milvus-Lite to the Cloud

For many use cases we will want to distribute our local data to another computer, cluster or cloud. We could do that at the same time, in a batch, on a delay or at some other time.

  • Milvus-Lite Dump/Export to Cloud Import at some interval
  • Dual Ingest to local and other location concurrently
  • Switch to Cloud Only
  • Send JSON via Kafka / Pulsar / MQTT
  • Unstructured Data to MinIO, S3 or Cloud Object Storage
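The "dual ingest" option above can be sketched as a local-first write with best-effort cloud replication: the edge write happens first so local search stays instant, and rows that fail to reach the remote cluster are queued for replay. The two client objects stand in for two `MilvusClient` instances (Lite and cloud); the function and names are illustrative, not from the demo code.

```python
def dual_ingest(row, collection, local_client, remote_client, retry_queue):
    """Local-first insert with best-effort remote replication."""
    # Edge search must work immediately, so the local write comes first.
    local_client.insert(collection_name=collection, data=[row])
    try:
        remote_client.insert(collection_name=collection, data=[row])
    except Exception:
        # Network-challenged moment: keep the row to replay later.
        retry_queue.append(row)

def flush_retries(collection, remote_client, retry_queue):
    """Replay queued rows once connectivity returns."""
    while retry_queue:
        remote_client.insert(collection_name=collection, data=[retry_queue.pop(0)])
```

The same shape works for the batch and delayed options: only the trigger for `flush_retries` changes.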

SLIDES

SOURCE

EDGE HARDWARE SPECS

275 TOPS, 2048-core NVIDIA Ampere architecture GPU with 64 Tensor Cores, 12-core Arm® Cortex®-A78AE v8.2 64-bit CPU 3MB L2 + 6MB L3, 2x NVDLA v2, Vision Accelerator 1x PVA v2, 64GB 256-bit LPDDR5 204.8GB/s, 64GB eMMC 5.1

21 TOPS, 384-core NVIDIA Volta™ architecture GPU with 48 Tensor Cores, 6-core NVIDIA Carmel Arm®v8.2 64-bit CPU 6MB L2 + 4MB L3, 8GB 128-bit LPDDR4x 59.7GB/s, 16GB eMMC 5.1

https://www.raspberrypi.com/products/ai-kit/

13 TOPS of inferencing performance, Single-lane PCIe 3.0 connection running at 8Gbps. Broadcom BCM2712 2.4GHz quad-core 64-bit Arm Cortex-A76 CPU, with Cryptographic Extension, 512KB per-core L2 caches, and a 2MB shared L3 cache, 8GB LPDDR4X-4267 SDRAM, VideoCore VII GPU, supporting OpenGL ES 3.1, Vulkan 1.2.

REAL-WORLD EVENTS

Aug 13, 2024: Unstructured Data Meetup NYC

Aug 15, 2024: AI Camp NYC

Sept 24, 2024: Unstructured Data Meetup NYC

WEBINAR

RESOURCES

Milvus Uses Kafka

Star Us On GitHub and Join Our Discord!

If you liked this blog post, consider starring Milvus on GitHub, and feel free to join our Discord! 💙



Tim Spann

Principal Developer Advocate, Zilliz. Milvus, GenAI, Big Data, IoT, Deep Learning, Streaming, Machine Learning, NiFi, Kafka. https://www.datainmotion.dev/