Image Processing with Custom Python and NiFi 2.0

Tim Spann

6 min readMar 13, 2024

Apache NiFi, Image Processing, BLIP, HuggingFace, Transformers, Python, Image Captioning

Overview of Apache NiFi Data Flow

Example Flow for Processing with all the image processors

Detail flow for sending output to Discord and Slack

Step 1: CaptionImage

Image (JPG, PNG, GIF) is input.

Output is original image and caption attribute.

Step 2: FacialEmotionsImageDetection

Image (JPG, PNG, GIF) is input.

Output is original image and label# attribute and score# attribute.

label1
neutral
label2
angry
label3
happy
label4
sad
label5
fear
score1
0.19129344820976257
score2
0.18247057497501373
score3
0.16521458327770233
score4
0.16067346930503845
score5
0.1489986777305603

Step 3: RESNetImageClassification

Image (JPG, PNG, GIF) is input.

Output is original image and classificationlabel attribute

Step 4: NSFWImageDetection

Image (JPG, PNG, GIF) is input.

Output is original image and normal and nsfw attributes with scores.

Step 5: Route on NSFW Status of Image

Step 6: Create Discord and Slack Message (UpdateAttribute)

Step 7: Build new JSON File (this is required for Discord)

Step 8a: Send to Discord

Step 8b: Send to Slack

CaptionImage

The first processor I have added to assist with Image processing and analytics is the CaptionImage processor that utilizes HuggingFace Transformers and Salesforce BLIP model.

Here is the source code for the new CaptionImage processor.

https://github.com/tspannhw/FLaNK-python-processors/blob/main/CaptionImage.py

See this article for additional information and a use case:

Building an LLM Bot for Meetups and Conference Interactivity

Apache NiFi, LLM, GenAI, Slack Bot, Python, Vector Stores, ChatGPT, Chat

medium.com

Example Output

caption

someone holding a cell phone with a cat in the background

The best part of this processor is the image is not lost or changed, we just add an attribute for caption.

Sorry BLIP, it’s actually a radiation detector.

This second image was done better.

caption

there is a man standing on a stage with a microphone

For all of my new Python processors I put together a quick realistic workflow and recorded it. Let’s take a look at all of this in action.

FacialEmotionsImageDetector

The second processor is for Facial Emotions Image Detector.

FLaNK-python-processors/FacialEmotionsImageDetection.py at main · tspannhw/FLaNK-python-processors

Many processors. Contribute to tspannhw/FLaNK-python-processors development by creating an account on GitHub.

github.com

This processor extracts Facial Emotions and returns them as attributes.

RESNetImageClassification

The third processor is Res-Net 50 Image Classification.

FLaNK-python-processors/RESNetImageClassification.py at main · tspannhw/FLaNK-python-processors

Many processors. Contribute to tspannhw/FLaNK-python-processors development by creating an account on GitHub.

github.com

Output from this processor is the attribute classificationlabel.

NSFWImageDetection

This is the fourth processor to detecting NSFW images.

FLaNK-python-processors/NSFWImageDetection.py at main · tspannhw/FLaNK-python-processors

Many processors. Contribute to tspannhw/FLaNK-python-processors development by creating an account on GitHub.

github.com

Output to Slack

Image Analysis ==== NiFi 
On Date: ${date}
File Name: ${filename}
uuid : ${uuid}
Caption: ${caption}
Message Channel: ${messagechannel}
User uploaded: ${messagerealname} ${messageusername}
MSG Timestamp: ${messagetimestamp}
Time Zone: ${messageusertz}
mime-type: ${mime-type}
Title: ${title}
Classification RES-NET: ${classificationlabel}
  Label: ${label1}  Score: ${score1:trim():toDecimal():multiply(100)}
Label 2: ${label2}  Score: ${score2:toDecimal():multiply(100)}
Label 3: ${label3}  Score: ${score3:toDecimal():multiply(100)}
Label 4: ${label4}  Score: ${score4:toDecimal():multiply(100)}
Label 5: ${label5}  Score: ${score5:toDecimal():multiply(100)}
 Normal: ${normal:toDecimal():multiply(100)}
   NSFW: ${nsfw:toDecimal():multiply(100)}
=====

Slack Input of Images To Analyze

Output from NiFi to Slack

As you can see we send all the fields we filled with attribute values plus the attached JSON Flow File.

Output from NiFi to Discord

OTHER NEW PYTHON PROCESSORS

Building a Library of Python Processors

Apache NiFi has a large palette of processors to handle everything from ingest of REST Feeds, Databases, CDC, sFTP…

medium.com

Yet Another Python Processor

Python, Apache NiFi 2.0.0-M2, Data Cleansing, Data Preparation, Pre-Vectorization

medium.com

RESOURCES

Salesforce/blip-image-captioning-large · Hugging Face

We're on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

microsoft/resnet-50 · Hugging Face

We're on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

dima806/facial_emotions_image_detection · Hugging Face

We're on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

ResNet

We're on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

GitHub - keremberke/awesome-yolov5-models: Easy to use pretrained yolov5 models

Easy to use pretrained yolov5 models. Contribute to keremberke/awesome-yolov5-models development by creating an account…

github.com

transformers/examples/pytorch/image-classification at main · huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. …

github.com

Mixtral: Generative Sparse Mixture of Experts in DataFlows

Mixtral-8x7B-Instruct-v0.1

medium.com

AI Augmented DevRel Part 1

Apache NiFi, HuggingFace, BLIP, Image Captioning, Vision, AI, ML, DL

medium.com

Building an LLM Bot for Meetups and Conference Interactivity

Apache NiFi, LLM, GenAI, Slack Bot, Python, Vector Stores, ChatGPT, Chat

medium.com

Real-Time Slack Bots Powered By LLM and DataFlows

Utilizing WatsonX.AI LLM Foundation Models with Cloudera DataFlow via Apache NiFi sending, receiving and processing…

medium.com

Image Processing with Custom Python and NiFi 2.0

Overview of Apache NiFi Data Flow

CaptionImage

Building an LLM Bot for Meetups and Conference Interactivity

Apache NiFi, LLM, GenAI, Slack Bot, Python, Vector Stores, ChatGPT, Chat

Example Output

FacialEmotionsImageDetector

FLaNK-python-processors/FacialEmotionsImageDetection.py at main · tspannhw/FLaNK-python-processors

Many processors. Contribute to tspannhw/FLaNK-python-processors development by creating an account on GitHub.

RESNetImageClassification

FLaNK-python-processors/RESNetImageClassification.py at main · tspannhw/FLaNK-python-processors

Many processors. Contribute to tspannhw/FLaNK-python-processors development by creating an account on GitHub.

NSFWImageDetection

FLaNK-python-processors/NSFWImageDetection.py at main · tspannhw/FLaNK-python-processors

Many processors. Contribute to tspannhw/FLaNK-python-processors development by creating an account on GitHub.

Output to Slack

Slack Input of Images To Analyze

Output from NiFi to Slack

Output from NiFi to Discord

OTHER NEW PYTHON PROCESSORS

Building a Library of Python Processors

Apache NiFi has a large palette of processors to handle everything from ingest of REST Feeds, Databases, CDC, sFTP…

Yet Another Python Processor

Python, Apache NiFi 2.0.0-M2, Data Cleansing, Data Preparation, Pre-Vectorization

RESOURCES

Salesforce/blip-image-captioning-large · Hugging Face

We're on a journey to advance and democratize artificial intelligence through open source and open science.

microsoft/resnet-50 · Hugging Face

We're on a journey to advance and democratize artificial intelligence through open source and open science.

dima806/facial_emotions_image_detection · Hugging Face

We're on a journey to advance and democratize artificial intelligence through open source and open science.

ResNet

We're on a journey to advance and democratize artificial intelligence through open source and open science.

GitHub - keremberke/awesome-yolov5-models: Easy to use pretrained yolov5 models

Easy to use pretrained yolov5 models. Contribute to keremberke/awesome-yolov5-models development by creating an account…

transformers/examples/pytorch/image-classification at main · huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. …

Mixtral: Generative Sparse Mixture of Experts in DataFlows

Mixtral-8x7B-Instruct-v0.1

AI Augmented DevRel Part 1

Apache NiFi, HuggingFace, BLIP, Image Captioning, Vision, AI, ML, DL

Building an LLM Bot for Meetups and Conference Interactivity

Apache NiFi, LLM, GenAI, Slack Bot, Python, Vector Stores, ChatGPT, Chat

Real-Time Slack Bots Powered By LLM and DataFlows

Utilizing WatsonX.AI LLM Foundation Models with Cloudera DataFlow via Apache NiFi sending, receiving and processing…

Written by Tim Spann