Published in


OpenAI’s unCLIP Text-to-Image System Leverages Contrastive and Diffusion Models to Achieve SOTA Performance

Contrastive vision-language models such as OpenAI’s CLIP (Contrastive Language–Image Pre-training, 2021) have garnered much attention in the computer vision research community thanks to their impressive capabilities in zero-shot learning and learning robust representations of images that capture both…




We produce professional, authoritative, and thought-provoking content relating to artificial intelligence, machine intelligence, emerging technologies and industrial insights.

Recommended from Medium

ICML 2019 | Google, ETH Zurich, MPI-IS, Cambridge & Share Best Paper Honours

Optimising for beauty - the erasure of heterogeneity and an appeal for a Bayesian world view

YOLO Is Back! Version 4 Boasts Improved Speed and Accuracy

Classifying NLP Data for the Service Industry: Comparing 2 Subreddits

Analisa Topik Data Media Sosial Twitter menggunakan Latent Semantic Analysis (3)

How to get background blur using Deep Learning?

Image stacking and signal quality

Andromeda galaxy image with pixel values

Inference Attacks

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store


AI Technology & Industry Review — | Newsletter: | Share My Research | Twitter: @Synced_Global

More from Medium

Google Builds Language Models with Socratic Dialogue to Improve Zero-Shot Multimodal Reasoning…

OpenAI’s DALL·E 2 ! Text-to-Image Generation Explained

PaLM on my forehead: Not another large language model??

DALL-E 2 vs Disco Diffusion