DVC — Data Version Control
Post-modern AI data stack
Peeking at up-and-coming data processing infrastructure
Daniel Kharitonov
Sep 19
You Do the Math: Fine Tuning Multimodal Models (CLIP) to Match Cartoon Images to Joke Captions
Multimodal models like CLIP have opened up new AI use cases by connecting complex objects like images to text descriptions that are easy to…
Dave Berenbaum
Sep 11
Scalable PDF document processing with DataChain and Unstructured.io
Extract and parse text from documents and create vector embeddings in a scalable, distributed way (in fewer than 70 lines of code)
Tibor Mach
Sep 4
Announcing DataChain
Author: Dmitry Petrov
DVC
Jul 22
Dataset Factory: A Tool Chain for Computer Vision Datasets
The fast proliferation of analytical and Generative AI solutions is driving requirements for data versioning and data curation to the next…
Jenifer De Figueiredo
Mar 20
Running DVC on a SLURM cluster
Introduction
DVC
Mar 10