A process of prompt engineering, software engineering, trial and error, and elbow grease — Unlike tabular data, datasets for computer vision tasks are unstructured — think gobs of pixels, heaps of labels, bags of tags, and some sometimes-structured metadata. …