Foundation Models
YOLO-World
YOLO-World is a zero-shot object detection model that allows you to perform object detection without any training, just by describing the items you want to detect.
CLIP
CLIP is trained on a vast amount of internet text and images, which lets it understand images and text together and associate them in a semantically meaningful way. It is available through the Roboflow API and on-device using Roboflow Inference.
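To make "associate them in a semantically meaningful way" concrete, here is a minimal sketch of how CLIP-style embeddings are typically compared: an image and several text prompts are each mapped to a vector, and the prompt whose vector has the highest cosine similarity to the image vector is taken as the best match. The vectors and the `best_match` helper below are hypothetical toy values for illustration only. They are not produced by CLIP and this is not the Roboflow API.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def best_match(image_embedding, text_embeddings):
    """Return the text prompt most similar to the image, plus all scores.

    Hypothetical helper: in practice the embeddings would come from a
    CLIP model rather than being written by hand.
    """
    scores = {
        prompt: cosine_similarity(image_embedding, vector)
        for prompt, vector in text_embeddings.items()
    }
    return max(scores, key=scores.get), scores


# Toy 3-dimensional embeddings (assumed values; real CLIP embeddings
# have hundreds of dimensions).
image_embedding = [0.9, 0.1, 0.2]
text_embeddings = {
    "a photo of a dog": [0.8, 0.2, 0.1],
    "a photo of a car": [0.1, 0.9, 0.3],
}

label, scores = best_match(image_embedding, text_embeddings)
print(label)
```

The same comparison works in the other direction (one text prompt against many image embeddings), which is the basis for using this kind of model for image search.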
OCR
Use DocTR to turn words and text within images into machine-readable text.