Zero-shot image classification means a system can make judgments about the contents of an image without the user first having to train it on what to look for. Watch it in action with this online demo, which uses WebGPU to run CLIP (Contrastive Language–Image Pre-training) in the browser on input from an attached camera.
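For anyone wondering what "zero-shot" looks like in practice, here is a minimal sketch of the scoring step CLIP-style models use: the image and each candidate text label are embedded into the same vector space, the cosine similarities are scaled and passed through a softmax, and the highest-scoring label wins. This is not the demo's code. The function names are hypothetical, the three-dimensional vectors are made-up stand-ins for real encoder outputs, and the `logit_scale` of 100 is only an assumed, typical value for the scaling factor.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def zero_shot_classify(image_embedding, label_embeddings, logit_scale=100.0):
    """Return a probability for each label given one image embedding.

    label_embeddings maps a text prompt to its embedding. In a real system
    both embeddings would come from the model's image and text encoders;
    here they are supplied by the caller.
    """
    img = normalize(image_embedding)
    logits = {}
    for label, emb in label_embeddings.items():
        txt = normalize(emb)
        cosine = sum(a * b for a, b in zip(img, txt))
        logits[label] = logit_scale * cosine

    # Numerically stable softmax over the label logits.
    top = max(logits.values())
    exps = {label: math.exp(value - top) for label, value in logits.items()}
    total = sum(exps.values())
    return {label: value / total for label, value in exps.items()}


# Toy embeddings (assumed values, not real CLIP outputs).
label_embeddings = {
    "a photo of a cat": [0.9, 0.1, 0.0],
    "a photo of a dog": [0.1, 0.9, 0.0],
    "a photo of a coffee mug": [0.0, 0.1, 0.9],
}
image_embedding = [0.8, 0.2, 0.1]

probs = zero_shot_classify(image_embedding, label_embeddings)
best = max(probs, key=probs.get)
print(best)
```

The labels are just text, so changing what the classifier looks for means editing the prompt list rather than retraining anything.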
This is a companion discussion topic for the original entry at https://hackaday.com/2024/05/20/try-image-classification-running-in-your-browser-thanks-to-webgpu/