Our Raspberry Pi deployment option runs directly on your devices in situations where you need to run your model without a reliable Internet connection.
Task Support
The following task types are supported by the hosted API:
You will need a Raspberry Pi 4 (or Raspberry Pi 400) running the 64bit version of Ubuntu. To verify that you're running a compatible system, type arch into your Raspberry Pi's command line and verify that it outputs aarch64.
The inference API is available as a Docker container optimized and configured for the Raspberry Pi. You can install and run the inference server using the following command:
You can now use your Pi as a drop-in replacement for the Hosted Inference API (see those docs for example code snippets in several programming languages).
To run inference on your model, run the following code, substituting your API key, workspace and project IDs, project version, and image name as relevant. You can learn how to find your API key and how to find your workspace and project ID.
Here is an example result of our inference on a model:
Inference Result: One Image (Visual Studio Code terminal)
You can also run as a client-server context and send images to the Pi for inference from another machine on your network. Replace localhost in the local= parameter with the Pi's local IP address.
Performance Expectations
We observed about 1.3 frames per second on the Raspberry Pi 400. These results were obtained while operating in a client-server context (so there is some minor network latency involved) and a 416x416 model.