SAM3
Meta の SAM3 モデルを Serverless Hosted API 経由で使用します
コードサンプル
PCSコードサンプル
import os
import requests
import base64
import cv2
import numpy as np
# From "https://media.roboflow.com/notebooks/examples/dog.jpeg"
image = cv2.imread("./dog.jpeg")
# Encode image as base64
_, buffer = cv2.imencode('.jpg', image)
image_base64 = base64.b64encode(buffer).decode('utf-8')
payload = {
"image": { "type": "base64", "value": image_base64 },
"prompts": [
{ "type": "text", "text": "person" },
{ "type": "text", "text": "dog" },
],
"output_prob_thresh": 0.5,
"format": "polygon",
}
url = "https://serverless.roboflow.com/sam3/concept_segment?api_key=" + os.getenv("API_KEY")
response = requests.post(url, json=payload)
data = response.json()
for key in dat
print(key) # Should be prompt_results and timePVSコードサンプル
エンドポイント
Your Roboflow API Key. Get one at https://app.roboflow.com/settings/api
One of 'polygon', 'rle'
polygonOptional ID for caching embeddings.
Score threshold for outputs.
0.5The model ID of SAM3. Use 'sam3/sam3_final' to target the generic base model.
sam3/sam3_finalIoU threshold for cross-prompt NMS. If not set, NMS is disabled. Must be in [0.0, 1.0] when set.
Successful Response
The time in seconds it took to produce the segmentation including preprocessing
Validation Error
Your Roboflow API Key. Get one at https://app.roboflow.com/settings/api
SAM2 visual segmentation request.
The ID of the image to be segmented used to retrieve cached embeddings. If an embedding is cached, it will be used instead of generating a new embedding. If no embedding is cached, a new embedding will be generated and cached.
image_idThe format of the response. Must be one of 'json', 'rle', or 'binary'. If binary, masks are returned as binary numpy arrays. If json, masks are converted to polygons. If rle, masks are converted to RLE format.
jsonExample: jsonThe version ID of SAM to be used for this request. Must be one of hiera_tiny, hiera_small, hiera_large, hiera_b_plus
hiera_largeExample: hiera_largeIf true, the model will return three masks. For ambiguous input prompts (such as a single click), this will often produce better masks than a single prediction.
trueExample: trueIf True, saves the low-resolution logits to the cache for potential future use.
falseIf True, attempts to load previously cached low-resolution logits for the given image and prompt set.
falseSuccessful Response
The time in seconds it took to produce the segmentation including preprocessing
Validation Error
Last updated
Was this helpful?