Server data from the Official MCP Registry
MCP Server for video enhancement and image segmentation (SAM3)
MCP Server for video enhancement and image segmentation (SAM3)
Valid MCP server (4 strong, 5 medium validity signals). 5 known CVEs in dependencies Package registry verified. Imported from the Official MCP Registry.
5 files analyzed · 6 issues found
Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.
This plugin requests these system permissions. Most are normal for its category.
Set these up before or after installing:
Environment variable: API_KEY
Add this to your MCP configuration file:
{
"mcpServers": {
"io-github-avclabs-media-mcp": {
"env": {
"API_KEY": "your-api-key-here"
},
"args": [
"-y",
"@avclabs.ai/media-mcp"
],
"command": "npx"
}
}
}From the project's GitHub README.
English | 中文
A video enhancement, image enhancement/colorization/denoising, and image segmentation service based on the MCP protocol, acting as an MCP Client-Server to interact with backend HTTP Servers.
Provides the following MCP Tools:
Video Enhancement
create_task - Create a video enhancement task (supports URL or local file upload)get_task_status - Query task statusenhance_video_sync - Synchronously enhance video (blocking wait, truncated at ~50s by default)Image Enhancement
enhance_image_sync - Enhance image quality and optimize faces (supports URL or local file upload)colorize_image_sync - Colorize black-and-white photos (supports URL or local file upload)denoise_image_sync - Remove noise from images (supports URL or local file upload)get_image_task_status - Query image task status (for polling after sync timeout)Image Segmentation (SAM3)
sam3_predict - SAM3 image segmentation (supports local path, URL, or Base64 image)get_sam3_task_status - Query SAM3 task status (for polling after sync timeout)node --version)If your AI Agent has a known MCP config path, just copy the line below and send it to your AI:
Install the npm package @avclabs.ai/media-mcp as an MCP server. My API Key is: sk-xxxxxxxx.
The AI will automatically:
No installation needed. Use npx directly in your MCP client config.
Run in Claude Code:
/mcp
Check the output for the "User MCPs" section to find the config file path, then edit that file.
Common paths (if /mcp is unavailable):
%USERPROFILE%\.claude.json~/.claude.json~/.claude.json~/.claude/mcp.jsonPaste this (replace your-api-key):
{
"mcpServers": {
"video-enhancement": {
"command": "npx",
"args": ["-y", "@avclabs.ai/media-mcp@latest"],
"env": {
"API_KEY": "your-api-key"
}
}
}
}
Save and run /mcp to verify it's loaded.
Go to Settings > Tools & MCPs > Add New MCP Server:
video-enhancementcommandenv API_KEY=your-api-key npx -y @avclabs.ai/media-mcp@latest
Or edit ~/.cursor/mcp.json:
{
"mcpServers": {
"video-enhancement": {
"command": "npx",
"args": ["-y", "@avclabs.ai/media-mcp@latest"],
"env": {
"API_KEY": "your-api-key"
}
}
}
}
After restarting your client, check if the tools are available:
create_task, get_task_status, enhance_video_sync, enhance_image_sync, colorize_image_sync, denoise_image_sync, get_image_task_status, sam3_predict, get_sam3_task_status| Variable | Required | Default | Description |
|---|---|---|---|
API_KEY | Yes | - | API authentication key (shared by video enhancement and SAM3) |
HTTP_API_BASE_URL | No | https://mcp.avc.ai/enhance | Video enhancement service endpoint |
SAM3_API_BASE_URL | No | https://mcp.avc.ai/sam | SAM3 service endpoint |
SAM3_POLL_INTERVAL | No | 2000 | SAM3 polling interval (milliseconds) |
SAM3_POLL_MAX_ATTEMPTS | No | 25 | SAM3 maximum polling attempts |
{
"env": {
"HTTP_API_BASE_URL": "https://your-endpoint.com",
"API_KEY": "your-api-key",
"SAM3_API_BASE_URL": "https://your-sam3-endpoint.com"
}
}
Or via CLI args:
npx -y @avclabs.ai/media-mcp@latest --base-url https://your-endpoint.com --api-key your-api-key --sam3-base-url https://your-sam3-endpoint.com
This project provides both synchronous and asynchronous modes.
Because MCP Agents typically enforce a ~60-second timeout per tool call, tasks with longer processing times (video enhancement) are strongly recommended to use asynchronous mode:
Video Enhancement:
create_task to create a task → immediately get task_idget_task_status to query the statusstatus is processing, continue waiting and repeat step 2status is completed, the task is done and the result contains video_urlstatus is failed, the task failed and the result contains error_messageVideo Enhancement:
enhance_video_sync → the server polls internallytask_id and instructions for the Agent to switch to get_task_statusImage Segmentation (SAM3):
sam3_predict → the server polls internallyOnce configured, ask your AI agent naturally:
"Enhance this video to 1080p: https://example.com/video.mp4"
"Improve the quality of /Users/me/Desktop/video.mp4 to 2k"
"Enhance this image: https://example.com/photo.jpg"
"Colorize this black-and-white photo: /Users/me/Desktop/old_photo.png"
"Remove noise from this image: C:\Users\xxx\noisy.jpg"
"Analyze this image and find all objects: C:\Users\xxx\photo.png"
"Use SAM3 to segment this image, prompt: 'find all cars'"
The agent will automatically choose sync or async tools based on task complexity.
Create an asynchronous video enhancement task.
Recommended for most use cases. Ideal for longer videos (over 10 seconds) to avoid timeouts and blocking the connection.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
video_source | string | Yes | - | Video URL or local file path (URL must be publicly accessible, links requiring login or signatures are not supported) |
type | string | No | url | url or local |
resolution | string | No | 720p | 480p, 540p, 720p, 1080p, 2k |
Returns:
{
"success": true,
"task_id": "xxx",
"status": "processing"
}
Query video enhancement task status.
The returned
statusfield can be:processing,completed, orfailed. Ifstatusisprocessing, you need to wait a few seconds and call this tool again.
| Parameter | Type | Required |
|---|---|---|
task_id | string | Yes |
Returns:
{
"success": true,
"task_id": "xxx",
"status": "completed",
"progress": 100,
"video_url": "https://...",
"message": "Task is still processing, please check again later"
}
The message field only appears when status is processing, prompting the Agent to continue waiting.
Synchronously enhance video (blocks until completion).
Best for short videos (estimated processing time < 1 minute). If the task is not completed within 50 seconds, the tool returns early with a
task_id, and you need to useget_task_statusto continue querying.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
video_source | string | Yes | - | Video URL or local file path |
type | string | No | url | url or local |
resolution | string | No | 720p | Target resolution |
poll_interval | number | No | 5 | Poll interval (seconds) |
timeout | number | No | 50 | Sync wait timeout (seconds), returns early when exceeded |
Truncated return example (not completed within 50s):
{
"success": true,
"status": "processing",
"task_id": "xxx",
"message": "Task is still processing (waited 50 seconds). Please use get_task_status to continue polling.",
"note": "The synchronous wait for this long-running task has been truncated. Switch to get_task_status polling."
}
Three image processing tools are provided, each targeting a specific use case:
| Tool | Function | Use Case |
|---|---|---|
enhance_image_sync | Image quality enhancement & face optimization | Blurry, low-resolution, or degraded photos |
colorize_image_sync | Black-and-white photo colorization | Restoring old B&W photos with realistic colors |
denoise_image_sync | Image noise removal | Noisy/grainy photos taken in low light |
All three tools share the same parameters and behavior pattern. They are synchronous — the tool blocks until the image is processed or the timeout is reached.
Supported image formats: PNG, JPG, JPEG, BMP, WebP, etc.
Two upload methods:
type: "url")type: "local", max file size: 100MB)Synchronously enhance an image to improve quality and optimize faces.
The tool internally creates a task and polls for the result. If processing completes within the timeout (default 50s), the result is returned directly. If not, the tool returns early with a
task_id— useget_image_task_statusto continue polling.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
image_source | string | Yes | - | Image URL or local file path (URL must be publicly accessible, links requiring login or signatures are not supported) |
type | string | No | url | url or local |
scale | number | No | 2 | Enhancement scale multiplier (e.g. 2 for 2x, 4 for 4x upscaling) |
poll_interval | number | No | 5 | Poll interval in seconds |
timeout | number | No | 50 | Sync wait timeout in seconds, returns early when exceeded |
Normal completion return:
{
"success": true,
"task_id": "xxx",
"status": "completed",
"progress": 100,
"image_url": "https://..."
}
Truncated return (not completed within 50s):
{
"success": true,
"status": "processing",
"task_id": "xxx",
"message": "Task is still processing (waited 50 seconds). Please use get_image_task_status to continue polling.",
"note": "The synchronous wait for this long-running task has been truncated. Switch to get_image_task_status polling."
}
Synchronously colorize a black-and-white photo with AI.
Best for old black-and-white photos. The AI will add realistic colors to the image. Supports the same parameters and return format as
enhance_image_sync.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
image_source | string | Yes | - | Image URL or local file path (URL must be publicly accessible, links requiring login or signatures are not supported) |
type | string | No | url | url or local |
poll_interval | number | No | 5 | Poll interval in seconds |
timeout | number | No | 50 | Sync wait timeout in seconds, returns early when exceeded |
Returns: Same format as enhance_image_sync.
Synchronously remove noise from an image.
Best for grainy/noisy photos taken in low-light conditions or with high ISO settings. Supports the same parameters and return format as
enhance_image_sync.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
image_source | string | Yes | - | Image URL or local file path (URL must be publicly accessible, links requiring login or signatures are not supported) |
type | string | No | url | url or local |
poll_interval | number | No | 5 | Poll interval in seconds |
timeout | number | No | 50 | Sync wait timeout in seconds, returns early when exceeded |
Returns: Same format as enhance_image_sync.
Query image processing task status. Used to poll for results when a sync tool times out.
The returned
statusfield can be:processing,completed, orfailed. Ifstatusisprocessing, wait a few seconds and call this tool again.
| Parameter | Type | Required |
|---|---|---|
task_id | string | Yes |
Returns:
{
"success": true,
"task_id": "xxx",
"status": "completed",
"progress": 100,
"image_url": "https://...",
"message": "Task is still processing, please check again later"
}
The message field only appears when status is processing, prompting the Agent to continue waiting.
enhance_image_sync / colorize_image_sync / denoise_image_sync directly — the tool handles everything and returns the resulttask_id, then use get_image_task_status to poll until status becomes completed or failederror_message field for detailsAnalyze an image using the SAM3 segmentation API to generate inference results (masks, boxes, scores).
Parameters:
Image input (choose one, must provide exactly one):
imagePath (string): Absolute path of a local image file. Supports common formats (PNG, JPG, JPEG).
"C:\\Users\\xxx\\photo.png", "/home/user/images/cat.jpg"imageUrl (string): Publicly accessible URL of the image.
"https://example.com/photo.jpg"imageBase64 (string): Base64-encoded image data.
"iVBORw0KGgoAAAANSUhEUgAA..."Other parameters:
prompt (string, required): English text prompt specifying the target object to segment. Since the SAM3 model only accepts English prompts, provide an English description. If the user provides Chinese or other non-English text, the Agent will automatically translate it before calling the tool.Normal completion return:
After inference completes, returns a JSON string containing three fields:
masks: 2D array. Each element is a binary mask (values 0 or 1) with the same dimensions as the input image, marking the pixel-level location of detected objects. The i-th mask corresponds to the i-th detected object instance.
boxes: 2D array. Each element is a bounding box in [x1, y1, x2, y2] format, representing the rectangular region of the detected object. x1, y1 are the top-left coordinates; x2, y2 are the bottom-right coordinates.
Coordinate system: The top-left corner of the image is the origin (0, 0). The x-axis increases to the right, and the y-axis increases downward, in pixels. For example, [120, 80, 300, 450] means the region starts 120px from the left edge and 80px from the top edge, extending to 300px from the left and 450px from the top. Width = x2 - x1 = 180px, Height = y2 - y1 = 370px.
scores: 1D array. Each element is a confidence score for the corresponding detection result, ranging from 0 to 1. Higher scores indicate greater model confidence.
Example result JSON:
{
"masks": [
[[0, 0, 1, ...], [0, 1, 1, ...], ...],
[[0, 0, 0, ...], [0, 0, 1, ...], ...]
],
"boxes": [
[120, 80, 300, 450],
[400, 200, 600, 500]
],
"scores": [0.95, 0.87]
}
Truncated return example (not completed within 50s):
{
"success": true,
"status": "processing",
"task_id": "xxx",
"message": "Task is still processing (waited about 50 seconds). Please retry later or record this task_id for manual follow-up.",
"note": "The synchronous wait for this long-running task has been truncated."
}
Query SAM3 segmentation task status. Used to poll for results when sam3_predict times out.
The returned
statusfield can be:processing,completed, orfailed. Ifstatusisprocessing, wait a few seconds and call this tool again.
| Parameter | Type | Required |
|---|---|---|
task_id | string | Yes |
Completed return:
{
"success": true,
"task_id": "xxx",
"status": "completed",
"result_url": "https://..."
}
Processing return:
{
"success": true,
"task_id": "xxx",
"status": "processing",
"message": "Task is still processing, please check again later."
}
Failed return:
{
"success": false,
"task_id": "xxx",
"status": "failed",
"error": "Task failed"
}
This is the primary issue this project addresses. MCP Agents (such as Claude, Cursor) typically enforce a ~60-second timeout per tool call. If task processing exceeds this limit, the Agent will error and disconnect.
Solutions:
Prefer asynchronous tools: For video enhancement and other time-consuming tasks, always use create_task + get_task_status. These tools return instantly on each call and will not trigger timeouts.
Sync tool truncation mechanism: enhance_video_sync has an internal 50-second truncation limit. If the task is not completed within 50 seconds, the tool proactively returns a task_id and instructs the Agent to use get_task_status to follow up.
SAM3 truncation mechanism: sam3_predict defaults to 25 polling attempts (~50 seconds). If the task is not completed, it returns a truncation notice indicating the task is still processing.
Adjust SAM3 polling parameters (advanced): If you are confident that SAM3 tasks are usually fast (e.g., under 10 seconds), you can increase polling attempts via environment variable:
SAM3_POLL_MAX_ATTEMPTS=60
But ensure the total wait time does not exceed your Agent's timeout limit.
This is a known limitation of stdio MCP. When dragging or uploading an attachment through the Agent interface, the file path is usually not automatically passed to the MCP Server.
Solutions:
Provide the path simultaneously (recommended): After dragging the image, add the local absolute path in your message:
"Please analyze this image
D:\\photos\\cat.jpgand find the cat"
Wait for auto-encoding: Claude may automatically encode the image as base64. If successful, no extra action is needed.
Reply to path inquiry: If Claude asks for the image path, simply reply with the local absolute path.
There is no strict priority. Claude will automatically choose the most appropriate method based on conversation context:
imagePathimageUrlimageBase64Common formats: PNG, JPG, JPEG, BMP, WebP, etc. PNG or JPG is recommended.
Ensure the URL is publicly accessible, requiring no login, cookies, or signatures. If the image is on a service requiring authentication (e.g., private S3 Bucket, login-required image host), download it locally first and use imagePath.
If the image is very large (e.g., 4K resolution), the base64-encoded data will be very large and may slow transmission. Suggestions:
imagePath insteadWhen type is "local":
Install Node.js >= 18: https://nodejs.org/
Your API Key is missing. Double-check the env.API_KEY in your config.
Check logs:
~/Library/Logs/Claude/mcp*.log%APPDATA%\Claude\logs\mcp*.logUsually a signature mismatch. Ensure your HTTP_API_BASE_URL and API_KEY are correct and active.
If you prefer not using npx every time:
npm install -g @avclabs.ai/media-mcp
Then use "command": "media-mcp" with "args": ["--api-key", "your-api-key"] in your config.
MIT License - See LICENSE file for details
Be the first to review this server!
by Modelcontextprotocol · Developer Tools
Web content fetching and conversion for efficient LLM usage
by Toleno · Developer Tools
Toleno Network MCP Server — Manage your Toleno mining account with Claude AI using natural language.
by mcp-marketplace · Developer Tools
Create, build, and publish Python MCP servers to PyPI — conversationally.