Server data from the Official MCP Registry
MCP server and CLI tool for interacting with the Wayback Machine without API keys
MCP server and CLI tool for interacting with the Wayback Machine without API keys
Valid MCP server (3 strong, 2 medium validity signals). No known CVEs in dependencies. Package registry verified. Imported from the Official MCP Registry. Trust signals: trusted author (4/4 approved).
6 files analyzed · 1 issue found
Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.
This plugin requests these system permissions. Most are normal for its category.
Set these up before or after installing:
Environment variable: WAYBACK_ACCESS_KEY
Environment variable: WAYBACK_SECRET_KEY
Add this to your MCP configuration file:
{
"mcpServers": {
"io-github-mearman-mcp-wayback-machine": {
"env": {
"WAYBACK_ACCESS_KEY": "your-wayback-access-key-here",
"WAYBACK_SECRET_KEY": "your-wayback-secret-key-here"
},
"args": [
"-y",
"mcp-wayback-machine"
],
"command": "npx"
}
}
}From the project's GitHub README.
MCP server and CLI tool for interacting with the Internet Archive's Wayback Machine. Supports full CDX search, snapshot content retrieval, screenshot listing, snapshot comparison, and optional authentication for higher SPN2 rate limits.
Stack: TypeScript · Node.js 22+ · ES Modules · pnpm · Turbo · Zod
Requires Node.js 22+ and pnpm.
pnpm install
Optional credentials (anonymous access works, but authenticated requests get higher SPN2 rate limits):
export WAYBACK_ACCESS_KEY="your-access-key"
export WAYBACK_SECRET_KEY="your-secret-key"
Obtain credentials at archive.org/account/s3.php.
pnpm validate # typecheck + lint + build + test + untested-files check (the full CI gate)
pnpm check # typecheck + lint + build only
pnpm build # compile TypeScript to dist/
pnpm test # run unit and integration tests
pnpm test:coverage # run tests with coverage (80% line/branch/function threshold)
pnpm lint # lint with ESLint
pnpm lint:fix # auto-fix lint issues
To run a single test file:
node --test tests/tools/save.unit.test.ts
End-to-end tests hit the live Wayback Machine API and are opt-in:
pnpm test:e2e # sets WAYBACK_LIVE_TESTS=1 internally via turbo
pnpm validate is the gate that must pass before a release. prepublishOnly runs it automatically.
src/bin.ts is the entry point. It detects whether it is invoked as a CLI or loaded as an MCP server and routes accordingly.
src/
bin.ts — entry point; dispatches to CLI or MCP server
cli.ts — Commander-based CLI implementation
server.ts — MCP server wiring (ListTools + CallTool handlers)
contexts.ts — shared context (rate limiter, cache, credentials)
schemas.ts — Zod schemas for all tool inputs; single source of truth
tools/
save.ts — save_url tool (SPN2 API)
retrieve.ts — get_archived_url tool
search.ts — search_archives tool (CDX API)
status.ts — check_archive_status tool (sparkline API)
screenshots.ts — list_screenshots tool
compare.ts — compare_snapshots tool
cache.ts — clear_cache tool
context.ts — injects shared context into tool handlers
utils/
http.ts — fetch wrapper with rate limiting and Retry-After handling
cache.ts — in-memory + disk cache with per-endpoint TTLs
rate-limit.ts — 15 req/min token bucket
validation.ts — shared Zod validation helpers
Each tool module exports a schema (consumed by ListToolsRequestSchema) and an execution function (consumed by CallToolRequestSchema). New tools need both registrations in server.ts.
Caching TTLs are intentional — do not normalise them:
| Resource | TTL | Reason |
|---|---|---|
| Snapshot content | 24 h | Immutable once captured |
| Availability, CDX, sparkline | 1 h | Grows but never mutates |
| Save operations | 30 min | Idempotent per URL |
| Save status polling | 30 s | Changes during active jobs |
noUncheckedIndexedAccess and exactOptionalPropertyTypes — no any, no as assertions."type": "module" in package.json. Always use .ts extensions in relative imports (rewritten to .js at build time via rewriteRelativeImportExtensions).schemas.ts defines them; zodToJsonSchema derives the MCP-compatible JSON Schema.es5), 80-char print width, LF line endings.retrieve, save, search, status, fetch, http, validation, cli, build, release, ci, deps. Commit messages must use British English.tests/tools/*.unit.test.ts and tests/utils/*.unit.test.ts; integration tests in tests/*.integration.test.ts. Use the Node.js built-in test runner — no Jest or Vitest.erasableSyntaxOnly: true — no TypeScript syntax that cannot be stripped without transformation (no enum, no decorators, no namespace).pnpm validate before pushing. CI runs check + test + coverage + untested-files. pnpm validate replicates this locally via Turbo.pnpm turbo run <task> --force if results look wrong.WAYBACK_LIVE_TESTS must be set to run test:e2e. The Turbo config passes it through via globalPassThroughEnv; don't set it in .env files — export it in your shell before running.src/contexts.ts, src/cli.ts, src/bin.ts, and src/tools/context.ts — these are wiring/entry-point files. The 80% threshold applies to the remaining surface.noUncheckedIndexedAccess means Record<string, T> lookups return T | undefined. Never fall back to ?? default — narrow explicitly or restructure to a concrete type..tool-versions. CI tests against Node 22, 24, and 26. Do not use Node APIs that aren't available in 22.Commits follow Conventional Commits and are lint-checked by commitlint on PRs. PRs target main; CI must pass (check, test, coverage). Releases are fully automated via semantic-release on push to main.
After release, alias packages (wayback-machine-mcp, mcp-internet-archive, internet-archive-mcp, @mearman/mcp-wayback-machine) are published automatically by CI — do not publish these manually.
Claude Code (MCP):
claude mcp add wayback-machine -- npx -y mcp-wayback-machine
Claude Code (plugin marketplace):
/plugin marketplace add https://github.com/Mearman/mcp-wayback-machine.git
/plugin install mcp-wayback-machine@mcp-wayback-machine
OpenAI Codex:
codex mcp add wayback-machine -- npx -y mcp-wayback-machine
To include optional credentials:
claude mcp add wayback-machine --env WAYBACK_ACCESS_KEY=xxx --env WAYBACK_SECRET_KEY=xxx -- npx -y mcp-wayback-machine
Add to the appropriate config file:
{
"wayback-machine": {
"command": "npx",
"args": ["-y", "mcp-wayback-machine"],
"env": {
"WAYBACK_ACCESS_KEY": "your-access-key",
"WAYBACK_SECRET_KEY": "your-secret-key"
}
}
}
| Harness | Config file | Config key |
|---|---|---|
| Claude Code | .mcp.json (project) / ~/.claude.json (user) | mcpServers |
| Codex | ~/.codex/config.toml | [mcp_servers.wayback-machine] |
| Gemini CLI | ~/.gemini/settings.json | mcpServers |
| Crush | .crush.json / ~/.config/crush/crush.json | mcp |
| Cline | .cline/mcp.json | mcpServers |
| Cursor | .cursor/mcp.json | mcpServers |
| Zed | ~/.config/zed/settings.json | context_servers |
| Claude Desktop | ~/Library/Application Support/Claude/claude_desktop_config.json | mcpServers |
The env block is optional — the server works anonymously without credentials.
npx mcp-wayback-machine save https://example.com
Or install globally:
npm install -g mcp-wayback-machine
wayback save https://example.com
Deploy the MCP server as a stateless Cloudflare Worker. Runs on the free tier with no paid bindings — all persistent state uses the Cache API which has no published daily limits.
pnpm add -D wrangler
wrangler deploy
The Worker uses the SDK's StreamableHTTPServerTransport in stateless mode (no session IDs), so each request is independent and cold starts are handled gracefully.
Environment variables (set via wrangler secret put):
| Variable | Required | Purpose |
|---|---|---|
WAYBACK_ACCESS_KEY | No | Fallback IA S3 credentials for higher SPN2 rate limits |
WAYBACK_SECRET_KEY | No | Fallback IA S3 credentials |
MCP_AUTH_TOKEN | No | Bearer token for client authentication |
When MCP_AUTH_TOKEN is set, clients must send Authorization: Bearer <token>. When absent, the Worker accepts unauthenticated requests.
Per-request credentials. Clients can pass their own IA S3 credentials on each request via HTTP headers, overriding the server's environment variables:
X-Archive-Access-Key: <your-access-key>
X-Archive-Secret-Key: <your-secret-key>
This lets multiple users share a single Worker deployment while each using their own credentials for higher SPN2 rate limits. When both headers are present, they take precedence over WAYBACK_ACCESS_KEY/WAYBACK_SECRET_KEY. When absent, the Worker falls back to its environment variables.
Worker-specific files are excluded from the main tsconfig.json and type-checked separately via tsconfig.worker.json (which adds @cloudflare/workers-types).
Architecture. The stdio and Worker deployments share the same tool logic through pluggable interfaces:
| Component | Stdio | Worker |
|---|---|---|
| Caching | DiskCacheBackend (OS cache dir) | CacheApiBackend (caches.open()) |
| Rate limiting | InMemoryRateLimiter | CacheApiRateLimiter |
| Auth | None | StaticTokenAuthProvider (optional) |
| Credentials | Environment variables | Request headers, then environment variables |
Archive https://example.com to the Wayback Machine
Find all archived snapshots of https://example.com from 2023
What's the earliest archived version of https://example.com?
Compare the oldest and newest snapshots of https://example.com
Check how many times https://example.com has been archived
save_urlArchive a URL to the Wayback Machine using the SPN2 API.
| Parameter | Required | Description |
|---|---|---|
url | Yes | The URL to archive |
captureScreenshot | No | Capture a screenshot as a PNG image |
captureOutlinks | No | Also archive up to 100 outlinked pages |
ifNotArchivedWithin | No | Skip if archived within timeframe, e.g. "30d" |
jsBehaviorTimeout | No | Run JavaScript for N seconds before capturing (max 30) |
forceGet | No | Use simple HTTP GET instead of browser rendering |
delayWbAvailability | No | Delay indexing ~12 hours to reduce server load |
get_archived_urlRetrieve an archived snapshot's content and metadata.
| Parameter | Required | Description |
|---|---|---|
url | Yes | The URL to retrieve |
timestamp | No | Specific timestamp (YYYYMMDDhhmmss) or "latest" |
modifier | No | URL modifier: id_ (raw), im_ (screenshot), js_ (JS), cs_ (CSS) |
search_archivesSearch the CDX API for archived versions of a URL.
| Parameter | Required | Description |
|---|---|---|
url | Yes | The URL pattern to search for |
matchType | No | exact, prefix, host, or domain |
from | No | Start date (YYYYMMDD or YYYY-MM-DD) |
to | No | End date (YYYYMMDD or YYYY-MM-DD) |
limit | No | Maximum results (default 10) |
offset | No | Skip the first N results |
collapse | No | Collapse duplicates, e.g. "timestamp:8" (per hour), "digest" |
filter | No | Filter by field regex, e.g. ["statuscode:200", "!mimetype:image.*"] |
resolveRevisits | No | Resolve warc/revisit entries to original metadata |
showDupeCount | No | Show duplicate count per capture |
page | No | Page number for pagination |
pageSize | No | Results per page |
check_archive_statusCheck archival statistics for a URL — capture counts, yearly breakdowns, and first/last capture dates.
| Parameter | Required | Description |
|---|---|---|
url | Yes | The URL to check |
list_screenshotsList available screenshots for a URL.
| Parameter | Required | Description |
|---|---|---|
url | Yes | The URL to find screenshots for |
limit | No | Maximum results (default 10) |
compare_snapshotsCompare two archived snapshots of a URL. Fetches the raw content of both and provides a visual diff URL.
| Parameter | Required | Description |
|---|---|---|
url | Yes | The URL to compare snapshots for |
timestampA | No | First timestamp. Defaults to oldest available. |
timestampB | No | Second timestamp. Defaults to newest available. |
clear_cacheClear all cached API responses. Use when fresh data is needed or after saving a new URL.
wayback save https://example.com
wayback get https://example.com
wayback get https://example.com --timestamp 20231225120000
wayback search https://example.com --from 2023-01-01 --to 2023-12-31 --limit 20
wayback status https://example.com
wayback screenshots https://example.com
wayback compare https://example.com
wayback compare https://example.com --timestamp-a 20230101000000 --timestamp-b 20240101000000
ia Python CLI; complements this server.Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International.
Be the first to review this server!
by Modelcontextprotocol · Developer Tools
Web content fetching and conversion for efficient LLM usage
by Toleno · Developer Tools
Toleno Network MCP Server — Manage your Toleno mining account with Claude AI using natural language.
by mcp-marketplace · Developer Tools
Create, build, and publish Python MCP servers to PyPI — conversationally.