Server data from the Official MCP Registry
Web scraping MCP — extract clean markdown, links, and metadata from any URL.
Web scraping MCP — extract clean markdown, links, and metadata from any URL.
Valid MCP server (2 strong, 4 medium validity signals). 2 known CVEs in dependencies Package registry verified. Imported from the Official MCP Registry. Trust signals: trusted author (15/16 approved).
8 files analyzed · 3 issues found
Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.
This plugin requests these system permissions. Most are normal for its category.
Add this to your MCP configuration file:
{
"mcpServers": {
"io-github-ofershap-scraper": {
"args": [
"-y",
"mcp-server-scraper"
],
"command": "npx"
}
}
}From the project's GitHub README.
Extract clean, readable content from any URL. Returns markdown text, links, and metadata. No API keys, no config. A free alternative to Firecrawl for scraping docs, blogs, and articles.
npx mcp-server-scraper
Works with Claude Desktop, Cursor, VS Code Copilot, and any MCP client. No accounts or API keys needed.

Demo built with remotion-readme-kit
When you're working with an AI assistant and need to reference a docs page, a blog post, or an API reference, you usually end up copy-pasting content manually. Tools like Firecrawl solve this but require a paid API key. This server does the same thing for free. It fetches a URL, runs it through Mozilla Readability (the same engine behind Firefox Reader View), and returns clean markdown. It works well for server-rendered content like documentation sites, blog posts, and articles. It won't handle JavaScript-heavy SPAs, but for the most common use case of "read this docs page and summarize it," it does the job.
| Tool | What it does |
|---|---|
scrape_url | Extract clean text content from a URL (Readability-powered) |
extract_links | Get all links with href and anchor text |
extract_metadata | Get title, description, OG tags, canonical, favicon |
search_page | Search for a query string within the page, return matching lines |
scrape_multiple | Batch scrape multiple URLs, get title + excerpt per URL |
Add to .cursor/mcp.json:
{
"mcpServers": {
"scraper": {
"command": "npx",
"args": ["-y", "mcp-server-scraper"]
}
}
}
Add to claude_desktop_config.json:
{
"mcpServers": {
"scraper": {
"command": "npx",
"args": ["-y", "mcp-server-scraper"]
}
}
}
Add to your MCP settings (e.g. .vscode/mcp.json):
{
"mcp": {
"servers": {
"scraper": {
"command": "npx",
"args": ["-y", "mcp-server-scraper"]
}
}
}
}
Uses Mozilla Readability (the engine behind Firefox Reader View) plus linkedom for fast HTML parsing in Node. No headless browser needed. Works best with server-rendered pages: docs, blogs, articles, news sites.
npm install
npm run typecheck
npm run build
npm test
More MCP servers and developer tools on my portfolio.
README built with README Builder
Be the first to review this server!
by Toleno · Developer Tools
Toleno Network MCP Server — Manage your Toleno mining account with Claude AI using natural language.
by mcp-marketplace · Developer Tools
Create, build, and publish Python MCP servers to PyPI — conversationally.
by Microsoft · Content & Media
Convert files (PDF, Word, Excel, images, audio) to Markdown for LLM consumption
by mcp-marketplace · Developer Tools
Scaffold, build, and publish TypeScript MCP servers to npm — conversationally
by mcp-marketplace · Finance
Free stock data and market news for any MCP-compatible AI assistant.
by Taylorwilsdon · Productivity
Control Gmail, Calendar, Docs, Sheets, Drive, and more from your AI