Server data from the Official MCP Registry
Source-first AI knowledge control tower MCP server for YouTube, Bilibili, RSSHub, and RSS.
Source-first AI knowledge control tower MCP server for YouTube, Bilibili, RSSHub, and RSS.
Valid MCP server (1 strong, 0 medium validity signals). No known CVEs in dependencies. Package registry verified. Imported from the Official MCP Registry. Trust signals: 3 highly-trusted packages.
8 files analyzed · 1 issue found
Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.
Set these up before or after installing:
Environment variable: SOURCE_HARBOR_API_BASE_URL
Add this to your MCP configuration file:
{
"mcpServers": {
"io-github-xiaojiou176-open-sourceharbor-mcp": {
"env": {
"SOURCE_HARBOR_API_BASE_URL": "your-source-harbor-api-base-url-here"
},
"args": [
"sourceharbor"
],
"command": "uvx"
}
}
}From the project's GitHub README.
SourceHarbor helps you turn long-form sources into a reader-first flow of grounded search results, finished reading surfaces, and inspectable job runs. It stays source-first and proof-first: you can inspect it, run it locally, and verify each surface instead of trusting product copy on vibes alone.
Current reading specimen
Bilibili history milestone: the earliest surviving AV2.One readable title up front. One short excerpt. Proof one click away.
Choose one first move:
| If you want to... | Open this first | Why this is the right first door |
|---|---|---|
| See the product first | docs/see-it-fast.md, then docs/proof.md | start with the reader and the evidence before you boot anything |
| Run one real local flow | docs/start-here.md | this is the shortest truthful path from clone to /reader, /feed, /search, /ask, and one real job |
If you are here as a builder, skip straight to docs/builders.md and docs/public-distribution.md after you understand the reader-first front door.
If you only remember one sentence, remember this:
SourceHarbor is a reader-first, source-first, proof-first product repo.
Current intake truth:
/subscriptions, with changelog/status/blog as confirmed truth and X explicitly held as an observation layer| Surface | Why open it first | Current truth |
|---|---|---|
| Reader | See the finished surface instead of starting in a control panel | real local route after boot: /reader |
| Subscriptions + Feed | Follow a few sources, then watch the reading flow fill in | real local routes after boot: /subscriptions and /feed |
| Search + Ask | Search saved material or ask for the current story with evidence nearby | real local routes after boot: /search and /ask |
| Jobs + Proof | Inspect the pipeline, artifacts, and truth layers instead of trusting marketing copy | /jobs, docs/proof.md, and docs/project-status.md |
SourceHarbor is a multi-surface product repo, not a single skill package.
Public starter packs and plugin-grade bundles are adoption layers inside that
repo. They are not the whole product, and they are not raw exports of the
internal .agents/skills tree.
Think of this as the label on the box, not fine print:
If you need the explicit bet boundaries, read:
Keep these truth layers separate when you read or share the repo:
main truth can move ahead of the latest release tagmain are already greenThat is why SourceHarbor keeps proof.md, project-status.md, and the public-reference docs as separate ledgers instead of one blanket “ready” claim.
These are the surfaces that make SourceHarbor reusable instead of one-and-done:
| Compounder | What it does | Current truth |
|---|---|---|
| Watchlists | Save a topic, claim kind, or source matcher as a durable tracking object | Real route: /watchlists |
| Trends | Compare recent matched runs for a watchlist and show what was added or removed | Real route: /trends |
| Briefings | Collapse one watchlist into a unified story surface that starts with the current summary, highlights recent deltas, and keeps evidence one click away | Real route: /briefings; now backed by a server-owned briefing page payload that shares one canonical selected-story object with Ask |
| Evidence bundle | Export one job as a reusable internal bundle with digest, trace summary, knowledge cards, and artifact manifest | Real route on demand: /api/v1/jobs/<job-id>/bundle |
| Playground | Explore clearly labeled sample corpus and demo outputs without pretending they are live operator state | Real route: /playground + docs/samples/README.md |
| Use-case pages | Route newcomer traffic into truthful capability stories for YouTube, Bilibili, RSS, MCP, and research workflows | Real routes: /use-cases/youtube, /use-cases/bilibili, /use-cases/rss, /use-cases/mcp-use-cases, /use-cases/research-pipeline |
These are real directions, but they are not current product claims:
If you only have half a minute, do not start with setup.
Start with three surfaces:
Source -> queued job -> timeline -> reader / proof -> MCP / API reuse
| Surface | What you can do | Why it matters |
|---|---|---|
| Subscriptions | Start from strong YouTube/Bilibili templates or widen into RSSHub and generic RSS intake through the shared backend template catalog | Build a durable intake layer without pretending every source family is equally proven |
| Timeline | Read generated summaries in one calm flow, then jump into the current reader edition when a story is ready | Turn long-form content into an actionable daily reading stream without hiding the finished published-doc layer |
| Search & Ask | Search raw evidence and turn a watchlist or selected story briefing into an answer + change + citation flow on one page, with Briefings and Ask now sharing a server-owned story read-model instead of parallel browser-side selection glue | Make the knowledge layer visible without pretending every question already has a global answer engine |
| Job trace | Inspect pipeline status, retries, degradations, and artifacts | Debug with evidence instead of guessing what happened |
| Notifications | Configure and send digests outward when the notification lane is enabled | Push results outward instead of trapping them in a database |
| Retrieval | Search over generated artifacts | Reuse digests as a searchable knowledge layer |
| MCP tools | Expose subscriptions, ingestion, jobs, artifacts, search, and notifications to agents | Let assistants act on the same system without custom glue code |
README is no longer the full operator walkthrough.
Use it to choose the next page, then leave quickly:
If you need the live operator-side log trail after a local run, start at
.runtime-cache/logs/components/full-stack.
If you are here as a builder, use the builder path on purpose:
packages/sourceharbor-cli, packages/sourceharbor-sdk, starter-packs/README.mdmcp-name: io.github.xiaojiou176-open/sourceharbor-mcpContainer truth also stays separate on purpose:
Most repos in this space stop at one of these layers:
SourceHarbor is built around the full knowledge flow:
See the full comparison in docs/compare.md.
This repository does not ask you to trust product copy on its own.
GitHub profile intent is tracked in config/public/github-profile.json. Use
python3 scripts/github/apply_public_profile.py --verify to compare the live
description, homepage, and topics against the current tracked intent, and use
python3 scripts/github/apply_public_profile.py when you intentionally want to
sync those settings after current main truth is ready. Social preview upload
still requires a manual GitHub Settings check.
Operator-generated pointers and historical planning ledgers can help maintainers inspect deeper evidence, but they are not the public truth route.
SourceHarbor is a public, source-first engineering repository.
It is inspectable, and you can run it locally. It is not marketed as a turnkey hosted product, and external distribution claims are valid only when live remote workflows prove them for the current
maincommit.
For local verification, the repo-managed route snapshot under
.runtime-cache/run/full-stack/resolved.env is the runtime truth for API/Web
ports. Do not assume any process already listening on 9000, 3000, or
5432 belongs to the clean-path stack.
Start where you are:
No. SourceHarbor is a source-first repository you can inspect, run locally, adapt, and extend.
No. The public surface is strongest around long-form video today, but the feed and retrieval layers already model both video and article content types.
Because it sits at the intersection of source ingestion, digest pipelines, retrieval, operator UI, and MCP reuse. Even if you are not adopting it immediately, it is a strong reference point for how to turn long-form inputs into reusable knowledge products.
More questions are answered in docs/faq.md.
apps/api: FastAPI service for ingestion, jobs, artifacts, retrieval, notifications, and operator controlsapps/worker: pipeline runner, Temporal workflows, and delivery automationapps/mcp: MCP tool surface for agentsapps/web: reader-first web surfaces for operators and builderscontracts: shared schemas and generated contract artifactsdocs: layered public navigation, proof, and architectureSourceHarbor is released under the MIT License. See LICENSE.
Be the first to review this server!
by Modelcontextprotocol · Developer Tools
Read, search, and manipulate Git repositories programmatically
by Toleno · Developer Tools
Toleno Network MCP Server — Manage your Toleno mining account with Claude AI using natural language.
by mcp-marketplace · Developer Tools
Create, build, and publish Python MCP servers to PyPI — conversationally.
by Microsoft · Content & Media
Convert files (PDF, Word, Excel, images, audio) to Markdown for LLM consumption
by mcp-marketplace · Developer Tools
Scaffold, build, and publish TypeScript MCP servers to npm — conversationally
by mcp-marketplace · Finance
Free stock data and market news for any MCP-compatible AI assistant.