Server data from the Official MCP Registry
Pre-computed metadata context engine for AI-driven data analytics
Valid MCP server (2 strong, 1 medium validity signals). 1 known CVE in dependencies. Package registry verified. Imported from the Official MCP Registry.
Set these up before or after installing:
Environment variable: ANTHROPIC_API_KEY
Environment variable: DATARAUM_HOME
Add this to your MCP configuration file:
{
  "mcpServers": {
    "io-github-dataraum-dataraum": {
      "env": {
        "DATARAUM_HOME": "your-dataraum-home-here",
        "ANTHROPIC_API_KEY": "your-anthropic-api-key-here"
      },
      "args": [
        "dataraum"
      ],
      "command": "uvx"
    }
  }
}

From the project's GitHub README.
A rich metadata context engine for AI-driven data analytics.
Traditional semantic layers tell BI tools "what things are called." DataRaum tells AI "what the data means, how it behaves, how it relates, and what you can compute from it."
The core insight: AI agents don't need tools to discover metadata at runtime. They need rich, pre-computed context delivered in a format optimized for LLM consumption.
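To make the pre-computed-context idea concrete, here is a toy sketch (the function and field names are illustrative, not DataRaum's actual schema): metadata is computed once at ingestion time and handed to the model as a single compact blob, instead of being discovered through many runtime tool calls.

```python
import json
import statistics

def precompute_context(rows):
    """Summarize a table once, up front, into one LLM-ready JSON blob.

    Toy stand-in for a metadata pipeline; a real one would also cover
    types, relationships, and quality scores.
    """
    context = {}
    for col in rows[0]:
        values = [row[col] for row in rows]
        summary = {"distinct_values": len(set(values))}
        if all(isinstance(v, (int, float)) for v in values):
            summary.update(min=min(values), max=max(values),
                           mean=statistics.mean(values))
        context[col] = summary
    return json.dumps(context)

rows = [{"region": "EU", "revenue": 100},
        {"region": "US", "revenue": 300}]
print(precompute_context(rows))
```

The blob is computed once and can then be prepended to any number of analysis prompts, which is the delivery model the paragraph above describes.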
The most common way to use DataRaum is as an MCP server inside Claude Desktop (or any MCP-compatible client).
# Install
pip install dataraum
# Or with uv
uv pip install dataraum
Add to your Claude Desktop config (claude_desktop_config.json):
{
  "mcpServers": {
    "dataraum": {
      "command": "dataraum-mcp"
    }
  }
}
Then in Claude Desktop:
Add the CSV files in /path/to/my/data and measure data quality
The server runs a 17-phase analysis pipeline and makes these tools available:
| Tool | Description |
|---|---|
| begin_session | Start an investigation session with a contract |
| add_source | Register a data source (CSV, Parquet, JSON, or directory) |
| look | Explore data structure, relationships, and semantic metadata |
| measure | Measure entropy scores, readiness, and data quality |
| query | Natural language query against the data |
| run_sql | Execute SQL directly with export support |
| end_session | Archive workspace and end the session |
add_source(name="accounting", path="/path/to/data")
→ begin_session(intent="explore data quality", contract="exploratory_analysis")
→ look() # Understand the data
→ measure() # Check quality scores and readiness
→ query("total revenue?") # Ask questions
→ run_sql(sql="...", export_format="csv", export_name="report")
→ end_session(outcome="delivered")
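The measure step above reports entropy scores among its quality signals. As a rough illustration of what a column-level entropy score can capture, here is plain Shannon entropy over value frequencies (illustrative only; not necessarily DataRaum's exact metric):

```python
from collections import Counter
from math import log2

def column_entropy(values):
    """Shannon entropy (in bits) of a column's value distribution."""
    counts = Counter(values)
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in counts.values())

print(column_entropy(["a", "b", "c", "d"]))         # → 2.0 (all distinct)
print(column_entropy(["a", "a", "a", "a"]) == 0.0)  # → True (constant column)
```

A constant column carries no information, while a fully distinct column maximizes entropy, which is often a hint that it is an identifier rather than a measure.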
# Run analysis pipeline (writes metadata.db + data.duckdb to ./pipeline_output)
dataraum run /path/to/data
# Inspect what was produced
dataraum dev context ./pipeline_output
See CLI Reference for all options.
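Beyond `dataraum dev context`, you can inspect pipeline output directly. The sketch below assumes `metadata.db` is a SQLite file; since the real schema isn't documented here, it builds an in-memory stand-in with a hypothetical `columns` table so it runs self-contained, and only the inspection pattern (list tables, then dump rows) is the point.

```python
import sqlite3

# Stand-in for ./pipeline_output/metadata.db; the table name and fields
# here are hypothetical, not DataRaum's real schema.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE columns (name TEXT, inferred_type TEXT)")
con.executemany("INSERT INTO columns VALUES (?, ?)",
                [("region", "categorical"), ("revenue", "decimal")])

# Generic SQLite inspection: enumerate tables, then dump each one.
tables = [r[0] for r in
          con.execute("SELECT name FROM sqlite_master WHERE type='table'")]
print(tables)  # → ['columns']
for name, typ in con.execute("SELECT name, inferred_type FROM columns"):
    print(f"{name}: {typ}")
```

To inspect a real run, replace `":memory:"` with the path to the produced database file.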
DataRaum analyzes your data and generates rich, pre-computed metadata (written to metadata.db and data.duckdb in the pipeline output directory).
Semantic analysis requires an Anthropic API key:
export ANTHROPIC_API_KEY="sk-..."
Configure the LLM provider in config/llm/config.yaml. See Configuration for details.
git clone https://github.com/dataraum/dataraum
cd dataraum
# Install with dev dependencies (using uv)
uv sync --group dev
# Run tests
uv run pytest --testmon tests/unit -q
# Type check
uv run mypy src/
# Lint
uv run ruff check src/
uv run ruff format --check src/
Apache 2.0 — see LICENSE.