mcp-server-datahub
Verified Safe
by acryldata
Overview
The DataHub MCP Server enables AI agents to discover data assets in a DataHub ecosystem, explore their lineage, understand how they are used, and generate SQL queries against them.
Installation
uv run mcp-server-datahub --transport stdio
Environment Variables
- DATAHUB_GMS_URL
- DATAHUB_GMS_TOKEN
- TOOL_RESPONSE_TOKEN_LIMIT
- ENTITY_SCHEMA_TOKEN_BUDGET
- DISABLE_NEWER_GMS_FIELD_DETECTION
- DATAHUB_MCP_DISABLE_DEFAULT_VIEW
- SEMANTIC_SEARCH_ENABLED
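As a rough illustration of how the launch command and the variables above fit together, the Python sketch below assembles a launch specification that an MCP client could use to start the server over stdio. The helper `build_launch_spec` and the `command`/`args`/`env` layout are hypothetical, and treating `DATAHUB_GMS_URL` and `DATAHUB_GMS_TOKEN` as required is an assumption; the listing does not say which variables are mandatory or what values the others accept.

```python
import os
from typing import Mapping

# Assumption: the two connection variables are treated as required here.
REQUIRED_VARS = ("DATAHUB_GMS_URL", "DATAHUB_GMS_TOKEN")

# The remaining variables from the list above are forwarded only when set.
OPTIONAL_VARS = (
    "TOOL_RESPONSE_TOKEN_LIMIT",
    "ENTITY_SCHEMA_TOKEN_BUDGET",
    "DISABLE_NEWER_GMS_FIELD_DETECTION",
    "DATAHUB_MCP_DISABLE_DEFAULT_VIEW",
    "SEMANTIC_SEARCH_ENABLED",
)


def build_launch_spec(env: Mapping[str, str]) -> dict:
    """Return a hypothetical launch spec: the command, its arguments, and the env to pass."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise ValueError("missing required variables: " + ", ".join(missing))

    # Forward only the documented variables, never the whole environment.
    forwarded = {
        name: env[name]
        for name in REQUIRED_VARS + OPTIONAL_VARS
        if env.get(name)
    }
    return {
        "command": "uv",
        "args": ["run", "mcp-server-datahub", "--transport", "stdio"],
        "env": forwarded,
    }


if __name__ == "__main__":
    try:
        spec = build_launch_spec(os.environ)
    except ValueError as exc:
        print(f"cannot build launch spec: {exc}")
    else:
        # Print variable names only, so the token never reaches the terminal.
        print(spec["command"], *spec["args"])
        print("forwarding:", ", ".join(sorted(spec["env"])))
```

Forwarding an explicit allow-list rather than the full environment keeps unrelated secrets out of the server process, which may or may not matter in a given deployment.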
Security Notes
The server parses filter strings with `json.loads` and then passes the result through the Pydantic-validated `load_filters` function from the DataHub SDK, which provides input sanitization and validation. Descriptions are sanitized against HTML/Markdown injection and ReDoS attacks. Authentication relies on environment variables (`DATAHUB_GMS_URL`, `DATAHUB_GMS_TOKEN`). GraphQL queries are executed via the DataHub SDK. No direct `eval` or `exec` on user-controlled input was identified within the core server logic. `subprocess.run` is used only in development and testing scripts, not in the runtime server.
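The parse-then-validate pattern described above can be sketched in a few lines of Python. This is a stand-in using only the standard library, not the DataHub SDK's `load_filters`: the function `parse_filter_string`, the `FilterError` class, and the filter shape (a JSON object mapping field names to lists of strings) are all hypothetical and chosen only to illustrate the two stages.

```python
import json
from typing import Any


class FilterError(ValueError):
    """Raised when a filter string is not valid JSON or has an unexpected shape."""


def parse_filter_string(raw: str) -> dict[str, list[str]]:
    """Parse a filter string, then check its structure before anything uses it."""
    # Stage 1: json.loads only builds plain data; it never executes code.
    try:
        data: Any = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise FilterError(f"not valid JSON: {exc.msg}") from exc

    # Stage 2: structural validation (hypothetical shape, see the lead-in).
    if not isinstance(data, dict):
        raise FilterError("filter must be a JSON object")
    for key, values in data.items():
        if not isinstance(values, list) or not all(isinstance(v, str) for v in values):
            raise FilterError(f"values for {key!r} must be a list of strings")
    return data


if __name__ == "__main__":
    print(parse_filter_string('{"platform": ["snowflake"]}'))
    try:
        parse_filter_string('{"platform": "snowflake"}')
    except FilterError as exc:
        print("rejected:", exc)
```

Keeping parsing and validation as separate stages means malformed JSON and well-formed but unexpected structures fail with distinct errors before any query is built.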
Similar Servers
OpenMetadata
Exposes REST APIs for managing Apache Airflow DAGs and OpenMetadata workflow definitions, enabling programmatic deployment and lifecycle management of data pipelines.
powerbi-modeling-mcp
Automate Power BI semantic model development and management using AI agents via the MCP protocol.
Web-Algebra
Translates natural language instructions into RDF operations for loading, querying, and writing Linked Data, consumable via executable JSON or as MCP tools for AI agents.
chip
Provides AI agents with access to Collibra Data Governance Center capabilities for data asset discovery, business glossary queries, and detailed asset information retrieval.