Back to Home
acryldata icon

mcp-server-datahub

Verified Safe

by acryldata

Overview

The DataHub MCP Server enables AI agents to discover, explore lineage, understand usage, and generate SQL queries for data assets within a DataHub ecosystem.

Installation

Run Command
uv run mcp-server-datahub --transport stdio

Environment Variables

  • DATAHUB_GMS_URL
  • DATAHUB_GMS_TOKEN
  • TOOL_RESPONSE_TOKEN_LIMIT
  • ENTITY_SCHEMA_TOKEN_BUDGET
  • DISABLE_NEWER_GMS_FIELD_DETECTION
  • DATAHUB_MCP_DISABLE_DEFAULT_VIEW
  • SEMANTIC_SEARCH_ENABLED

Security Notes

The server uses `json.loads` for filter strings, which is then passed through a Pydantic-validated `load_filters` function from the DataHub SDK, providing input sanitization and validation. Descriptions are sanitized against HTML/Markdown injection and ReDoS attacks. Authentication relies on environment variables (`DATAHUB_GMS_URL`, `DATAHUB_GMS_TOKEN`). GraphQL queries are executed via the DataHub SDK. No direct `eval` or `exec` on user-controlled input is identified within the core server logic. The `subprocess.run` usage is confined to development/testing scripts and not part of the runtime server operation.

Similar Servers

Stats

Interest Score43
Security Score9
Cost ClassLow
Avg Tokens12000
Stars64
Forks29
Last Update2025-11-20

Tags

Data DiscoveryData GovernanceData LineageSQL GenerationAI Agents