Back to Home
yjcyxky icon

research-hub-mcp

by yjcyxky

Overview

The server converts raw text into TSV tables by extracting entities with GLiNER and leveraging an OpenAI-compatible vLLM endpoint, facilitating structured data extraction for research and analysis.

Installation

Run Command
cargo run --release -- server

Environment Variables

  • TEXT2TABLE_VLLM_URL
  • TEXT2TABLE_GLINER_URL
  • HUGGINGFACE_HUB_TOKEN
  • GOOGLE_SCHOLAR_API_KEY
  • OPENAI_API_KEY
  • TEXT2TABLE_API_KEY
  • TEXT2TABLE_GLINER_API_KEY

Security Notes

The server's Rust core integrates Python plugins for PDF downloading, which utilize `playwright` for headless browser automation. Running a full browser introduces a significant attack surface, potentially vulnerable to browser-based exploits if interacting with malicious or compromised web content during PDF retrieval. While the Rust code implements robust security practices for network requests (HTTPS-only, TLS validation, user-agent rotation) and file handling (path sanitization, integrity verification, size limits), the dynamic nature of web interaction through `playwright` inherently elevates risk. Reliance on external Python environments and services like GROBID (potentially containerized) also adds to the complexity and potential attack vectors if not meticulously secured. API keys for services like Google Scholar, PubMed, Semantic Scholar, CORE, and arXiv are expected via environment variables, not hardcoded. The project also incorporates circuit breakers and rate limiting for external services, enhancing stability but not directly mitigating the browser-based risks.

Similar Servers

Stats

Interest Score0
Security Score6
Cost ClassHigh
Avg Tokens10000
Stars0
Forks0
Last Update2025-12-23

Tags

text-to-tableNLPLLMentity-extractiondata-extractionvLLMGLiNERacademic-researchpdf-processing