Back to Home
jessaminesimple608 icon

scraper-mcp

by jessaminesimple608

Overview

An MCP server for efficient web scraping, offering tools to extract raw HTML, convert to markdown, extract plain text, and discover links from webpages.

Installation

Run Command
docker-compose up -d

Environment Variables

  • TRANSPORT
  • HOST
  • PORT
  • CACHE_DIR
  • HTTP_PROXY
  • HTTPS_PROXY
  • NO_PROXY
  • SCRAPEOPS_API_KEY
  • SCRAPEOPS_RENDER_JS
  • SCRAPEOPS_RESIDENTIAL
  • SCRAPEOPS_COUNTRY
  • SCRAPEOPS_KEEP_HEADERS
  • SCRAPEOPS_DEVICE
  • ENABLE_CACHE_TOOLS

Security Notes

Critical security risks identified: 1. Default SSL verification is disabled: The `RequestsProvider` uses `verify_ssl=False` by default, making all HTTPS requests vulnerable to Man-in-the-Middle (MITM) attacks. 2. Unauthenticated Admin API and Dashboard: The `/healthz`, `/api/stats`, `/api/cache/clear`, `/api/config` endpoints and the root dashboard (`/`) are exposed without any visible authentication or authorization. This allows anyone with network access to query server statistics, clear the cache, and modify runtime configuration (e.g., proxy settings, concurrency). 3. Exposure of Cache Management Tools: If `ENABLE_CACHE_TOOLS` is set, cache management tools are also exposed via MCP without authentication. These issues make the server unsafe for deployment in public or untrusted networks without additional security measures (e.g., a reverse proxy with authentication).

Similar Servers

Stats

Interest Score30
Security Score3
Cost ClassMedium
Avg Tokens5000
Stars1
Forks0
Last Update2026-01-17

Tags

Web ScrapingHTML ProcessingMarkdown ConversionLink ExtractionCaching