Turkish-Content-Automation
Verified Safeby HamdiOzkurt
Overview
This server analyzes Turkish web content by scraping URLs, summarizing text using an LLM (optional), extracting keywords, and generating visual reports (word cloud, frequency chart) for integration with n8n automation workflows.
Installation
python mcp_server.pyEnvironment Variables
- GOOGLE_API_KEY
- GEMINI_MODEL
Security Notes
The code demonstrates good practices for handling sensitive information (loading API keys from .env). It performs extensive input sanitization using regex to clean scraped text from HTML tags, scripts, styles, and common spam patterns, significantly mitigating injection risks before processing or feeding to an LLM. No 'eval' or similar dangerous patterns are used directly on user input. The server listens on `0.0.0.0:8000`, which means it is externally accessible by default if not firewalled, a standard consideration for any network-facing service.
Similar Servers
DevDocs
DevDocs is a web crawling and content extraction platform designed to accelerate software development by converting documentation into LLM-ready formats for intelligent data querying and fine-tuning.
yargi-mcp
Provides programmatic access to various Turkish legal databases (e.g., Yargıtay, Danıştay, Constitutional Court) as a Model Context Protocol (MCP) server for integration with Large Language Model (LLM) applications like Claude AI.
lyra-tool-discovery
This MCP server is designed to fetch, parse, and organize documentation from websites implementing the llms.txt standard. It transforms raw documentation into structured, agent-ready formats, exposing tools for AI agents, LLMs, and automation workflows to consume documentation programmatically.
mcp-server-requests
An MCP server that provides HTTP request capabilities, enabling LLMs to fetch and process web content, including saving to files.