Back to Home
Traves-Theberge icon

PDFlow

Verified Safe

by Traves-Theberge

Overview

Transform PDF documents into structured data (Markdown, JSON, XML, etc.) using multimodal AI, with web UI, CLI, and AI agent integration.

Installation

Run Command
export GEMINI_API_KEY="your-api-key-here" && USER_ID=$(id -u) GROUP_ID=$(id -g) docker-compose up -d

Environment Variables

  • GEMINI_API_KEY
  • PDFLOW_BASE_URL

Security Notes

The project demonstrates a strong focus on security. Explicit path validation functions (`isValidPathComponent`, `getSecureFilePath`) are implemented to prevent directory traversal in API routes. The `child_process.spawn` method is used over `exec` with arguments passed as an array to prevent command injection when executing shell scripts. API keys are managed client-side in `sessionStorage` or via environment variables, not hardcoded. Docker configuration uses a non-root user (`nextjs:nodejs`), `read_only: true` for the filesystem (with exceptions for necessary write directories), `no-new-privileges:true`, and resource limits, adhering to robust container security practices. The MCP server also implements file path validation against a configurable list of `ALLOWED_DIRECTORIES`.

Similar Servers

Stats

Interest Score36
Security Score9
Cost ClassMedium
Avg Tokens5000
Stars4
Forks1
Last Update2025-11-21

Tags

PDFAI ExtractionMultimodal AIData TransformationAPI