vibe
by COBACOBAINI
Overview
A fast, cross-platform desktop application for transcribing audio locally using Whisper models, with optional integration for OpenAI/Claude for summarization.
Installation
vibe --model <path_to_model.bin> --file <path_to_audio.wav> [--language <language>] [--temperature <temp>] [--n-threads <threads>] [--translate] [--max-text-ctx <tokens>] [--init-prompt <prompt>] [--write <output_path>] [--format <format>] [--word-timestamps] [--max-sentence-len <len>] [--diarize] [--diarize-vad-model <path>] [--diarize-speaker-id-model <path>]Environment Variables
- RUST_LOG
Security Notes
The application executes external binaries (FFmpeg, yt-dlp) and modifies system configurations (Windows Registry, apt packages on Linux). The `scripts/pre_build.js` downloads these external dependencies, which introduces supply chain risks if the download URLs or the binaries themselves are compromised. A critical concern is the `custom_command` option for FFmpeg, which could allow arbitrary command injection if user input is not properly sanitized. The `tauri-plugin-http` uses `unsafe-headers` for Ollama communication, potentially allowing arbitrary `Origin` headers. Screen capture on macOS also requires specific permissions.
Similar Servers
5ire
A desktop AI assistant client that integrates with various LLM providers and supports extensible tool and prompt functionalities via the Model Context Protocol (MCP).
kreuzberg
High-performance document intelligence platform for extracting text, metadata, and structured information (tables, images, chunks) from over 50 diverse document formats (PDFs, Office, images, HTML, etc.). It offers advanced OCR capabilities, multilingual support, and features like chunking, embeddings, and keyword extraction. Functionality is exposed via multiple language bindings and a Micro-service Communication Protocol (MCP) server for flexible integration.
code-assistant
An AI coding assistant that integrates with code editors to provide autonomous code analysis and modification, supporting multiple UI modes and the Agent Client Protocol (ACP).
gemini-cli-desktop
A cross-platform desktop and web application providing a modern UI for various AI CLIs (Gemini, Qwen, LLxprt), enabling structured interaction with AI models, visual tool confirmation, real-time thought processes, code diff viewing, chat history management, and file system integration.