能力: - 语音转录支持本地(WhisperCpp/FasterWhisper) 和在线(B接口/J接口??) - 字幕翻译支持传统引擎和LLM - 传统引擎: DeepL/微软/谷歌 - LLM: Ollama、DeepSeek、硅基流动以及【OpenAI兼容接口】 (配套提供LLM API中转站)
安装部署 - Windows提供一键安装包 - MacOS需要自行基于python搭建,且作者说未验证过 👎 。另外本地 whisper 功能尚不支持macos)
能力: - 语音转录支持本地(WhisperCpp/FasterWhisper) 和在线(B接口/J接口??) - 字幕翻译支持传统引擎和LLM - 传统引擎: DeepL/微软/谷歌 - LLM: Ollama、DeepSeek、硅基流动以及【OpenAI兼容接口】 (配套提供LLM API中转站)
安装部署 - Windows提供一键安装包 - MacOS需要自行基于python搭建,且作者说未验证过 👎 。另外本地 whisper 功能尚不支持macos)
有在线版本,也可自部署(nodejs 18.18+)
LLM只支持OpenAI 和 DeepSeek(不支持 【兼容OpenAI的API】)
传统翻译引擎支持DeepL, Google, Azure,不支持国内翻译平台
The analysis uncovered an average of 11 different types of data out of the 35 possible. As mentioned earlier, Google Gemini stands out as the most data-hungry service, collecting 22 of these data types, including highly sensitive data like precise location, user content, the device's contacts list, browsing history, and more.Among the analyzed applications, only Google Gemini, Copilot, and Perplexity were found to collect precise location data. The controversial DeepSeek chatbot stands right in the middle, collecting 11 unique types of data, such as user input like chat history.
Detailed explanation of what DeepSeek model is doing differently to improve performance and training time over ChatGPT.
Italian DPA bans DeepSeek after the Chinese company refused to provide GDPR related info. Pertaining to app, pd collection, contactpoint and transmission of data to outside the EU. In 2023 it did the same for a few weeks until OpenAI provided answers and point of contact. (And it fined OpenAI). Will other DPAs follow suit?
Take aways: AI will become cheaper and more efficient. - closed source models can cache responses and save computations for repetitive queries - closed source also has possibility of iterative improvements using constant reinforcement learning. - Prioritizing capabilities and deliberate strategy in data selection, carefully designed training objectives.
Chinese LLM spooking US based corps it seems. #openvraag why?
How to pick a LLM (Jan 2025): — Claude Sonnet is my daily driver. Fast, great writing and great code. — o1 / o1 pro for complex reasoning tasks (tough refactor) — Deepseek v3 for fast cheap API / 4-o replacement — Gemini for ultra long context, Flash and video understanding