快速阅读
体验:访问DeepSeek官方网站chat.deepseek.com,与DeepSeek-R1进行聊天并开启“深度思考”功能。支持兼容OpenAI格式的API。
总结:上下文长度128K,每百万输出令牌费用2.19美元(大模型界拼多多),性能与OpenAI-o1相当,全开源模型支持自由蒸馏和商业化使用,前端功能编写极具想象力
基础模型:DeepSeek-R1-Zero与DeepSeek-R1均基于DeepSeek-V3-Base训练,DeepSeek-R1经过少量长CoT数据强化学习,输出内容更结构化且简约。
蒸馏模型(提升现有开源小模型能力):将DeepSeek-R1蒸馏到多个更小的模型,包括(Ollama中可下载)Qwen2.5-Math-1.5B、Qwen2.5-Math-7B、Qwen2.5-14B、Qwen2.5-32B、Llama-3.1-8B、Llama-3.3-70B-Instruct。
(adsbygoogle=window.adsbygoogle||[]).push({});

Qwen2.5-Math-1.5B、Qwen2.5-Math-7B、Qwen2.5-14B、Qwen2.5-32B、Llama-3.1-8B、Llama-3.3-70B-Instruct
问题,API的上下文长度在编程中可能不够
<img decoding="async" class="aligncenter wp-image-19046" title="DeepSeek-R1详解-1" src="https://www.aisharenet.com/wp-content/uploads/2025/01/696bdcec4240a75.jpg" alt="DeepSeek-R1详解-1" width="1161" height="488" srcset="https://www.aisharenet.com/wp-content/uploads/2025/01/696bdcec4240a75.jpg 2128w, https://www.aisharenet.com/wp-content/uploads/2025/01/696bdcec4240a75-300x126.jpg 300w, https://www.aisharenet.com/wp-content/uploads/2025/01/696bdcec4240a75-1024x430.jpg 1024w, https://www.aisharenet.com/wp-content/uploads/2025/01/696bdcec4240a75-768x323.jpg 768w, https://www.aisharenet.com/wp-content/uploads/2025/01/696bdcec4240a75-1536x645.jpg 1536w, https://www.aisharenet.com/wp-content/uploads/2025/01/696bdcec4240a75-2048x860.jpg 2048w, https://www.aisharenet.com/wp
暂无评论