DeepSeek-V3：小版本升級，大能力進階

發布于 2025-3-28 00:26

瀏覽

0收藏

在人工智能領域，模型的迭代升級一直是推動技術進步的關鍵力量。近期，DeepSeek 團隊發布了 DeepSeek-V3-0324 模型，雖然官方將其定義為小版本升級，但其帶來的能力提升卻令人矚目。本文將詳細介紹 DeepSeek-V3-0324 模型的更新內容、技術亮點以及使用建議，幫助大家更好地了解和應用這一強大的 AI 模型。

一、模型概述

DeepSeek-V3-0324 模型于 2025 年 3 月 24 日發布，是 DeepSeek-V3 的小版本升級版本。該版本在推理任務、前端開發能力、中文寫作能力、中文搜索能力等多個方面進行了優化和提升，同時保持了與前代版本一致的 API 接口和使用方式，用戶可以無縫切換至新版本，享受更加流暢和高效的對話體驗。

DeepSeek-V3：小版本升級，大能力進階-AI.x社區

二、技術亮點

（一）推理任務表現顯著提升

DeepSeek-V3-0324 模型借鑒了 DeepSeek-R1 模型訓練過程中的強化學習技術，在推理類任務上取得了顯著的進步。在多個評測集上，該模型的得分超過了 GPT-4.5，具體表現如下：

MMLU-Pro：從 75.9 提升至 81.2，提升了 5.3 分。
GPQA：從 59.1 提升至 68.4，提升了 9.3 分。
AIME：從 39.6 提升至 59.4，提升了 19.8 分。
LiveCodeBench：從 39.2 提升至 49.2，提升了 10.0 分。

這些數據表明，DeepSeek-V3-0324 模型在處理復雜推理任務時的準確性和效率都有了顯著提高。

（二）前端開發能力增強

在前端開發領域，DeepSeek-V3-0324 模型生成的 HTML 代碼不僅可用性更高，而且在視覺效果上也更加美觀和富有設計感。例如，模型能夠生成一個演示多個小球在指定空間內運動的 p5.js 程序，并以賽博朋克風格的 HTML 呈現，同時提供可調整重力、摩擦力等參數的滑動按鈕。

（三）中文寫作能力升級

DeepSeek-V3-0324 模型在中文寫作任務上進行了進一步優化，特別是在中長篇文本創作的內容質量上有了顯著提升。模型能夠生成符合 R1 寫作風格的文本，并在多輪交互式重寫、翻譯質量和書信寫作等方面表現出色。

（四）中文搜索能力優化

在聯網搜索場景下，DeepSeek-V3-0324 模型能夠輸出更詳實準確、排版更清晰美觀的結果。這對于需要生成報告類指令的用戶來說是一個巨大的福音。

（五）函數調用改進

DeepSeek-V3-0324 模型提高了函數調用的準確性，修復了前代版本中的一些問題，使得工具調用更加穩定和可靠。

三、快速使用

（一）模型部署

這里我們使用SGLang 進行推理，SGLang 是一個支持 MLA 優化、DP Attention、FP8（W8A8）、FP8 KV Cache 和 Torch Compile 的推理框架。它在開源框架中提供了最先進的延遲和吞吐量性能，并且完全支持在 NVIDIA 和 AMD GPU 上運行 DeepSeek-V3-0324 模型。推理代碼示例如下：

# 安裝 SGLang
pip install "sglang[all]>=0.4.3" --find-links https://flashinfer.ai/whl/cu124/torch2.5/flashinfer-python
# 啟動推理服務
python3 -m sglang.launch_server --model /Your_Model_Path/DeepSeek-V3-0324 --tp 8 --trust-remote-code

根據需要添加性能優化選項。

（二）API調用

import openai
client = openai.Client(
  base_url="http://127.0.0.1:30000/v1", 
  api_key="EMPTY")


  # Chat completion
response = client.chat.completions.create(
  model="default",
  messages=[
    {"role": "system", "content": "You are a helpful AI assistant"},
    {"role": "user", "content": "List 3 countries and their capitals."},
  ],
  temperature=0,
  max_tokens=64,
)
print(response)

四、使用建議

（一）系統提示

在使用 DeepSeek-V3-0324 模型時，建議在官方 DeepSeek 網站或應用程序中使用帶有特定日期的系統提示，例如：

```

該助手為 DeepSeek Chat，由深度求索公司創造。今天是 3 月 24 日，星期一。

```

（二）溫度設置

在 DeepSeek 的網頁和應用環境中，溫度參數被設置為 0.3。如果用戶在 API 調用時使用默認的溫度 1.0，DeepSeek 會自動將其調整為最適合模型的溫度設置 0.3。

（三）文件上傳提示

file_template = """[file name]: {file_name}[file content begin]{file_content}[file content end]{question}"""

（四）網頁查詢提示

{search_results}、{cur_date}和{question}是網頁查詢的參數。

中文查詢提示模版：

search_answer_zh_template = \
'''# 以下內容是基于用戶發送的消息的搜索結果:
{search_results}
在我給你的搜索結果中，每個結果都是
[webpage X begin]...[webpage X end]格式的，X代表每篇文章的數字索引。請在適當的情況下在句子末尾引用上下文。請按照引用編號[citation:X]的格式在答案中對應部分引用上下文。如果一句話源自多個上下文，請列出所有相關的引用編號，例如[citation:3][citation:5]，切記不要將引用集中在最后返回引用編號，而是在答案對應部分列出。
在回答時，請注意以下幾點：
- 今天是{cur_date}。
- 并非搜索結果的所有內容都與用戶的問題密切相關，你需要結合問題，對搜索結果進行甄別、篩選。
- 對于列舉類的問題（如列舉所有航班信息），盡量將答案控制在10個要點以內，并告訴用戶可以查看搜索來源、獲得完整信息。優先提供信息完整、最相關的列舉項；如非必要，不要主動告訴用戶搜索結果未提供的內容。
- 對于創作類的問題（如寫論文），請務必在正文的段落中引用對應的參考編號，例如[citation:3][citation:5]，不能只在文章末尾引用。你需要解讀并概括用戶的題目要求，選擇合適的格式，充分利用搜索結果并抽取重要信息，生成符合用戶要求、極具思想深度、富有創造力與專業性的答案。你的創作篇幅需要盡可能延長，對于每一個要點的論述要推測用戶的意圖，給出盡可能多角度的回答要點，且務必信息量大、論述詳盡。
- 如果回答很長，請盡量結構化、分段落總結。如果需要分點作答，盡量控制在5個點以內，并合并相關的內容。
- 對于客觀類的問答，如果問題的答案非常簡短，可以適當補充一到兩句相關信息，以豐富內容。
- 你需要根據用戶要求和回答內容選擇合適、美觀的回答格式，確保可讀性強。
- 你的回答應該綜合多個相關網頁來回答，不能重復引用一個網頁。
- 除非用戶要求，否則你回答的語言需要和用戶提問的語言保持一致。
# 用戶消息為：
{question}'''

英文查詢提示模版：

search_answer_en_template = \
'''# The following contents are the search results related to the user's message:
{search_results}
In the search results I provide to you, each result is formatted as [webpage X begin]...[webpage X end], where X represents the numerical index of each article. Please cite the context at the end of the relevant sentence when appropriate. Use the citation format [citation:X] in the corresponding part of your answer. If a sentence is derived from multiple contexts, list all relevant citation numbers, such as [citation:3][citation:5]. Be sure not to cluster all citations at the end; instead, include them in the corresponding parts of the answer.
When responding, please keep the following points in mind:
- Today is {cur_date}.
- Not all content in the search results is closely related to the user's question. You need to evaluate and filter the search results based on the question.
- For listing-type questions (e.g., listing all flight information), try to limit the answer to 10 key points and inform the user that they can refer to the search sources for complete information. Prioritize providing the most complete and relevant items in the list. Avoid mentioning content not provided in the search results unless necessary.
- For creative tasks (e.g., writing an essay), ensure that references are cited within the body of the text, such as [citation:3][citation:5], rather than only at the end of the text. You need to interpret and summarize the user's requirements, choose an appropriate format, fully utilize the search results, extract key information, and generate an answer that is insightful, creative, and professional. Extend the length of your response as much as possible, addressing each point in detail and from multiple perspectives, ensuring the content is rich and thorough.
- If the response is lengthy, structure it well and summarize it in paragraphs. If a point-by-point format is needed, try to limit it to 5 points and merge related content.
- For objective Q&A, if the answer is very brief, you may add one or two related sentences to enrich the content.
- Choose an appropriate and visually appealing format for your response based on the user's requirements and the content of the answer, ensuring strong readability.
- Your answer should synthesize information from multiple relevant webpages and avoid repeatedly citing the same webpage.
- Unless the user requests otherwise, your response should be in the same language as the user's question.
# The user's message is:
{question}'''

五、總結

DeepSeek-V3-0324 模型的發布，標志著 DeepSeek 團隊在人工智能領域的又一重要進步。通過在推理任務、前端開發能力、中文寫作能力、中文搜索能力等多個方面的優化和提升，該模型為用戶帶來了更加高效和優質的使用體驗。無論是開發者還是普通用戶，都可以從這一強大的 AI 模型中受益。如果你對 DeepSeek-V3-0324 模型感興趣，不妨嘗試使用它，感受其帶來的技術魅力。

六、項目資料

DeepSeek-V3-0324 模型的權重已經開源，用戶可以在以下地址下載模型權重：

Model Scope：??https://modelscope.cn/models/deepseek-ai/DeepSeek-V3-0324??
Huggingface：??https://huggingface.co/deepseek-ai/DeepSeek-V3-0324??

開源倉庫（包括模型權重）遵循MIT License，允許用戶利用模型輸出或通過模型蒸餾等方式訓練其他模型。

本文轉載自??小兵的AI視界??，作者：AGI小兵

標簽

DeepSeek-V3

小版本

已于2025-3-31 10:37:54修改

贊

回復

舉報

社區頭條

回復

成人免费xxxxx在线视频软件_久久精品久久久_亚洲国产精品久久久_天天色天天色_亚洲人成一区_欧美一级欧美三级在线观看

51CTO

51CTO博客

51CTO學堂