Celery 4.0 - IT文库: free downloads of programming and IT e-books and documents for programmers


DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Moreover, both DeepSeek-V2 Chat (SFT) and DeepSeek-V2 Chat (RL) outperform GPT-4-0613 and ERNIEBot 4.0, solidifying the position of our models in the top-tier LLMs that support Chinese. Specifically, DeepSeek-V2 … the reasoning capability of DeepSeek-V2 Chat (RL) still lags behind giant models, such as Erniebot-4.0 and GPT-4s.

Model | Overall | Reasoning (Chinese reasoning): Avg., Math., Logi. | Language (Chinese language): Avg., Fund., Chi., Open …
… 67 8.47 8.65
DeepSeek-V2 Chat (RL) | 7.91 | 7.45, 7.77, 7.14 | 8.36, 8.10, 8.28, 8.37, 8.53, 8.33, 8.53
ERNIEBot-4.0-202404* (文心一言) | 7.89 | 7.61, 7.81, 7.41 | 8.17, 7.56, 8.53, 8.13, 8.45, 8.24, 8.09
DeepSeek-V2 Chat (SFT) | 7.74 | 7…

0 码力 | 52 pages | 1.23 MB | 1 year ago

