GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

table 5 comparison of deepseek distilled models

Download link

https://assets.chaos.social/media_attachments/files/113/900/526/392/664/519/original/b0af5ea8b0f56df9.png

Notices where this attachment appears

  1. Embed this notice
    Daniel (djh@chaos.social)'s status on Monday, 27-Jan-2025 22:24:06 JST Daniel Daniel

    @obrhoff It's all open research

    https://arxiv.org/search/cs?searchtype=author&query=DeepSeek-AI

    For details on deepseek-r1 and the qwen / llama distilled models, see

    https://arxiv.org/pdf/2501.12948

    for the distilled model benchmark see table 5.

    They're qwen / llama model architectures and different compared to their main contribution.

    In conversation about 3 months ago from chaos.social permalink
  • Help
  • About
  • FAQ
  • TOS
  • Privacy
  • Source
  • Version
  • Contact

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.