Overview

  • Jobs posted: 0

Company description

DeepSeek’s First-generation Reasoning Models

DeepSeek’s first-generation reasoning models, achieving performance comparable to OpenAI-o1 across math, code, and reasoning tasks.

Models

DeepSeek-R1

Distilled models

The DeepSeek team has demonstrated that the reasoning patterns of larger models can be distilled into smaller models, resulting in better performance than the reasoning patterns discovered through RL on small models.
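As a rough illustration of distillation through fine-tuning on teacher-generated reasoning data, the sketch below turns teacher samples into supervised fine-tuning records. Everything in it is an assumption made for illustration: the `TeacherSample` fields, the prompt/completion record layout, and the `<think>` delimiters are hypothetical and are not taken from this page or from DeepSeek's training pipeline.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass
class TeacherSample:
    """One hypothetical sample generated by the larger (teacher) model."""
    prompt: str
    reasoning: str
    answer: str


def to_sft_record(sample: TeacherSample,
                  think_open: str = "<think>",
                  think_close: str = "</think>") -> Dict[str, str]:
    """Format a teacher sample as a prompt/completion pair.

    The delimiters around the reasoning trace are an assumed convention.
    """
    completion = (
        f"{think_open}\n{sample.reasoning.strip()}\n{think_close}\n"
        f"{sample.answer.strip()}"
    )
    return {"prompt": sample.prompt.strip(), "completion": completion}


def build_sft_dataset(samples: Iterable[TeacherSample]) -> List[Dict[str, str]]:
    """Keep only samples with a non-empty final answer and format them."""
    return [to_sft_record(s) for s in samples if s.answer.strip()]


if __name__ == "__main__":
    demo = [TeacherSample("What is 2+3?", "2 plus 3 is 5.", "5")]
    for record in build_sft_dataset(demo):
        print(record)
```

The smaller model would then be fine-tuned on these records with an ordinary supervised objective; that training loop is out of scope for this sketch.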

Below are the models created by fine-tuning several dense models widely used in the research community on reasoning data generated by DeepSeek-R1. The evaluation results show that the distilled smaller dense models perform exceptionally well on benchmarks.

DeepSeek-R1-Distill-Qwen-1.5B

DeepSeek-R1-Distill-Qwen-7B

DeepSeek-R1-Distill-Llama-8B

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill-Llama-70B
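The names above follow a visible pattern: a fixed prefix, the base model family, and a parameter count in billions. A minimal sketch of reading that pattern programmatically follows; the helper names (`parse_model_name`, `models_by_family`) and the regular expression are illustrative assumptions, not part of any DeepSeek tooling.

```python
import re
from typing import Dict, List, NamedTuple


class DistilledModel(NamedTuple):
    name: str
    base_family: str       # e.g. "Qwen" or "Llama"
    params_billion: float  # size suffix taken from the name


# Assumed naming pattern, inferred only from the six names listed above.
_NAME_PATTERN = re.compile(
    r"^DeepSeek-R1-Distill-(?P<family>[A-Za-z]+)-(?P<size>\d+(?:\.\d+)?)B$"
)

MODEL_NAMES = [
    "DeepSeek-R1-Distill-Qwen-1.5B",
    "DeepSeek-R1-Distill-Qwen-7B",
    "DeepSeek-R1-Distill-Llama-8B",
    "DeepSeek-R1-Distill-Qwen-14B",
    "DeepSeek-R1-Distill-Qwen-32B",
    "DeepSeek-R1-Distill-Llama-70B",
]


def parse_model_name(name: str) -> DistilledModel:
    """Split a distilled-model name into base family and size."""
    match = _NAME_PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognized model name: {name!r}")
    return DistilledModel(name, match["family"], float(match["size"]))


def models_by_family(names: List[str]) -> Dict[str, List[DistilledModel]]:
    """Group parsed models by their base family, preserving input order."""
    groups: Dict[str, List[DistilledModel]] = {}
    for name in names:
        model = parse_model_name(name)
        groups.setdefault(model.base_family, []).append(model)
    return groups


if __name__ == "__main__":
    for family, models in models_by_family(MODEL_NAMES).items():
        sizes = ", ".join(f"{m.params_billion:g}B" for m in models)
        print(f"{family}: {sizes}")
```

Grouping by family makes it easy to see that four of the listed models build on Qwen and two on Llama.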

License

The model weights are licensed under the MIT License. The DeepSeek-R1 series supports commercial use and allows any modifications and derivative works, including, but not limited to, distillation for training other LLMs.