
Point Hub
Ajouter un commentaireVue d'ensemble
-
Missions postés 0
Description de l'entreprise
DeepSeek’s First-generation Reasoning Models
DeepSeek’s first-generation thinking designs, accomplishing efficiency similar to OpenAI-o1 throughout math, code, and thinking jobs.
Models
DeepSeek-R1
Distilled designs
DeepSeek group has demonstrated that the of larger designs can be distilled into smaller sized designs, leading to better efficiency compared to the thinking patterns discovered through RL on small designs.
Below are the models produced through fine-tuning against a number of dense models commonly used in the research study neighborhood utilizing thinking data produced by DeepSeek-R1. The assessment results show that the distilled smaller dense designs perform extremely well on benchmarks.
DeepSeek-R1-Distill-Qwen-1.5 B
DeepSeek-R1-Distill-Qwen-7B
DeepSeek-R1-Distill-Llama-8B
DeepSeek-R1-Distill-Qwen-14B
DeepSeek-R1-Distill-Qwen-32B
DeepSeek-R1-Distill-Llama-70B
License
The model weights are certified under the MIT License. DeepSeek-R1 series assistance commercial use, enable any adjustments and derivative works, including, but not restricted to, distillation for training other LLMs.