Compare commits

11 Commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| epic miner | 92444bed68 | Merge 1e506b329de5e1d9477e429275ba7144563a10ba into 0cf78561f1d51c84a21b2190626b21116d5c68bb | 2025-04-09 13:36:35 +08:00 |
| Xingkai Yu | 0cf78561f1 | Merge pull request #129 from peti562/patch-2; fixing a typo | 2025-04-09 13:36:23 +08:00 |
| Xingkai Yu | 4a6d53cac8 | Merge pull request #189 from eladb/patch-1; chore: add syntax highlighting to citation | 2025-04-09 13:34:09 +08:00 |
| Xingkai Yu | f1e82facf1 | Merge pull request #100 from aBurmeseDev/main; docs: fix contact email link README.md | 2025-04-09 13:32:25 +08:00 |
| Elad Ben-Israel | 6a023be7cf | chore: add syntax highlighting to citation | 2025-01-30 10:06:19 +02:00 |
| Peter Makadi | 6e59fa73e6 | fixing a typo; in licences section, Llama should be capitalized | 2025-01-28 21:52:11 +01:00 |
| John L. | c942e96852 | chore: fix contact mailto link | 2025-01-27 18:06:01 -08:00 |
| epic miner | 1e506b329d | Update README.md | 2025-01-22 13:42:35 +00:00 |
| epic miner | 6f1dc6e281 | Update README.md | 2025-01-22 13:41:58 +00:00 |
| epic miner | 52e58ae62b | Rename Screenshot_2025-01-22_133155-yYdQLBGbN-transformed.png to benchmarks.png | 2025-01-22 13:41:15 +00:00 |
| epic miner | 0465545d78 | Add files via upload | 2025-01-22 19:10:41 +05:30 |

2 changed files with 5 additions and 5 deletions

@@ -41,7 +41,7 @@ To support the research community, we have open-sourced DeepSeek-R1-Zero, DeepSe
 **NOTE: Before running DeepSeek-R1 series models locally, we kindly recommend reviewing the [Usage Recommendation](#usage-recommendations) section.**
 <p align="center">
-<img width="80%" src="figures/benchmark.jpg">
+<img width="80%" src="figures/benchmarks.png">
 </p>
 ## 2. Model Summary
@@ -257,11 +257,11 @@ When responding, please keep the following points in mind:
 This code repository and the model weights are licensed under the [MIT License](https://github.com/deepseek-ai/DeepSeek-R1/blob/main/LICENSE).
 DeepSeek-R1 series support commercial use, allow for any modifications and derivative works, including, but not limited to, distillation for training other LLMs. Please note that:
 - DeepSeek-R1-Distill-Qwen-1.5B, DeepSeek-R1-Distill-Qwen-7B, DeepSeek-R1-Distill-Qwen-14B and DeepSeek-R1-Distill-Qwen-32B are derived from [Qwen-2.5 series](https://github.com/QwenLM/Qwen2.5), which are originally licensed under [Apache 2.0 License](https://huggingface.co/Qwen/Qwen2.5-1.5B/blob/main/LICENSE), and now finetuned with 800k samples curated with DeepSeek-R1.
-- DeepSeek-R1-Distill-Llama-8B is derived from Llama3.1-8B-Base and is originally licensed under [llama3.1 license](https://huggingface.co/meta-llama/Llama-3.1-8B/blob/main/LICENSE).
-- DeepSeek-R1-Distill-Llama-70B is derived from Llama3.3-70B-Instruct and is originally licensed under [llama3.3 license](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct/blob/main/LICENSE).
+- DeepSeek-R1-Distill-Llama-8B is derived from Llama3.1-8B-Base and is originally licensed under [Llama3.1 license](https://huggingface.co/meta-llama/Llama-3.1-8B/blob/main/LICENSE).
+- DeepSeek-R1-Distill-Llama-70B is derived from Llama3.3-70B-Instruct and is originally licensed under [Llama3.3 license](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct/blob/main/LICENSE).
 ## 8. Citation
-```
+```bibtex
 @misc{deepseekai2025deepseekr1incentivizingreasoningcapability,
 title={DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning},
 author={DeepSeek-AI},
@@ -274,4 +274,4 @@ DeepSeek-R1 series support commercial use, allow for any modifications and deriv
 ```
 ## 9. Contact
-If you have any questions, please raise an issue or contact us at [service@deepseek.com](service@deepseek.com).
+If you have any questions, please raise an issue or contact us at [service@deepseek.com](mailto:service@deepseek.com).

BIN figures/benchmarks.png (normal file)

Binary file not shown. Size after the change: 1.9 MiB.