Commit Graph

  • 020b89c640
    Merge pull request #11 from nodoubtz/nodoubtz-patch-9 Nodoubtz 2025-03-22 14:04:14 -04:00
  • b10ecfe48f
    Create codeql.yml Nodoubtz 2025-03-22 14:03:03 -04:00
  • 47993f9318
    Create docker-publish.yml Nodoubtz 2025-03-22 14:01:33 -04:00
  • 2f21798107
    Create manual.yml Nodoubtz 2025-03-22 13:55:20 -04:00
  • 3e1d4cfa25
    Merge pull request #7 from nodoubtz/nodoubtz-patch-6 Nodoubtz 2025-03-22 13:51:33 -04:00
  • f0ccd7666e
    Create pylint.yml Nodoubtz 2025-03-22 13:50:30 -04:00
  • aa72c158bf my intro ritikajindal1127 2025-03-22 22:52:05 +05:30
  • e6cbe50ea7
    Create static.yml Nodoubtz 2025-03-21 22:48:32 -04:00
  • 45ba56b1a9
    Merge branch 'deepseek-ai:main' into patch-2 Nodoubtz 2025-03-21 22:08:27 -04:00
  • cf5db96939
    Create ibm.yml Nodoubtz 2025-03-21 21:59:29 -04:00
  • d659f51464
    Merge branch 'deepseek-ai:main' into main Nodoubtz 2025-03-21 21:53:41 -04:00
  • 0fbd7b02d2
    Merge 8941732439b791ad5ec5bf827e9ed93434cccb0d into a878eada08ea6913f5a2ae80a43afeffdef082ef Jafar Saad 2025-03-22 04:54:19 +07:00
  • 0fbb04d6be
    DeepSeek Commit Mohamed Elgendy 2025-03-20 17:44:07 +02:00
  • 08b00b7bdc
    DeepSeek Commit2 Mohamed Elgendy 2025-03-20 17:41:21 +02:00
  • 33e384c101
    DeepSeek Commit Mohamed Elgendy 2025-03-20 17:37:41 +02:00
  • 31c621793d
    abstract musvaage 2025-03-20 10:06:38 -05:00
  • a878eada08
    Delete DeepSeek_V3.pdf DeepSeekDDM 2025-03-16 23:42:21 +08:00
  • 98e67a71f4
    Update paper link DeepSeekDDM 2025-03-16 23:41:52 +08:00
  • 79a8026ae8
    Create devcontainer.json Nodoubtz 2025-03-15 13:39:16 -04:00
  • e4f555ca6f intro musvaage 2025-03-14 13:23:44 -05:00
  • 100a5aca1e
    Merge branch 'deepseek-ai:main' into patch-2 Nodoubtz 2025-03-14 07:40:03 -04:00
  • 2988c05daa
    Update generate.py Antonio Gallardo García 2025-03-13 13:38:21 +01:00
  • 3eefc21c1c
    Add files via upload sana329 2025-03-11 14:27:35 +05:30
  • 9844d3f642
    Add files via upload sana329 2025-03-11 14:26:22 +05:30
  • d3d00f45be
    deepseek file uploded sana329 2025-03-11 14:25:23 +05:30
  • 0cd41b2a5b
    Merge branch 'deepseek-ai:main' into main Nodoubtz 2025-03-08 19:11:07 -05:00
  • ac83e3fb95
    Update fp8_cast_bf16.py SS7896 2025-03-09 05:25:34 +07:00
  • 7fbf99b32b Add zh version of README windsonsea 2025-03-05 16:24:01 +08:00
  • 3421621d7b
    NoneType check A-transformer 2025-03-06 19:33:10 +04:00
  • be411d69f4 Fix: add metadata to bf16 safetensors for loading using transformers root 2025-03-06 14:25:47 +08:00
  • 408e6e188a
    Update README.md shihaobai 2025-03-03 20:16:37 +08:00
  • 73f2954fa8 polish shihaobai 2025-03-03 20:10:18 +08:00
  • ebd889518d
    Update kernel.py sunndy 2025-03-03 19:38:53 +08:00
  • 1ab09c8780 Docs: add LightLLM as supported engine shihaobai 2025-03-03 19:23:08 +08:00
  • d29a967601 modify the explanation of MLA huxuedan 2025-02-26 17:06:54 +08:00
  • 9539eba28d
    Rename DeepSeek_V3.pdf to DeepSeekv3pdf krackn88 2025-02-25 17:07:37 -05:00
  • 6db27b90e0
    Merge branch 'deepseek-ai:main' into main Can Deliktaş 2025-02-25 19:01:24 +03:00
  • d257cbe733
    Critical Improvements for Model Correctness, Efficiency, and Robustness Abdur Rahman 2025-02-25 21:58:34 +06:00
  • 3432a23b65
    Merge daba5c1f78885750c16181cdaa56324f710e7c02 into 592fd5daf8177b205af11651bbb31a1834a8b0e0 Ivan Lloyd Roquero 2025-02-25 21:11:38 +08:00
  • b156e5450b
    Update README.md Dhieu 2025-02-25 12:26:30 +03:00
  • 00da010bd1
    Merge 687f06b00410e759771e79087344e5371a930b3b into 592fd5daf8177b205af11651bbb31a1834a8b0e0 sudopacman 2025-02-24 11:44:08 +06:00
  • 592fd5daf8
    Delete CITATION.cff DeepSeekDDM 2025-02-24 11:50:20 +08:00
  • c9353aba6c
    Update bib info DeepSeekDDM 2025-02-24 11:25:44 +08:00
  • 57582c60f4
    Absorb w_uk into wo Xu Song 2025-02-22 14:50:32 +08:00
  • 973f949e94
    Update feature_request.md Adugna Gizaw 2025-02-22 04:11:12 +03:00
  • f23598fb22
    Create CODE_OF_CONDUCT.md Yen Huynh 2025-02-21 16:21:39 -05:00
  • 39cc27e8f0
    Merge 79f733dda5fad8900f8c1e790be5747fbee813a9 into f09f5fa321f5a421704136c0463b1eaca6557712 Yen Huynh 2025-02-21 16:18:01 -05:00
  • 79f733dda5
    Create CODE_OF_CONDUCT.md Yen Huynh 2025-02-21 16:17:45 -05:00
  • a3d882baf8
    Merge 1766d255bcd62b280119dc890bac68eee8ecc3a7 into f09f5fa321f5a421704136c0463b1eaca6557712 NKCSRairdrop NFT Finance Guardian 2025-02-21 18:33:37 +08:00
  • e1d0b2ad64 update by rr roshan1727 2025-02-20 19:45:53 +05:30
  • cb8d1f72e6
    Update README_Turkish.md Can Deliktaş 2025-02-19 18:55:04 +03:00
  • c909a3b3d5
    Delete languages/turkish directory Can Deliktaş 2025-02-19 18:28:30 +03:00
  • 3bca5239dc
    Create README_WEIGHTS_Turkish.md Can Deliktaş 2025-02-19 18:26:46 +03:00
  • 6b1cd5993a
    Create README_Turkish.md Can Deliktaş 2025-02-19 18:26:21 +03:00
  • 556c115fff
    Delete languages/turkish directory Can Deliktaş 2025-02-19 18:24:07 +03:00
  • 3fa7464346
    README_Turkish Can Deliktaş 2025-02-19 18:23:37 +03:00
  • 4aca6bd241
    Delete languages /Turkish directory Can Deliktaş 2025-02-19 18:20:45 +03:00
  • 4cbd0ab179
    Create t Can Deliktaş 2025-02-19 18:20:24 +03:00
  • 06ab453160
    Merge branch 'deepseek-ai:main' into main Can Deliktaş 2025-02-19 18:10:35 +03:00
  • 5bb008364b
    Add files via upload Can Deliktaş 2025-02-19 18:10:06 +03:00
  • 213bbf5ecf
    Rename README_WEIGHTS_Turkish.mdmd to README_WEIGHTS_Turkish.md Can Deliktaş 2025-02-19 18:08:32 +03:00
  • 76fd958ed4
    Rename README_turkish.md to README_Turkish.md Can Deliktaş 2025-02-19 18:08:10 +03:00
  • 085ed17781
    Rename README_WEIGHTS.md to README_WEIGHTS_Turkish.mdmd Can Deliktaş 2025-02-19 18:07:49 +03:00
  • 14fddd7cb7
    Rename README.md to README_turkish.md Can Deliktaş 2025-02-19 18:07:09 +03:00
  • 79d72ecd8d Optimize Multi-head Latent Attention (MLA) with Fast Path for Short Sequences XxAlonexX 2025-02-19 10:35:28 +05:30
  • f8b7c3b6e7 Merge branch 'main' of github.com:XxAlonexX/DeepSeek-V3 XxAlonexX 2025-02-19 10:32:29 +05:30
  • cc66d60c67 Optimize Multi-head Latent Attention (MLA) for Short Sequences XxAlonexX 2025-02-19 10:31:28 +05:30
  • 92931a9514
    Update model.py helme 2025-02-18 06:55:48 -12:00
  • 5c2346ddff
    Update model.py helme 2025-02-18 06:52:08 -12:00
  • f09f5fa321
    Merge pull request #616 from Konano/chore-readme Huang Panpan 2025-02-18 18:04:06 +08:00
  • 3189313c24
    sudo su && README.md deepseekr1d2 2025-02-17 15:16:29 -05:00
  • 1766d255bc
    Create python-publish.yml NKCSRairdrop NFT Finance Guardian 2025-02-17 15:36:16 +07:00
  • e598e95674
    Update LICENSE-CODE codechamp12345 2025-02-16 23:44:22 +05:30
  • 4a65fd9221 fix an args description. oyzh 2025-02-15 11:02:28 +08:00
  • 1398800ebf
    fix scores mask Xingkai Yu 2025-02-14 20:26:45 +08:00
  • 4e570a99a7 Fix incorrect comment in linear function regarding weight.element_size() iamvalenciia 2025-02-14 03:09:07 -05:00
  • f07bccc49e
    fix: resolve center alignment issue in preview Konano 2025-02-14 12:12:16 +08:00
  • 0866cab5f9
    chore: update README.md to improve layout and image attributes Konano 2025-02-14 12:02:10 +08:00
  • bd38425b0e [Edited] Fix minor bug in the main function MayureshMore 2025-02-13 08:58:05 -08:00
  • b3dfcef550 Automated change: No ML label MayureshMore 2025-02-12 10:07:29 -08:00
  • fed8284309
    Update README.md Can Deliktaş 2025-02-11 16:32:03 +03:00
  • d98f935545
    Update README.md Can Deliktaş 2025-02-11 16:28:36 +03:00
  • 7f8ae677e4
    Update translates TR README.md Can Deliktaş 2025-02-11 16:21:30 +03:00
  • 2c1de9ff1c
    Update translates TR README_WEIGHTS.md Can Deliktaş 2025-02-11 16:20:40 +03:00
  • 5342a74995
    Update translates TR README.md Can Deliktaş 2025-02-11 16:18:36 +03:00
  • 22db85a39a
    Update translates TR README_WEIGHTS.md Can Deliktaş 2025-02-11 16:17:25 +03:00
  • 9a9554dfe6
    Update translates TR README_WEIGHTS.md Can Deliktaş 2025-02-11 16:16:13 +03:00
  • 315dcb7e20
    Update translates TR README_WEIGHTS.md Can Deliktaş 2025-02-11 16:14:53 +03:00
  • 6939d4380f
    Update translates TR README_WEIGHTS.md Can Deliktaş 2025-02-11 16:13:21 +03:00
  • a68a1814de
    Update README.md Can Deliktaş 2025-02-11 16:08:31 +03:00
  • 339e500ec2
    Update README.md Can Deliktaş 2025-02-11 16:07:58 +03:00
  • 117240e2b8
    Update README.md Can Deliktaş 2025-02-11 16:06:34 +03:00
  • a8f596f900
    Update README.md Can Deliktaş 2025-02-11 16:05:31 +03:00
  • ca2dc67021
    Update README.md Can Deliktaş 2025-02-11 16:04:25 +03:00
  • c0511bfa74
    Update README.md Can Deliktaş 2025-02-11 16:02:37 +03:00
  • d389e53687
    Update README.md Can Deliktaş 2025-02-11 15:59:59 +03:00
  • 854ddc8ee9
    Create 陈诚 cc-helper 2025-02-11 11:47:34 +08:00
  • 897291478c
    Refactor checkpoint conversion script for improved readability and efficiency Tanmay Das 2025-02-10 18:40:56 -05:00
  • 83cdc4c226
    Create DeepSeek-V3، ASA700 2025-02-10 06:33:19 +03:00
  • d700e0056d
    Merge pull request #1 from wowrakibul/imgbot Wow Rakibul 2025-02-09 02:02:52 +06:00