DeepSeek (deepseek-ai)

DeepSeek

deepseek-ai

Organization data from Github https://github.com/deepseek-ai

Home Page:https://www.deepseek.com/

GitHub:@deepseek-ai

DeepSeek's repositories

awesome-deepseek-integration

Integrate the DeepSeek API into popular softwares

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonLicense:MITStargazers:22138Issues:220Issues:202

Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Language:PythonLicense:MITStargazers:17543Issues:150Issues:170

FlashMLA

FlashMLA: Efficient MLA kernels

3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

Language:C++License:MITStargazers:9299Issues:86Issues:207

DeepEP

DeepEP: an efficient expert-parallel communication library

Language:CudaLicense:MITStargazers:8512Issues:81Issues:275

open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

License:CC0-1.0Stargazers:7913Issues:479Issues:0

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileLicense:MITStargazers:6550Issues:103Issues:56

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Language:CudaLicense:MITStargazers:5712Issues:54Issues:109

DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Language:PythonLicense:MITStargazers:5045Issues:80Issues:108

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

smallpond

A lightweight data processing framework built on DuckDB and 3FS.

Language:PythonLicense:MITStargazers:4772Issues:47Issues:26

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:3958Issues:39Issues:58

DreamCraft3D

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Language:PythonLicense:MITStargazers:2969Issues:125Issues:65

DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Language:PythonLicense:MITStargazers:2888Issues:34Issues:38

DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Language:PythonLicense:MITStargazers:2858Issues:29Issues:11

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language:PythonLicense:MITStargazers:1796Issues:27Issues:42

EPLB

Expert Parallelism Load Balancer

Language:PythonLicense:MITStargazers:1265Issues:21Issues:17
License:NOASSERTIONStargazers:1187Issues:0Issues:0

profile-data

Analyze computation-communication overlap in V3/R1.

awesome-deepseek-coder

A curated list of open-source projects related to DeepSeek Coder

ESFT

Expert Specialized Fine-Tuning

Language:PythonLicense:MITStargazers:696Issues:15Issues:12