daiwk

daiwk

User data from Github https://github.com/daiwk

Location:beijing

Home Page:https://www.daiwk.net/

GitHub:@daiwk

daiwk's repositories

collections

https://www.daiwk.net/

Language:PythonStargazers:117Issues:5Issues:0
Language:Jupyter NotebookStargazers:4Issues:3Issues:0

build-nanogpt

Video+code lecture on building nanoGPT from scratch

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

License:MITStargazers:0Issues:0Issues:0

DeepEP

DeepEP: an efficient expert-parallel communication library

License:MITStargazers:0Issues:0Issues:0

DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

generative-recommenders

Repository hosting code used to reproduce results in "Actions Speak Louder than Words Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

HLLM

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Langchain-Chatchat

Langchain-Chatchat๏ผˆๅŽŸLangchain-ChatGLM๏ผ‰ๅŸบไบŽ Langchain ไธŽ ChatGLM ็ญ‰่ฏญ่จ€ๆจกๅž‹็š„ๆœฌๅœฐ็Ÿฅ่ฏ†ๅบ“้—ฎ็ญ” | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

llm-twin-course

๐Ÿค– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป for ๐—ณ๐—ฟ๐—ฒ๐—ฒ how to ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ an end-to-end ๐—ฝ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป-๐—ฟ๐—ฒ๐—ฎ๐—ฑ๐˜† ๐—Ÿ๐—Ÿ๐—  & ๐—ฅ๐—”๐—š ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ using ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€ best practices: ~ ๐˜ด๐˜ฐ๐˜ถ๐˜ณ๐˜ค๐˜ฆ ๐˜ค๐˜ฐ๐˜ฅ๐˜ฆ + 12 ๐˜ฉ๐˜ข๐˜ฏ๐˜ฅ๐˜ด-๐˜ฐ๐˜ฏ ๐˜ญ๐˜ฆ๐˜ด๐˜ด๐˜ฐ๐˜ฏ๐˜ด

License:MITStargazers:0Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

LLMs-from-scratch-CN

LLMs-from-scratch้กน็›ฎไธญๆ–‡็ฟป่ฏ‘

License:NOASSERTIONStargazers:0Issues:0Issues:0

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

open-r1

Fully open reproduction of DeepSeek-R1

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

recommenders-addons

Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.

Language:CudaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

s1

s1: Simple test-time scaling

Stargazers:0Issues:0Issues:0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

SSLRec

[WSDM'2024 Oral] "SSLRec: A Self-Supervised Learning Framework for Recommendation"

Language:PythonStargazers:0Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

transformers

๐Ÿค— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

trlx_new

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0