Han Yang (yanghanxy)

Company: Xiaohongshu

Location: Shanghai, China

Han Yang's starred repositories

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | Stargazers: 1691 | Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9027 | Issues: 0

pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

Language: Python | License: MIT | Stargazers: 29327 | Issues: 0

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language: Python | License: MIT | Stargazers: 2001 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9033 | Issues: 0

Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): a local knowledge-base question-answering app built on Langchain and language models such as ChatGLM

Language: Python | License: Apache-2.0 | Stargazers: 28822 | Issues: 0

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6305 | Issues: 0

facebook-hive-udfs

Facebook's Hive UDFs

Language: Java | License: Apache-2.0 | Stargazers: 265 | Issues: 0

Language: C++ | License: Apache-2.0 | Stargazers: 1502 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 33320 | Issues: 0

WebCPM

Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"

Language: HTML | License: Apache-2.0 | Stargazers: 960 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 53853 | Issues: 0

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment

Language: Python | License: Apache-2.0 | Stargazers: 17730 | Issues: 0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python | License: Apache-2.0 | Stargazers: 6306 | Issues: 0

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

Stargazers: 8868 | Issues: 0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4367 | Issues: 0

the-algorithm-ml

Source code for Twitter's Recommendation Algorithm

Language: Python | License: AGPL-3.0 | Stargazers: 9937 | Issues: 0

the-algorithm

Source code for Twitter's Recommendation Algorithm

Language: Scala | License: AGPL-3.0 | Stargazers: 61573 | Issues: 0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python | License: Apache-2.0 | Stargazers: 11853 | Issues: 0

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Language: Python | License: Apache-2.0 | Stargazers: 1416 | Issues: 0

python_backend

Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.

Language: C++ | License: BSD-3-Clause | Stargazers: 493 | Issues: 0

Algorithm-Practice-in-Industry

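A collection of industry-practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, Datafuntalk, technical WeChat official accounts)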

Language: Python | License: BSD-2-Clause | Stargazers: 1697 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 33158 | Issues: 0

mastodon

Your self-hosted, globally interconnected microblogging community

Language: Ruby | License: AGPL-3.0 | Stargazers: 46221 | Issues: 0

AutoPhrase

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

Language: C++ | License: Apache-2.0 | Stargazers: 1167 | Issues: 0

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python | License: Apache-2.0 | Stargazers: 7219 | Issues: 0

alphaFM_softmax

Multi-thread implementation of Factorization Machines with FTRL for multi-class classification problem which uses softmax as hypothesis.

Language: C++ | License: MIT | Stargazers: 70 | Issues: 0

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language: Python | License: AGPL-3.0 | Stargazers: 8907 | Issues: 0

Diffusion-LM

Diffusion-LM

Language: Python | License: Apache-2.0 | Stargazers: 1000 | Issues: 0

LexiconAugmentedNER

Reject complicated operations for incorporating lexicon for Chinese NER.

Language: Python | Stargazers: 431 | Issues: 0