Johnny Ho (hewu2008)

Company: Huawei Technologies Co. LTD

Location: Hefei, Anhui, China

Home Page: http://blog.csdn.net/hewu_2008

Johnny Ho's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | Stargazers: 89775 | Issues: 675 | Issues: 7264

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 35898 | Issues: 348 | Issues: 1732

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 35151 | Issues: 355 | Issues: 305

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 27787 | Issues: 188 | Issues: 4393

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 24495 | Issues: 207 | Issues: 208

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19248 | Issues: 297 | Issues: 1339

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | Stargazers: 11064 | Issues: 202 | Issues: 2165

nvim-lspconfig

Quickstart configs for Nvim LSP

Language: Lua | License: Apache-2.0 | Stargazers: 10078 | Issues: 84 | Issues: 1145

AISystem

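AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks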

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9744 | Issues: 137 | Issues: 31

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9507 | Issues: 160 | Issues: 617

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language: Python | License: Apache-2.0 | Stargazers: 9036 | Issues: 88 | Issues: 709

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language: Python | License: GPL-3.0 | Stargazers: 8653 | Issues: 57 | Issues: 480

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language: Python | License: MIT | Stargazers: 8640 | Issues: 94 | Issues: 180
Language: Python | License: NOASSERTION | Stargazers: 8231 | Issues: 153 | Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7404 | Issues: 110 | Issues: 150
Language: Python | License: Apache-2.0 | Stargazers: 7028 | Issues: 66 | Issues: 68

corenet

CoreNet: A library for training deep neural networks

Language: Python | License: NOASSERTION | Stargazers: 6836 | Issues: 62 | Issues: 19

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 5966 | Issues: 36 | Issues: 955

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language: Python | License: Apache-2.0 | Stargazers: 5662 | Issues: 66 | Issues: 127

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 4247 | Issues: 42 | Issues: 175

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language: Python | License: MIT | Stargazers: 4163 | Issues: 33 | Issues: 434

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python | License: Apache-2.0 | Stargazers: 2729 | Issues: 26 | Issues: 174

zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Language: Jupyter Notebook | License: MIT | Stargazers: 2725 | Issues: 30 | Issues: 180

detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Language: Python | License: Apache-2.0 | Stargazers: 1923 | Issues: 26 | Issues: 160

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | Stargazers: 1762 | Issues: 24 | Issues: 172

dlrover

DLRover: An Automatic Distributed Deep Learning System

Language: Python | License: NOASSERTION | Stargazers: 1100 | Issues: 50 | Issues: 217

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

Language: C++ | License: Apache-2.0 | Stargazers: 782 | Issues: 37 | Issues: 232

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 581 | Issues: 7 | Issues: 90
Language: Go | Stargazers: 5 | Issues: 0 | Issues: 0