ShenDezhou

ShenDezhou

Geek Repo

Company:Tsinghua University

Location:Beijing

Home Page:http://www.tsinghuaboy.com

Github PK Tool:Github PK Tool

ShenDezhou's starred repositories

llm-playground

Experiments with open source LLMs

Language:PythonLicense:MITStargazers:53Issues:0Issues:0

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Language:PythonLicense:NOASSERTIONStargazers:9399Issues:0Issues:0

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Language:GoLicense:MITStargazers:70573Issues:0Issues:0
Language:PythonLicense:MITStargazers:168Issues:0Issues:0

LDDL

Distributed preprocessing and data loading for language datasets

Language:PythonLicense:NOASSERTIONStargazers:36Issues:0Issues:0

character-bert-pretraining

Code for pre-training CharacterBERT models (as well as BERT models).

Language:PythonLicense:Apache-2.0Stargazers:34Issues:0Issues:0

ColossalAI-Examples

Examples of training models with hybrid parallelism using ColossalAI

Language:PythonLicense:Apache-2.0Stargazers:333Issues:0Issues:0

LLM-Workshop

LLM Workshop by Sourab Mangrulkar

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:283Issues:0Issues:0

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7165Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38112Issues:0Issues:0

Llama2-chinese

Llama2 chinese finetuning

Language:PythonLicense:MITStargazers:37Issues:0Issues:0

llama2-lora-fine-tuning

llama2 finetuning with deepspeed and lora

Language:PythonLicense:MITStargazers:153Issues:0Issues:0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookStargazers:9944Issues:0Issues:0

MemGPT

Create LLM agents with long-term memory and custom tools 📚🦙

Language:PythonLicense:Apache-2.0Stargazers:10381Issues:0Issues:0
Language:PythonLicense:MITStargazers:3316Issues:0Issues:0

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language:C++License:Apache-2.0Stargazers:9298Issues:0Issues:0

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:1608Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:6991Issues:0Issues:0

vscode-extension-samples

Sample code illustrating the VS Code extension API.

Language:TypeScriptLicense:MITStargazers:8244Issues:0Issues:0

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language:PythonLicense:MITStargazers:40428Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:59377Issues:0Issues:0
Language:PythonStargazers:349Issues:0Issues:0

natural-instructions

Expanding natural instructions

Language:PythonLicense:Apache-2.0Stargazers:911Issues:0Issues:0

nanoT5

Fast & Simple repository for pre-training and fine-tuning T5-style models

Language:PythonLicense:Apache-2.0Stargazers:931Issues:0Issues:0

bert_distill

BERT distillation(基于BERT的蒸馏实验 )

Language:PythonStargazers:304Issues:0Issues:0

beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Language:PythonLicense:Apache-2.0Stargazers:1425Issues:0Issues:0

duckdb-pgq

DuckDB is an in-process SQL OLAP Database Management System

Language:C++License:MITStargazers:34Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:10626Issues:0Issues:0

duckdb

DuckDB is an in-process SQL OLAP Database Management System

Language:C++License:MITStargazers:17723Issues:0Issues:0

arrow-tools

A collection of handy CLI tools to convert CSV and JSON to Apache Arrow and Parquet

Language:RustLicense:Apache-2.0Stargazers:126Issues:0Issues:0