zhang-ge-hao

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Apache-2.0300

MFTCoder

High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.

Language:PythonNOASSERTION59900

EnergonAI

Large-scale model inference.

Language:PythonApache-2.062800

FasterTransformer4CodeFuse

High-performance LLM inference based on our optimized version of FastTransfomer

Language:C++NOASSERTION12400

ByteTransformer

optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

Language:C++Apache-2.044200

mac-precision-touchpad

Windows Precision Touchpad Driver Implementation for Apple MacBook / Magic Trackpad

Language:CNOASSERTION878100

LOFFER

博客主题 A Jekyll theme with Chinese UI and document

Language:SCSSMIT37600

fastllm

纯c++的全平台llm加速库，支持python调用，chatglm-6B级模型单卡可达10000+token / s，支持glm, llama, moss基座，手机端流畅运行

Language:C++Apache-2.0322700

BMInf

Efficient Inference for Big Models

Language:PythonApache-2.057100

NLPMetrics

Python code for various NLP metrics

Language:Jupyter NotebookMIT16700

picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

Language:PythonMIT314100

compose-spec

The Compose specification

Language:DockerfileApache-2.0217900

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonMIT797500

music_source_separation

Language:PythonNOASSERTION124100

funcom

Funcom Source Code Summarization Tool - Public Release

Language:PythonGPL-3.03400

Attn-to-FC

Language:Python1800

javalang

Pure Python Java parser and tools

Language:PythonMIT71700

aibolit

Static Analyzer for Java Code with Machine Learning in Mind

Language:Java5000

datasetforTBCCD

Language:Python2600

MSMARCO-Document-Ranking

MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, and passage/document ranking

Language:PythonCC-BY-4.011700