Tiantian Han (Tiantian-Han)

Tiantian-Han

Geek Repo

Company:Xilinx Technology Beijing Limited

Github PK Tool:Github PK Tool

Tiantian Han's repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

bitsandbytes

8-bit CUDA functions for PyTorch

License:MITStargazers:0Issues:0Issues:0

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

License:NOASSERTIONStargazers:0Issues:0Issues:0

Efficient-LLMs-Survey

Efficient Large Language Models: A Survey

Stargazers:0Issues:0Issues:0

float8_experimental

This repository contains the experimental PyTorch native float8 training UX

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

fp6_llm

An efficient GPU support for LLM inference with 6-bit quantization (FP6).

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:CC0-1.0Stargazers:0Issues:0Issues:0

ggml

Tensor library for machine learning

License:MITStargazers:0Issues:0Issues:0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLM-FP4

The official implementation of the EMNLP 2023 paper LLM-FP4

License:MITStargazers:0Issues:0Issues:0

llm_interview_note

大模型面试题及答案,大模型八股文

Stargazers:0Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

LSQuantization

The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Megatron-LM

Ongoing research training transformer models at scale

License:NOASSERTIONStargazers:0Issues:0Issues:0

microxcaling

PyTorch emulation library for Microscaling (MX)-compatible data formats

License:MITStargazers:0Issues:0Issues:0

ml_dtypes

A stand-alone implementation of several NumPy dtype extensions used in machine learning.

License:Apache-2.0Stargazers:0Issues:0Issues:0

onnx2torch

Convert ONNX models to PyTorch.

License:Apache-2.0Stargazers:0Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

License:Apache-2.0Stargazers:0Issues:0Issues:0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

License:MITStargazers:0Issues:0Issues:0

qa-lora

Official PyTorch implementation of QA-LoRA

License:MITStargazers:0Issues:0Issues:0

QAQ-KVCacheQuantization

QAQ: Quality Adaptive Quantization for LLM KV Cache

License:Apache-2.0Stargazers:0Issues:0Issues:0

QuaRot

Code for QuaRot, an end-to-end 4-bit inference of large language models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

serverchan-demo

Server酱多语言调用实例

License:MITStargazers:0Issues:0Issues:0

tiny-asic-4bit-matrix-mul

Tiny matrix multiplication ASIC with 4-bit math

License:Apache-2.0Stargazers:0Issues:0Issues:0

UltraEval

An open source framework for evaluating foundation models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

License:Apache-2.0Stargazers:0Issues:0Issues:0

what-is

Important concepts in numerical linear algebra and related areas

Stargazers:0Issues:0Issues:0