Zheng.Deng (dengzheng-cloud)

dengzheng-cloud

Geek Repo

Location:shanghai

Github PK Tool:Github PK Tool

Zheng.Deng's repositories

mlc-lcm

implement LCM(Latent Consistency Model) via tvm, then use it in Android, all of this is for work.

License:GPL-3.0Stargazers:1Issues:1Issues:0

chill

chill for me

Language:CLicense:Apache-2.0Stargazers:0Issues:1Issues:0

competition_compute

hom many situations TES will meet.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

cuda_holiday_draft

happy qingming festival, totally cuda draft

Language:CudaStargazers:0Issues:1Issues:0

CUGfred

Config files for my GitHub profile.

Stargazers:0Issues:1Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:0Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

Language:CLicense:MITStargazers:0Issues:0Issues:0

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

Language:C++Stargazers:0Issues:0Issues:0

TensorRT

TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0