Jinze Xue (yueyericardo)

yueyericardo

Geek Repo

Company:University of Florida

Location:Gainesville, FL

Github PK Tool:Github PK Tool

Jinze Xue's starred repositories

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27023Issues:206Issues:203

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:22666Issues:186Issues:178

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:21271Issues:213Issues:118

llama2.c

Inference Llama 2 in one file of pure C

mlx

MLX: An array framework for Apple silicon

overleaf

A web-based collaborative LaTeX editor

Language:JavaScriptLicense:AGPL-3.0Stargazers:13057Issues:208Issues:1001

khoj

Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp.

Language:PythonLicense:AGPL-3.0Stargazers:11888Issues:70Issues:405

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:10797Issues:74Issues:12

phidata

Build AI Assistants with memory, knowledge and tools.

Language:PythonLicense:MPL-2.0Stargazers:10388Issues:81Issues:130

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8655Issues:81Issues:34

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7345Issues:84Issues:1524
Language:PythonLicense:Apache-2.0Stargazers:6987Issues:65Issues:66

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:5487Issues:57Issues:392
Language:PythonLicense:Apache-2.0Stargazers:4358Issues:73Issues:73

llm-viz

3D Visualization of an GPT-style LLM

ompi

Open MPI main development repository

Language:CLicense:NOASSERTIONStargazers:2065Issues:118Issues:3565

llama2.mojo

Inference Llama 2 in one file of pure 🔥

Language:MojoLicense:MITStargazers:2051Issues:27Issues:44

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language:PythonLicense:Apache-2.0Stargazers:1530Issues:17Issues:12

cali.so

Cali 的个人官网开源项目

benchmark

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Language:PythonLicense:BSD-3-ClauseStargazers:815Issues:227Issues:860
Language:PythonLicense:MITStargazers:502Issues:15Issues:16

cuda-training-series

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Language:CudaStargazers:416Issues:18Issues:0

AgentKit

An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natural language prompts.

Language:PythonLicense:CC-BY-4.0Stargazers:268Issues:8Issues:9

westpa

WESTPA: The Weighted Ensemble Simulation Toolkit with Parallelization and Analysis

Language:PythonLicense:MITStargazers:183Issues:24Issues:68

flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Language:CudaLicense:Apache-2.0Stargazers:155Issues:5Issues:4

fp6_llm

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Language:CudaLicense:Apache-2.0Stargazers:149Issues:4Issues:7
Language:HTMLLicense:MITStargazers:107Issues:1Issues:2

ohara

Collection of autoregressive model implementation

Language:PythonStargazers:61Issues:0Issues:0