Jie (luome)

luome

Geek Repo

Location:Beijing

Github PK Tool:Github PK Tool

Jie's starred repositories

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9439Issues:0Issues:0

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49363Issues:0Issues:0

flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Language:CudaLicense:Apache-2.0Stargazers:534Issues:0Issues:0

algebraic-nnhw

Deep learning accelerator architectures requiring half the multipliers

Language:PythonStargazers:258Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15541Issues:0Issues:0

jsonformer

A Bulletproof Way to Generate Structured JSON from Language Models

Language:Jupyter NotebookLicense:MITStargazers:4294Issues:0Issues:0

outlines

Structured Text Generation

Language:PythonLicense:Apache-2.0Stargazers:7983Issues:0Issues:0

induced-rationales-markup-tokens

Paper reproduction code: Induced Natural Language Rationales and Interleaved Markup Tokens Enable Extrapolation in Large Language Models

Language:Jupyter NotebookLicense:MITStargazers:6Issues:0Issues:0
Language:PythonLicense:MITStargazers:311Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:7053Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8940Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:31381Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:7688Issues:0Issues:0

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Language:PythonLicense:MITStargazers:2255Issues:0Issues:0

chroma

the AI-native open-source embedding database

Language:RustLicense:Apache-2.0Stargazers:14198Issues:0Issues:0

litellm

Python SDK, Proxy Server to call 100+ LLM APIs using the OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Language:PythonLicense:NOASSERTIONStargazers:11550Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:8269Issues:0Issues:0

GPTs

leaked prompts of GPTs

Stargazers:28091Issues:0Issues:0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++License:MITStargazers:7794Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:63866Issues:0Issues:0

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Language:GoLicense:MITStargazers:86155Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7976Issues:0Issues:0

aimoneyhunter

ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English version for more insights.

Stargazers:12788Issues:0Issues:0

llm-inference-benchmark

LLM Inference benchmark

Language:PythonLicense:MITStargazers:315Issues:0Issues:0

RRHF

[NIPS2023] RRHF & Wombat

Language:PythonStargazers:786Issues:0Issues:0

lobe-chat

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / DeepSeek),Knowledge Base(file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.

Language:TypeScriptLicense:NOASSERTIONStargazers:37346Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29285Issues:0Issues:0

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:5726Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1966Issues:0Issues:0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5465Issues:0Issues:0