hiyouga

followers

following

stars

Millennium Science School

Beijing, China

https://scholar.google.com/citations?user=QQtacXUAAAAJ&hl=en

@llamafactory_ai

Organizations

the-seeds

hoshi-hiyouga's starred repositories

developer-roadmap

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

Language:TypeScriptNOASSERTION278667 6815 1937

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookMIT44901 404 82

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookApache-2.032382 340 59

chroma

the AI-native open-source embedding database

Language:RustApache-2.012896 78 992

Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.011003 73 446

mamba

Mamba SSM architecture

Language:PythonApache-2.010440 101 318

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

ML-Papers-Explained

Explanation to key concepts in ML

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonBSD-3-Clause5250 60 87

clip-interrogator

Image to prompt with BLIP and CLIP

Language:PythonMIT2532 30 93

Qwen-Agent

Agent framework and applications built upon Qwen1.5 & Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language:PythonNOASSERTION1837 28 166

DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Language:PythonApache-2.01698 40 276

Data-Copilot

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow

Language:PythonMIT1302 11 45

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileMIT1267 20 32

pymilvus

Python SDK for Milvus.

Language:PythonApache-2.0916 17 797

Yuan-2.0

Yuan 2.0 Large Language Model

Language:PythonNOASSERTION659 5 90

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training & inference

Language:PythonMIT586 7 65

gpt_paper_assistant

GPT4 based personalized ArXiv paper assistant bot

Language:PythonApache-2.0431 6 10

H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Language:Python296 5 27

llm-inference-benchmark

LLM Inference benchmark

Language:PythonMIT259 2 2

URIAL

Language:PythonApache-2.0255 1 7

unicom

universal visual model trained on LAION-400M

Language:Python203 9 21

LoftQ

Language:PythonMIT168 3 22

text2image_safety

Language:PythonMIT102 2 6

factoid-wiki

Dense X Retrieval: What Retrieval Granularity Should We Use?

Apache-2.0101 8 3

Efficient-LLM-Survey

The Efficiency Spectrum of LLM

enhance_long

This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without training, and can be used directly in the LLM inference phase.

Language:Python4700

chain-of-thought

Research papers about Chain of Thought (CoT)

Blue-arXiv-Theme

Blue theme for arXiv website

Language:JavaScript24 10