hoshi-hiyouga (hiyouga)

hiyouga

Geek Repo

Company:Millennium Science School

Location:Beijing, China

Home Page:https://scholar.google.com/citations?user=QQtacXUAAAAJ&hl=en

Twitter:@llamafactory_ai

Github PK Tool:Github PK Tool


Organizations
the-seeds

hoshi-hiyouga's starred repositories

developer-roadmap

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

Language:TypeScriptLicense:NOASSERTIONStargazers:278667Issues:6815Issues:1937

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:44901Issues:404Issues:82

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:32382Issues:340Issues:59

chroma

the AI-native open-source embedding database

Language:RustLicense:Apache-2.0Stargazers:12896Issues:78Issues:992

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:11003Issues:73Issues:446

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:10440Issues:101Issues:318

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5250Issues:60Issues:87

clip-interrogator

Image to prompt with BLIP and CLIP

Language:PythonLicense:MITStargazers:2532Issues:30Issues:93

Qwen-Agent

Agent framework and applications built upon Qwen1.5 & Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language:PythonLicense:NOASSERTIONStargazers:1837Issues:28Issues:166

DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Language:PythonLicense:Apache-2.0Stargazers:1698Issues:40Issues:276

Data-Copilot

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow

Language:PythonLicense:MITStargazers:1302Issues:11Issues:45

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileLicense:MITStargazers:1267Issues:20Issues:32

pymilvus

Python SDK for Milvus.

Language:PythonLicense:Apache-2.0Stargazers:916Issues:17Issues:797

Yuan-2.0

Yuan 2.0 Large Language Model

Language:PythonLicense:NOASSERTIONStargazers:659Issues:5Issues:90

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training & inference

Language:PythonLicense:MITStargazers:586Issues:7Issues:65

gpt_paper_assistant

GPT4 based personalized ArXiv paper assistant bot

Language:PythonLicense:Apache-2.0Stargazers:431Issues:6Issues:10

H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

llm-inference-benchmark

LLM Inference benchmark

Language:PythonLicense:MITStargazers:259Issues:2Issues:2
Language:PythonLicense:Apache-2.0Stargazers:255Issues:1Issues:7

unicom

universal visual model trained on LAION-400M

Language:PythonLicense:MITStargazers:168Issues:3Issues:22

factoid-wiki

Dense X Retrieval: What Retrieval Granularity Should We Use?

Efficient-LLM-Survey

The Efficiency Spectrum of LLM

enhance_long

This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without training, and can be used directly in the LLM inference phase.

Language:PythonStargazers:47Issues:0Issues:0

chain-of-thought

Research papers about Chain of Thought (CoT)

Blue-arXiv-Theme

Blue theme for arXiv website

Language:JavaScriptStargazers:24Issues:1Issues:0