init's starred repositories

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

llama.cpp

LLM inference in C/C++

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 28048 | Issues: 301 | Issues: 89

QAnything

Question and Answer based on Anything.

Language: Python | License: AGPL-3.0 | Stargazers: 11523 | Issues: 102 | Issues: 403

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 11291 | Issues: 160 | Issues: 300

vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Language: Python | License: MIT | Stargazers: 10999 | Issues: 79 | Issues: 294

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 7097 | Issues: 66 | Issues: 71

ai-chatbot

A full-featured, hackable Next.js AI chatbot built by Vercel

Language: TypeScript | License: NOASSERTION | Stargazers: 6223 | Issues: 78 | Issues: 222

langgraph

Build resilient language agents as graphs.

Language: Python | License: MIT | Stargazers: 5920 | Issues: 64 | Issues: 257

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python | License: Apache-2.0 | Stargazers: 5246 | Issues: 39 | Issues: 37

Qwen-Agent

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python | License: NOASSERTION | Stargazers: 3225 | Issues: 29 | Issues: 347

ChatRTX

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Language: TypeScript | License: NOASSERTION | Stargazers: 2672 | Issues: 57 | Issues: 75

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts to open any meaningful PR on this repo and integrate as many LLM-related technologies as possible. We have built a fine-tuning platform that makes it easy for researchers to get started with and use large models, and we welcome open-source enthusiasts to submit any meaningful PR!

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2588 | Issues: 36 | Issues: 100

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language: Python | License: Apache-2.0 | Stargazers: 1413 | Issues: 23 | Issues: 60

minisora

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Language: Python | License: Apache-2.0 | Stargazers: 1177 | Issues: 18 | Issues: 63

raptor

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Language: Python | License: MIT | Stargazers: 876 | Issues: 10 | Issues: 39

DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Language: Python | License: MIT | Stargazers: 790 | Issues: 13 | Issues: 28

gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook | License: MIT | Stargazers: 538 | Issues: 8 | Issues: 47

Awesome-Scientific-Language-Models

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery

License: MIT | Stargazers: 429 | Issues: 9 | Issues: 0

resp

Fetch Academic Research Papers from different sources

Language: Python | License: Apache-2.0 | Stargazers: 380 | Issues: 3 | Issues: 4

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 341 | Issues: 7 | Issues: 21

semanticscholar

Unofficial Python client library for Semantic Scholar APIs.

Language: Python | License: MIT | Stargazers: 295 | Issues: 10 | Issues: 50

TravelPlanner

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Language: Python | License: MIT | Stargazers: 223 | Issues: 9 | Issues: 24

CEPE

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Language: Python | License: MIT | Stargazers: 135 | Issues: 5 | Issues: 5

TableLlama

[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".

Language: Python | License: MIT | Stargazers: 109 | Issues: 3 | Issues: 11

co-llm

Co-LLM: Learning to Decode Collaboratively with Multiple Language Models

ADaPT

Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"

Language: Python | License: MIT | Stargazers: 69 | Issues: 3 | Issues: 7

llm-planning-eval

[ACL'24] Code and data for the paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"