There are 185 repositories under the language-model topic.
21 Lessons, Get Started Building with Generative AI
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
The official gpt4free repository | a collection of various powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Code and documentation to train Stanford's Alpaca models, and generate the data.
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
An open source implementation of CLIP.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
A PyTorch-based Speech Toolkit
A framework for few-shot evaluation of language models.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Large Scale Chinese Corpus for NLP
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
PyTorch implementation of Google AI's 2018 BERT
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
GPT 3.5/4 with a Chat Web UI. No API key required.
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Aligning pretrained language models with instruction data generated by themselves.
Speech To Speech: an effort for an open-source and modular GPT-4o
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
The book "Large Language Models" (《大语言模型》), by Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, and Wen Jirong