luciencho

followers

following

stars

LucienCho's starred repositories

ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:PythonMIT87000

ttt-lm-jax

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:Python31300

align-anything

Align Anything: Training Any Modality Model with Feedback

Language:PythonApache-2.06800

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

LLM101n

LLM101n: Let's build a Storyteller

Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Language:PythonMIT73100

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Language:PythonMIT28400

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:Shell666300

KsanaLLM

Language:C++NOASSERTION21700

denser-retriever

An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.

Language:TypeScriptMIT14200

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptMIT1177900

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptNOASSERTION4015500

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT1397100

Chinese-AMR

Chinese AMR Corpus

Language:Python3500

Llama3-Chinese-Chat

This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.

Phi2-mini-Chinese

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Language:Jupyter NotebookApache-2.044100

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION2516700

Mixture-of-depths

Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language:Python11600

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonApache-2.0816000

HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Language:PythonBSD-3-Clause121100

cobra

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Language:PythonMIT22400

AndroidArena

Language:HTML2200

uiautomator2

Android Uiautomator2 Python Wrapper

Language:PythonMIT627400

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonMIT473900

AutoWebGLM

An LLM-based Web Navigating Agent (KDD'24)

Language:PythonApache-2.055200

pyreft

ReFT: Representation Finetuning for Language Models

Language:PythonApache-2.098700

MINI_LLM

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

Language:Python29300

ChatLM-mini-Chinese

中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。

Language:PythonApache-2.0103100

veScale

A PyTorch Native LLM Training Framework

Language:PythonApache-2.052100

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02104800