LucienCho's starred repositories

ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language: Python | License: MIT | Stargazers: 870 | Issues: 0

ttt-lm-jax

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language: Python | Stargazers: 313 | Issues: 0

align-anything

Align Anything: Training Any Modality Model with Feedback

Language: Python | License: Apache-2.0 | Stargazers: 68 | Issues: 0

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Stargazers: 513 | Issues: 0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 26432 | Issues: 0

Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Language: Python | License: MIT | Stargazers: 731 | Issues: 0

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Language: Python | License: MIT | Stargazers: 284 | Issues: 0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | Stargazers: 6663 | Issues: 0

Language: C++ | License: NOASSERTION | Stargazers: 217 | Issues: 0

denser-retriever

An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.

Language: TypeScript | License: MIT | Stargazers: 142 | Issues: 0

Perplexica

Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

Language: TypeScript | License: MIT | Stargazers: 11779 | Issues: 0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | Stargazers: 40155 | Issues: 0

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 13971 | Issues: 0

Chinese-AMR

Chinese AMR Corpus

Language: Python | Stargazers: 35 | Issues: 0

Llama3-Chinese-Chat

This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.

Stargazers: 295 | Issues: 0

Phi2-mini-Chinese

Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for plugging into langchain to load a local knowledge base for retrieval-augmented generation (RAG).

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 441 | Issues: 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 25167 | Issues: 0

Mixture-of-depths

Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language: Python | Stargazers: 116 | Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 8160 | Issues: 0

HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Language: Python | License: BSD-3-Clause | Stargazers: 1211 | Issues: 0

cobra

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Language: Python | License: MIT | Stargazers: 224 | Issues: 0

Language: HTML | Stargazers: 22 | Issues: 0

uiautomator2

Android Uiautomator2 Python Wrapper

Language: Python | License: MIT | Stargazers: 6274 | Issues: 0

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language: Python | License: MIT | Stargazers: 4739 | Issues: 0

AutoWebGLM

An LLM-based Web Navigating Agent (KDD'24)

Language: Python | License: Apache-2.0 | Stargazers: 552 | Issues: 0

pyreft

ReFT: Representation Finetuning for Language Models

Language: Python | License: Apache-2.0 | Stargazers: 987 | Issues: 0

MINI_LLM

A repository for individuals to experiment with and reproduce the LLM pre-training process.

Language: Python | Stargazers: 293 | Issues: 0

ChatLM-mini-Chinese

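A small 0.2B Chinese chat model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks and includes a fine-tuning example for triple information extraction.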

Language: Python | License: Apache-2.0 | Stargazers: 1031 | Issues: 0

veScale

A PyTorch Native LLM Training Framework

Language: Python | License: Apache-2.0 | Stargazers: 521 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 21048 | Issues: 0