dsindex

Myungchul Shin's starred repositories

gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL

Language:TypeScriptISC18383 118 115

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonApache-2.014706 138 2102

mamba

Mamba SSM architecture

Language:PythonApache-2.012246 99 474

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++MIT7798 76 154

rags

Build ChatGPT over your data, all with natural language

Language:PythonMIT6168 55 39

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Language:PythonMIT4375 32 111

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonApache-2.02510 23 26

prompt2model

prompt2model - Generate Deployable Models from Natural Language Instructions

Language:PythonApache-2.01936 25 167

KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language:PythonNOASSERTION1062 21 41

ATLAS

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

Language:PythonApache-2.0893 24 8

Sharing the learning along the way we been gathering to enable Azure OpenAI at enterprise scale in a secure manner. GPT-RAG core is a Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

Language:BicepMIT823 15 68

mamba-chat

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

Language:PythonApache-2.0756 3 16

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language:PythonApache-2.0669 6 20

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonMIT524 24 69

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonMIT473 8 6

deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Language:PythonApache-2.0455 6 24

MathPile

Generative AI for Math: MathPile

Language:PythonApache-2.0367 7 5

galactic

data cleaning and curation for unstructured text

Language:PythonApache-2.0323 8 4

RetNet

Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.

Language:Jupyter NotebookMIT225 5 31