Monohydroxides's starred repositories

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: Python | License: NOASSERTION | Stargazers: 82614 | Issues: 1744 | Issues: 45414

gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34953 | Issues: 342 | Issues: 2746

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 12132 | Issues: 99 | Issues: 532

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language: Python | License: Apache-2.0 | Stargazers: 10162 | Issues: 89 | Issues: 753

chibicc

A small C compiler

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7700 | Issues: 108 | Issues: 156

open_llama

OpenLLaMA, a permissively licensed open-source reproduction of Meta AI's LLaMA 7B trained on the RedPajama dataset

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 6579 | Issues: 37 | Issues: 1091

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 6016 | Issues: 74 | Issues: 534

ToonCrafter

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Language: Python | License: Apache-2.0 | Stargazers: 5185 | Issues: 58 | Issues: 53

silk-v3-decoder

[Skype Silk Codec SDK] Decode Silk v3 audio files (such as WeChat amr and aud files, and QQ slk files) and convert them to other formats (such as mp3). Supports batch conversion.

Language: C | License: MIT | Stargazers: 2651 | Issues: 72 | Issues: 0

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python | License: Apache-2.0 | Stargazers: 2608 | Issues: 12 | Issues: 173

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language: Python | License: NOASSERTION | Stargazers: 1777 | Issues: 26 | Issues: 46

dclm

DataComp for Language Models

Language: HTML | License: MIT | Stargazers: 1123 | Issues: 38 | Issues: 54

Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Language: Python | License: Apache-2.0 | Stargazers: 878 | Issues: 12 | Issues: 27

LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language: Python | License: MIT | Stargazers: 597 | Issues: 10 | Issues: 37

qq-win-db-key

QQ chat database decryption for all platforms

Language: Python | License: NOASSERTION | Stargazers: 465 | Issues: 10 | Issues: 31

FIFO-Diffusion_public

Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)

InfiniTransformer

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Language: Python | License: MIT | Stargazers: 336 | Issues: 8 | Issues: 24

QQ-History-Backup

[Discontinued] Export QQ/TIM chat history to HTML, with support for images and voice messages; usable with or without a GUI (Python)

Language: Python | License: MIT | Stargazers: 308 | Issues: 7 | Issues: 12

LongAlign

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Language: Python | License: Apache-2.0 | Stargazers: 199 | Issues: 8 | Issues: 11

diffusion-nlp-paper-arxiv

Automatically fetches diffusion NLP papers from arXiv. More information about the papers can be found in another repository, "Diffusion_NLP_Papers".

qq_msg_decode

Decode the QQ chat database

vscode-hpc

Remote development on HPC clusters with VSCode

Language: Jupyter Notebook | License: MIT | Stargazers: 31 | Issues: 4 | Issues: 0

Shmily-Get-MobileQQ-Andriod

Shmily-Get-QQ-Andriod

Language: Java | License: GPL-3.0 | Stargazers: 26 | Issues: 2 | Issues: 23

Diffusion_NLP_Papers

A list of diffusion papers in the NLP domain that I have read, mainly on text generation. The table will continue to be updated.

Language: Python | Stargazers: 12 | Issues: 0 | Issues: 0