wtmarvel's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66842Issues:555Issues:706

Fooocus

Focus on prompting and generating

Language:PythonLicense:GPL-3.0Stargazers:38702Issues:290Issues:1439

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23490Issues:252Issues:283

llama2.c

Inference Llama 2 in one file of pure C

codellama

Inference code for CodeLlama models

Language:PythonLicense:NOASSERTIONStargazers:13860Issues:159Issues:169

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:12611Issues:117Issues:921

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11286Issues:167Issues:224

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10586Issues:141Issues:338

Yi

A series of large language models trained from scratch by developers @01-ai

Language:PythonLicense:Apache-2.0Stargazers:7509Issues:111Issues:289

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7393Issues:110Issues:150

consistency_models

Official repo for consistency models.

Language:PythonLicense:MITStargazers:6037Issues:60Issues:51

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5702Issues:66Issues:406

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5387Issues:64Issues:96

CoDeF

[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language:PythonLicense:NOASSERTIONStargazers:4798Issues:73Issues:79

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4232Issues:61Issues:92

FaceDetection-DSFD

腾讯优图高精度双分支人脸检测器

Language:PythonLicense:NOASSERTIONStargazers:2885Issues:106Issues:89

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language:PythonLicense:MITStargazers:2219Issues:30Issues:109

DragGAN

Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold

Language:PythonLicense:MITStargazers:2160Issues:49Issues:10

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2157Issues:32Issues:101

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language:PythonLicense:BSD-3-ClauseStargazers:2126Issues:22Issues:310

Candle

GRBL controller application with G-Code visualizer written in Qt.

Language:C++License:GPL-3.0Stargazers:1351Issues:125Issues:576

voxceleb_trainer

In defence of metric learning for speaker recognition

Language:PythonLicense:MITStargazers:1009Issues:30Issues:172

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

DPE

[CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing

Language:PythonLicense:MITStargazers:417Issues:22Issues:24

vampnet

music generation with masked transformers!

Language:Jupyter NotebookLicense:MITStargazers:274Issues:8Issues:30

AttentionIsOFFByOne

Implementation of "Attention Is Off By One" by Evan Miller

Language:PythonLicense:MITStargazers:175Issues:5Issues:6

HADAR

This is an LWIR stereo-hyperspectral database to develop HADAR algorithms for thermal navigation. Based on this database, one can develop algorithms for TeX decomposition to generate TeX vision. One can also develop algorithms about object detection, semantic or scene segmentation, optical or scene flow, stereo depth etc. based on TeX vision instead of traditional RGB or thermal vision.

Language:PythonLicense:MITStargazers:160Issues:5Issues:24

TTS-TextAnalyzer

TTS Text Analyzer

License:Apache-2.0Stargazers:32Issues:6Issues:0