foreverpiano's starred repositories

Language: Python | Stargazers: 25 | Issues: 0

attention-gym

Helpful tools and examples for working with flex-attention

Language: Python | License: BSD-3-Clause | Stargazers: 275 | Issues: 0

PKU-Auto-Reservation

Automated campus-entry reservation for Huaqing University

Language: Python | License: GPL-3.0 | Stargazers: 16 | Issues: 0

zotero-style

Ethereal Style for Zotero

Language: JavaScript | License: AGPL-3.0 | Stargazers: 3362 | Issues: 0

cold-compress

Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.

Language: Python | License: BSD-3-Clause | Stargazers: 62 | Issues: 0

CogVideo

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 5745 | Issues: 0

SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Language: Python | License: NOASSERTION | Stargazers: 656 | Issues: 0

anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Language: Python | Stargazers: 605 | Issues: 0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1396 | Issues: 0

InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Language: Python | License: MIT | Stargazers: 259 | Issues: 0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | License: NOASSERTION | Stargazers: 8239 | Issues: 0

flux

Official inference repo for FLUX.1 models

Language: Python | License: Apache-2.0 | Stargazers: 9883 | Issues: 0

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 11148 | Issues: 0

shell_gpt

A command-line productivity tool, powered by AI large language models like GPT-4, that helps you accomplish your tasks faster and more efficiently.

Language: Python | License: MIT | Stargazers: 9169 | Issues: 0

ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Language: Python | License: Apache-2.0 | Stargazers: 337 | Issues: 0

cook

🍲 Alright, today we're cooking! OK, Let's Cook!

Language: Vue | License: MIT | Stargazers: 4923 | Issues: 0

patch_conv

Patch convolution to avoid large GPU memory usage of Conv2D

Language: Python | License: MIT | Stargazers: 72 | Issues: 0

retrieval-scaling

Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".

Language: Python | Stargazers: 73 | Issues: 0

DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Language: Python | Stargazers: 146 | Issues: 0

ngram

The n-gram Language Model

Language: C | Stargazers: 1259 | Issues: 0

ComfyUI

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Language: Python | License: GPL-3.0 | Stargazers: 47601 | Issues: 0

ShareGPT4Video

An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Language: Python | Stargazers: 1205 | Issues: 0

Awesome-Efficient-Diffusion-Models

Paper survey of efficient computation for large scale models.

License: Apache-2.0 | Stargazers: 22 | Issues: 0

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 453 | Issues: 0

DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Language: Python | License: Apache-2.0 | Stargazers: 726 | Issues: 0

onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1537 | Issues: 0

stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language: Python | License: MIT | Stargazers: 1117 | Issues: 0

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 14498 | Issues: 0

AISystem

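AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.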

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10121 | Issues: 0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language: Python | License: Apache-2.0 | Stargazers: 1586 | Issues: 0