Beast code in Giters

Okan Köpüklü's starred repositories

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT2079800

TrackTacular

Official Code for "Lifting Multi-View Detection and Tracking to the Bird’s Eye View"

Language:Python1400

acoustic-simulator

Implementation of audio degradation processes

Language:PythonGPL-3.09800

Neural-Network-Parameter-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Language:Python77600

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonGPL-3.0840500

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION548100

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonApache-2.0611400

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Language:PythonNOASSERTION257200

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT2672500

Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Language:PythonMIT125900

SinSR

[CVPR 2024] SinSR: Diffusion-Based Image Super-Resolution in a Single Step

Language:PythonNOASSERTION16600

DemoFusion

Let us democratise high-resolution generation! (CVPR 2024)

Language:Jupyter Notebook190100

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonMIT427000

Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language:PythonMIT70200

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonApache-2.0592300

EarlyBird

Official Code for "EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye View"

Language:Python3500

Code to reproduce the experiments described in "Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration" (https://arxiv.org/pdf/2309.03110.pdf)

Language:Python1400