qingsong99's repositories
AutoKernel
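AutoKernel is a simple, easy-to-use, low-barrier automatic operator optimization tool that improves the deployment efficiency of deep learning algorithms.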
av_hubert
A self-supervised learning framework for audio-visual speech
Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Awesome-Multimodal-Large-Language-Models
✨✨Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
awesome-NeRF
A curated list of awesome neural radiance fields papers
BEVDet
Official code base of the BEVDet series.
BEVFormer
This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
ddia
Chinese translation of Designing Data-Intensive Applications (DDIA)
GroundingDINO
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
JioNLP
A Chinese NLP Preprocessing & Parsing Package: accurate, efficient, and easy to use. www.jionlp.com
jukebox
Code for the paper "Jukebox: A Generative Model for Music"
LLM-Training-Puzzles
What would you do with 1000 H100s...
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
MAE-code
PyTorch implementation of Masked Auto-Encoder
mctx
Monte Carlo tree search in JAX
MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
omnizart
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.
open-Chinese-ChatLLaMA
The complete training code of the open-source Chinese-Llama model, covering the full process from pre-training through instruction tuning to RLHF.
open_clip
An open source implementation of CLIP.
pbrtbook
Consolidated Chinese translation of pbrt, Physically Based Rendering: From Theory To Implementation
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
regmix
🧬 RegMix: Data Mixture as Regression for Language Model Pre-training
self_supervised
Implementation of popular SOTA self-supervised learning algorithms as Fastai Callbacks.
spleeter
Deezer source separation library including pretrained models.
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
x-stable-diffusion
Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention.