qingsong99's repositories

AutoKernel

AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。

License:Apache-2.0Stargazers:0Issues:0Issues:0

av_hubert

A self-supervised learning framework for audio-visual speech

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

Stargazers:0Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers:0Issues:0Issues:0

awesome-NeRF

A curated list of awesome neural radiance fields papers

License:MITStargazers:0Issues:0Issues:0

BEVDet

Official code base of the BEVDet series .

License:Apache-2.0Stargazers:0Issues:0Issues:0

BEVFormer

This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

License:MITStargazers:0Issues:0Issues:0

ddia

《Designing Data-Intensive Application》DDIA中文翻译

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

GroundingDINO

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

License:Apache-2.0Stargazers:0Issues:0Issues:0

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

Stargazers:0Issues:0Issues:0

JioNLP

中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com

License:Apache-2.0Stargazers:0Issues:0Issues:0

jukebox

Code for the paper "Jukebox: A Generative Model for Music"

License:NOASSERTIONStargazers:0Issues:0Issues:0

LLM-Training-Puzzles

What would you do with 1000 H100s...

License:MITStargazers:0Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

License:NOASSERTIONStargazers:0Issues:0Issues:0

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

License:NOASSERTIONStargazers:0Issues:0Issues:0

MAE-code

Pytorch implementation of Masked Auto-Encoder

License:GPL-3.0Stargazers:0Issues:0Issues:0

mctx

Monte Carlo tree search in JAX

License:Apache-2.0Stargazers:0Issues:0Issues:0

MLQuestions

Machine Learning and Computer Vision Engineer - Technical Interview Questions

Stargazers:0Issues:0Issues:0

omnizart

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

License:MITStargazers:0Issues:0Issues:0

open-Chinese-ChatLLaMA

The complete training code of the open-source Chinese-Llama model, including the full process from pre-training instructing and RLHF.

Stargazers:0Issues:0Issues:0

open_clip

An open source implementation of CLIP.

License:NOASSERTIONStargazers:0Issues:0Issues:0

pbrtbook

pbrt 中文整合翻译 基于物理的渲染:从理论到实现 Physically Based Rendering: From Theory To Implementation

License:NOASSERTIONStargazers:0Issues:0Issues:0

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

License:Apache-2.0Stargazers:0Issues:0Issues:0

regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

License:MITStargazers:0Issues:0Issues:0

self_supervised

Implementation of popular SOTA self-supervised learning algorithms as Fastai Callbacks.

License:Apache-2.0Stargazers:0Issues:0Issues:0

spleeter

Deezer source separation library including pretrained models.

License:MITStargazers:0Issues:0Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

License:Apache-2.0Stargazers:0Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

License:MITStargazers:0Issues:0Issues:0

x-stable-diffusion

Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention.

License:Apache-2.0Stargazers:0Issues:0Issues:0