hxdaze's repositories
Fast-BEV
Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline
mmfewshot
OpenMMLab FewShot Learning Toolbox and Benchmark
open_clip
An open source implementation of CLIP.
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
simple_bev
A Simple Baseline for BEV Perception
RenderPipeline
Physically Based Shading and Deferred Rendering for the Panda3D game engine
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
OpenScene
3D Occupancy Prediction Benchmark in Autonomous Driving
SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
HojiChar
The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.
minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
dkt
A Tutorial on Manipulator Differential Kinematics
text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
llmtune
4-Bit Finetuning of Large Language Models on One Consumer GPU
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
VisualRWKV
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
YabLoc
Open source visual localization for self-driving vehicles
UniAD
[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving
Awesome-Open-Vocabulary-Object-Detection
A curated list of papers, datasets and resources pertaining to open vocabulary object detection.
mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
SegFormer
Official PyTorch implementation of SegFormer
LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Multimodal-GPT
Multimodal-GPT
simpleAI
An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients.
aws-mlops-handson
This repository provides a comprehensive ML infrastructure for CTR prediction, focusing on AWS services and offering practical learning experience for MLOps.
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".