Zeno's repositories
BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
build-your-own-x
🤓 Build your own (insert technology here)
CameraCalibration
Intrinsic and extrinsic calibration for fisheye or normal cameras. Surround-camera bird's-eye view generator.
chatgpt-mac
ChatGPT for Mac, living in your menubar.
ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image.
codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
direct_lidar_odometry
[IEEE RA-L & ICRA'22] A lightweight and computationally-efficient frontend LiDAR odometry solution with consistent and accurate localization.
dss
Darwin Streaming Server is Apple's open source version of the QuickTime Streaming Server technology, allowing you to send streaming media across the Internet using the industry-standard RTP and RTSP protocols.
Fast-BEV
Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline
fprime
F' - A flight software and embedded systems framework
FS19_modROS
(partial) ROS1 integration for FarmSim19
fs_mod_ros_windows
The Windows side of FS19_modROS
gortsplib
RTSP 1.0 client and server library for the Go programming language
gst-rtsp-server
RTSP server based on GStreamer
kkndme_tianya
The legendary Tianya forum thread by kkndme on housing prices
KNN_CUDA
PyTorch KNN (CUDA version)
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
PCT_Pytorch
PyTorch implementation of PCT: Point Cloud Transformer
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
rtabmap
RTAB-Map library and standalone application
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.
VCD
The Video Conferencing Dataset (VCD) to evaluate video codecs for video conferencing.
visual-slam-roadmap
Roadmap to becoming a Visual-SLAM developer in 2023