_Ouya's repositories
autovideo
AutoVideo: An Automated Video Action Recognition System
awesome-point-cloud-scene-flow
A list of point cloud scene flow papers, codes and datasets.
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
d2l-zh
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被60多个国家的400多所大学用于教学。
docker-pytorch
A Docker image for PyTorch
E2E-TAD
[CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection
Firefly
Firefly: 大模型训练工具,支持训练Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
gmflow
[CVPR 2022 Oral] GMFlow: Learning Optical Flow via Global Matching
kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
LLM-Tuning
Tuning LLMs with no tears💦, sharing LLM-tools with love❤️.
MiDaS
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
movenet
Google's Next Gen Pose Estimation in PyTorch
ONNX-SCDepth-Monocular-Depth-Estimation
Python scripts for performing monocular depth estimation using the SC_Depth model in ONNX
ParC-Net
[ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
random_quantize
a novel data augmentation method across data modalities
SwinTextSpotter
Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)
TimeChat
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
TriDet
[CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling
unimatch
Unifying Flow, Stereo and Depth Estimation
VideoFlow
Official implementation of ICCV2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation
VTimeLLM
[CVPR2024] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".
yas-train
yas CRNN/SVTR model training
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
yolov7-object-tracking
YOLOv7 Object Tracking Using PyTorch, OpenCV and Sort Tracking
yolov7-opencv-onnxrun-cpp-py
分别使用OpenCV、ONNXRuntime部署YOLOV7目标检测,一共包含12个onnx模型,依然是包含C++和Python两个版本的程序