Mingze Xu (xumingze0308)

Company: Cruise AI

Location: Bellevue, WA

Home Page: https://xumingze0308.github.io/

Mingze Xu's starred repositories

PLLaVA

Official repository for the paper PLLaVA

Language: Python | Stargazers: 488 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 34944 | Issues: 0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python | License: MIT | Stargazers: 11208 | Issues: 0

CVinW_Readings

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

Stargazers: 1093 | Issues: 0

cs231n

Shortest solutions for CS231n 2021-2024

Language: Jupyter Notebook | Stargazers: 229 | Issues: 0

paper-reading

Paragraph-by-paragraph close readings of classic and new deep learning papers

License: Apache-2.0 | Stargazers: 25074 | Issues: 0

ML-foundations

Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science

Language: Jupyter Notebook | License: MIT | Stargazers: 3177 | Issues: 0

long-short-term-transformer

[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection

Language: Python | License: Apache-2.0 | Stargazers: 125 | Issues: 0

License: Apache-2.0 | Stargazers: 17 | Issues: 0

detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Language: Python | License: Apache-2.0 | Stargazers: 1920 | Issues: 0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language: Python | License: Apache-2.0 | Stargazers: 6405 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 23736 | Issues: 0

yoloair

🔥🔥🔥 Focused on improved YOLOv5, YOLOv7, YOLOv8, and YOLOv9 models; supports improving the backbone, neck, head, loss, IoU, NMS, and other modules 🚀

Language: Python | License: GPL-3.0 | Stargazers: 2427 | Issues: 0

ConditionalDETR

This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org/abs/2108.06152)

Language: Python | License: Apache-2.0 | Stargazers: 351 | Issues: 0

AdelaiDet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Language: Python | License: NOASSERTION | Stargazers: 3345 | Issues: 0

TeSTra

Code for ECCV 2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"

Language: Python | License: Apache-2.0 | Stargazers: 95 | Issues: 0

boxmot

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Language: Python | License: AGPL-3.0 | Stargazers: 6403 | Issues: 0

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Language: Python | License: Apache-2.0 | Stargazers: 9193 | Issues: 0

leetcode-linghu-templete

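Essential for algorithm interviews; recommended practice site: www.lintcode.com. To get the "LeetCode Practice Templates" by a top Peking University student, add jiuzhangfeifei on WeChat.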

Stargazers: 3179 | Issues: 0

NLP_ability

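A summary of the knowledge a natural language processing (NLP) engineer needs to build up, including interview questions, fundamentals, and engineering skills, to strengthen core competitiveness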

Language: Python | Stargazers: 6495 | Issues: 0

ByteTrack

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Language: Python | License: MIT | Stargazers: 4498 | Issues: 0

FairMOT

[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking

Language: Python | License: MIT | Stargazers: 3959 | Issues: 0

Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation

Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation

Stargazers: 406 | Issues: 0

temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Language: Python | License: MIT | Stargazers: 2038 | Issues: 0

LeetCode-Go

✅ LeetCode solutions in Go, 100% test coverage, runtime beats 100%

Language: Go | License: MIT | Stargazers: 32462 | Issues: 0

torchvggish

PyTorch port of Google Research's VGGish model used for extracting audio features.

Language: Python | License: Apache-2.0 | Stargazers: 368 | Issues: 0

COTR

Code release for "COTR: Correspondence Transformer for Matching Across Images" (ICCV 2021)

Language: Python | License: Apache-2.0 | Stargazers: 453 | Issues: 0

Transformer-Explainability

[CVPR 2021] Official PyTorch implementation of Transformer Interpretability Beyond Attention Visualization, a novel method for visualizing classifications by Transformer-based networks.

Language: Jupyter Notebook | License: MIT | Stargazers: 1723 | Issues: 0

Informer2020

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

Language: Python | License: Apache-2.0 | Stargazers: 5152 | Issues: 0

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language: Python | License: MIT | Stargazers: 279 | Issues: 0