Blakey Wu's repositories
wikiscenes
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.
Front-end-Homework
front-end final work by cocosStudio
1-stage-wseg
Single-Stage Semantic Segmentation from Image Labels (CVPR 2020)
3D-mesh-renderer
Mesh renderer implemented from scratch
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
DataBase2020
清华大学数据库原理课程大作业(框架为4.23助教更新后版本)开发者:武笑石、黎思宇、陈语凝
ftp-server-client
A ftp server and a client
magic-ruler-simulator
A graphical simulator for magic ruler. You can create, operate, save, load your magic ruler, and watch it from different views.
ray-tracer
Ray tracing implemented from scratch.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
EuclideanMST
Implementations of different algorithms for building Euclidean minimum spanning tree in k-dimensional space.
ICCV2023-Diffusion-Papers
ICCV2023-Diffusion-Papers
Introduction-to-Artificial-Intelligence
银行精准营销解决方案+青蛙叫声聚类分析
mmdetection
OpenMMLab Detection Toolbox and Benchmark
RegionCLIP
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
THUSS-datalab-solution
csapp homework solution
ViViT-pytorch
Implementation of ViViT: A Video Vision Transformer