Yicong (yl3800)

yl3800

Geek Repo

Company:National University of Singapore

Location:Singapore

Github PK Tool:Github PK Tool

Yicong's starred repositories

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

License:MITStargazers:12414Issues:0Issues:0

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

License:MITStargazers:4760Issues:0Issues:0

Awesome-AIGC-3D

A curated list of awesome AIGC 3D papers

License:MITStargazers:407Issues:0Issues:0

awesome-3d-diffusion

A collection of papers on diffusion models for 3D generation.

License:MITStargazers:584Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers:9841Issues:0Issues:0

awesome-vlm-architectures

Famous Vision Language Models and Their Architectures

Language:MarkdownLicense:CC0-1.0Stargazers:119Issues:0Issues:0

awesome-3D-generation

A curated list of awesome 3d generation papers

Stargazers:980Issues:0Issues:0

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

Stargazers:786Issues:0Issues:0

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonLicense:NOASSERTIONStargazers:482Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:15334Issues:0Issues:0

Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

License:MITStargazers:752Issues:0Issues:0

einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Language:PythonLicense:MITStargazers:8034Issues:0Issues:0

PointMetaBase

This is a PyTorch implementation of PointMetaBase proposed by our paper "Meta Architecure for Point Cloud Analysis"

Language:PythonLicense:MITStargazers:84Issues:0Issues:0

Point-Transformers

Point Transformers

Language:PythonLicense:MITStargazers:596Issues:0Issues:0

GenPromp

[ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization

Language:PythonLicense:Apache-2.0Stargazers:53Issues:0Issues:0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonLicense:MITStargazers:6186Issues:0Issues:0

Open3D

Open3D: A Modern Library for 3D Data Processing

Language:C++License:NOASSERTIONStargazers:10661Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29006Issues:0Issues:0

MultiModal_BigModels_Survey

[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models

Stargazers:255Issues:0Issues:0

VGT

Video Graph Transformer for Video Question Answering (ECCV'22)

Language:PythonLicense:Apache-2.0Stargazers:43Issues:0Issues:0

CVPR2024-Papers-with-Code

CVPR 2024 论文和开源项目合集

Stargazers:16638Issues:0Issues:0

ChatReviewer

ChatReviewer: 使用ChatGPT分析论文优缺点,提出改进建议

Language:PythonLicense:NOASSERTIONStargazers:1231Issues:0Issues:0

ai-edu

AI education materials for Chinese students, teachers and IT professionals.

Language:HTMLLicense:NOASSERTIONStargazers:13268Issues:0Issues:0

uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Language:Jupyter NotebookLicense:MITStargazers:2219Issues:0Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33168Issues:0Issues:0

InternVideo

Video Foundation Models & Data for Multimodal Understanding

Language:PythonLicense:Apache-2.0Stargazers:1029Issues:0Issues:0
Stargazers:71Issues:0Issues:0

In-the-wild-QA

In-the-wild Question Answering

Language:PythonStargazers:14Issues:0Issues:0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:30288Issues:0Issues:0

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:24626Issues:0Issues:0