LuckyLiYcon's starred repositories

MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language:PythonLicense:BSD-3-ClauseStargazers:474Issues:0Issues:0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:12069Issues:0Issues:0

Five-year-algorithm-interview-three-year-simulation

算法岗笔试面试大全,励志做算法届的《五年高考,三年模拟》!

Stargazers:148Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:92Issues:0Issues:0

BertWithPretrained

An implementation of the BERT model and its related downstream tasks based on the PyTorch framework

Language:PythonStargazers:546Issues:0Issues:0

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language:PythonLicense:Apache-2.0Stargazers:726Issues:0Issues:0

towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Language:PythonLicense:Apache-2.0Stargazers:3089Issues:0Issues:0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonLicense:MITStargazers:821Issues:0Issues:0

DGL

[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. Also, visualization and qb norm search for best performance will be updated ASAP.

Language:PythonLicense:NOASSERTIONStargazers:22Issues:0Issues:0

T-MASS-text-video-retrieval

Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"

Language:PythonStargazers:33Issues:0Issues:0

STTNS

Spatial-Temporal Transformer Networks for Traffic Flow Forecasting

Language:PythonStargazers:27Issues:0Issues:0

awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

Stargazers:566Issues:0Issues:0

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Language:PythonLicense:GPL-3.0Stargazers:4764Issues:0Issues:0

MoneyPrinterTurbo

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonLicense:MITStargazers:15130Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:25Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18274Issues:0Issues:0

SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

Language:PythonLicense:NOASSERTIONStargazers:4340Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:136389Issues:0Issues:0

hello-algo

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Language:JavaLicense:NOASSERTIONStargazers:88631Issues:0Issues:0

DeepLearning-500-questions

深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06

Language:JavaScriptLicense:GPL-3.0Stargazers:53844Issues:0Issues:0

smplx

SMPL-X

Language:PythonLicense:NOASSERTIONStargazers:1716Issues:0Issues:0

Detailed-VideoAvatar

Reproduced Detailed-VideoAvatar project for 3D Human-Body Reconstruction

Language:CStargazers:74Issues:0Issues:0

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4224Issues:0Issues:0

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language:PythonLicense:Apache-2.0Stargazers:727Issues:0Issues:0

pytorch3d

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Language:PythonLicense:NOASSERTIONStargazers:8523Issues:0Issues:0
Language:MathematicaLicense:Apache-2.0Stargazers:569Issues:0Issues:0

PRTR

(CVPR 2021) PRTR: Pose Recognition with Cascade Transformers

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:141Issues:0Issues:0

Awesome-Diffusion-Models-in-Medical-Imaging

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

License:MITStargazers:1290Issues:0Issues:0

Thermal-IM

Thermal Indoor Motion Dataset

License:BSD-3-ClauseStargazers:10Issues:0Issues:0

ICON

[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals

Language:PythonLicense:NOASSERTIONStargazers:1573Issues:0Issues:0