LuckyLiYcon

LuckyLiYcon's starred repositories

MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language: Python · BSD-3-Clause · 47400

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language: Python · MIT · 1206900

Five-year-algorithm-interview-three-year-simulation

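A comprehensive collection of written-test and interview questions for algorithm positions, aspiring to be the algorithm world's "Five-Year Gaokao, Three-Year Simulation" (《五年高考，三年模拟》)!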

IG-VLM

Language: Python · BSD-3-Clause · 9200

BertWithPretrained

An implementation of the BERT model and its related downstream tasks based on the PyTorch framework

Language: Python · 54600

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language: Python · Apache-2.0 · 72600

towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Language: Python · Apache-2.0 · 308900

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language: Python · MIT · 82100

DGL

[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. Also, visualization and qb norm search for best performance will be updated ASAP.

Language: Python · NOASSERTION · 2200

T-MASS-text-video-retrieval

Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"

Language: Python · 3300

STTNS

Spatial-Temporal Transformer Networks for Traffic Flow Forecasting

Language: Python · 2700

awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Language: Python · GPL-3.0 · 476400

MoneyPrinterTurbo

Generate high-definition short videos with one click using large AI models.

Language: Python · MIT · 1513000

MuscleMap

Language: Python · Apache-2.0 · 2500

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python · Apache-2.0 · 1827400

SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

Language: Python · NOASSERTION · 434000

stable-diffusion-webui

Stable Diffusion web UI

Language: Python · AGPL-3.0 · 13638900

hello-algo

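Hello Algo (《Hello 算法》): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart code. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.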

Language: Java · NOASSERTION · 8863100

DeepLearning-500-questions

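500 Questions on Deep Learning: a question-and-answer treatment of commonly used topics in probability, linear algebra, machine learning, deep learning, computer vision, and other popular areas, intended to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06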

Language: JavaScript · GPL-3.0 · 5384400

smplx

SMPL-X

Language: Python · NOASSERTION · 171600

Detailed-VideoAvatar

Reproduced Detailed-VideoAvatar project for 3D Human-Body Reconstruction

Language: C · 7400

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language: Python · MIT · 422400

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language: Python · Apache-2.0 · 72700

pytorch3d

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Language: Python · NOASSERTION · 852300

MASS

Language: Mathematica · Apache-2.0 · 56900

PRTR

(CVPR 2021) PRTR: Pose Recognition with Cascade Transformers

Language: Jupyter Notebook · Apache-2.0 · 14100

Awesome-Diffusion-Models-in-Medical-Imaging

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

MIT · 129000

Thermal-IM

Thermal Indoor Motion Dataset

BSD-3-Clause · 1000

ICON

[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals

Language: Python · NOASSERTION · 157300