Shrek Wang's starred repositories
youtube-dl
Command-line program to download videos from YouTube.com and other video sites
FoodRecNet
FoodRecNet: A comprehensively personalized Food Recommender system using deep neural Networks
video-pretrained-transformer
Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scratch on YouTube (YT-1B dataset).
VidChapters
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
YouCook2-Leaderboard
A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.
coursera-deep-learning-specialization
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv) Convolutional Neural Networks; (v) Sequence Models
Video-Captioning-Using-LSTM-and-Keras
Generating Video Captions Using LSTM
UICKeyChainStore
UICKeyChainStore is a simple wrapper for Keychain on iOS, watchOS, tvOS and macOS. Makes using Keychain APIs as easy as NSUserDefaults.
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
WhisperKit
On-device Speech Recognition for Apple Silicon
open-im-server
IM Chat
awesome-go
A curated list of awesome Go frameworks, libraries and software
awesome-ai-agents
A list of AI autonomous agents
stable-diffusion-webui
Stable Diffusion web UI
HWThrottle
A lightweight Objective-C library for throttle and debounce, supporting leading and trailing modes. Throttling / rate limiting / debouncing / preventing repeated taps / preventing repeated calls
RealChar
🎙️🤖 Create, customize, and talk to your AI character/companion in real time (all in one codebase!). Have a natural, seamless conversation with AI everywhere (mobile, web, and terminal) using OpenAI GPT-3.5/4, Anthropic Claude 2, Chroma vector DB, Whisper speech-to-text, and ElevenLabs text-to-speech