墨问's starred repositories
Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
PathFinding.js
A comprehensive path-finding library for grid-based games
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
ollama-python
Ollama Python library
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
KalmanFilter
This is a Kalman filter used to calculate the angle, rate and bias from the input of an accelerometer/magnetometer and a gyroscope.
InternVideo
[ECCV 2024] Video Foundation Models & Data for Multimodal Understanding
DQN_play_sekiro
DQN_play_sekiro
LLM101n-CN
LLM101n: Let's build a Storyteller (Chinese edition)
Flash-VStream
This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"