johnwick123f's repositories
Grasp-Anything
Dataset and Code for "Grasp-Anything: Large-scale Grasp Dataset from Foundation Models."
piecewise-rectified-flow
PeRFlow, packaged as a library
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
PersonalROS
Personal stuff for robots
sussy
Code for subgoal synthesis via image editing
graspnetAPI
Toolbox for our GraspNet-1Billion dataset.
llama-cpp-python
Python bindings for llama.cpp
Bunny
A family of lightweight multimodal models.
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
groundingLMM
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
tokenize-anything
Tokenize Anything via Prompting
GLEE
GLEE: General Object Foundation Model for Images and Videos at Scale
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
multi_token
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
exllamav2-hf
Using ExLlama with Hugging Face (HF)
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
LISAKaggle
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
bitsandbytes
8-bit CUDA functions for PyTorch
Video-ChatGPT
"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
PandaGPT2
PandaGPT: One Model To Instruction-Follow Them All