johnwick123f

johnwick123f

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

johnwick123f's repositories

Project

Simple repository for personal project

Language:PythonLicense:CC0-1.0Stargazers:1Issues:0Issues:0

Grasp-Anything

Dataset and Code for "Grasp-Anything: Large-scale Grasp Dataset from Foundation Models."

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

piecewise-rectified-flow

perflow but library

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

PersonalROS

Personal stuff for robots

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

sussy

Code for subgoal synthesis via image editing

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

graspnetAPI

Toolbox for our GraspNet-1Billion dataset.

Language:PythonStargazers:0Issues:0Issues:0

llama-cpp-python

Python bindings for llama.cpp

License:MITStargazers:0Issues:0Issues:0

Bunny

A family of lightweight multimodal models.

Language:PythonStargazers:0Issues:0Issues:0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

groundingLMM

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Language:PythonStargazers:0Issues:0Issues:0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

License:Apache-2.0Stargazers:0Issues:0Issues:0

tokenize-anything

Tokenize Anything via Prompting

License:Apache-2.0Stargazers:0Issues:0Issues:0

GLEE

GLEE: General Object Foundation Model for Images and Videos at Scale

License:MITStargazers:0Issues:0Issues:0

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

multi_token

Embed arbitrary modalities (images, audio, documents, etc) into large language models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

License:Apache-2.0Stargazers:0Issues:0Issues:0

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

exllamav2-hf

Using exllama with hf

Language:PythonStargazers:0Issues:0Issues:0

exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

LISAKaggle

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bitsandbytes

8-bit CUDA functions for PyTorch

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Video-ChatGPT

"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

PandaGPT2

PandaGPT: One Model To Instruction-Follow Them All

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0