Mustard Bean's repositories

QAnything

Question and Answer based on Anything.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

ADer

ADer is an open source visual anomaly detection toolbox based on PyTorch, which supports multiple popular AD datasets and approaches.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Chat-UniVi

[CVPR 2024🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

chatgpt_system_prompt

Stores all agents' system prompts.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

face_recognition

The world's simplest facial recognition API for Python and the command line

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

GPTs

Leaked prompts of GPTs

Stargazers: 0 | Issues: 0 | Issues: 0

HuggingFists

A low-code data flow tool that allows for convenient use of LLM and HuggingFace models, with some features considered as a low-code version of Langchain.

Stargazers: 0 | Issues: 0 | Issues: 0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Stargazers: 0 | Issues: 0 | Issues: 0

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source model with performance approaching GPT-4V.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MiniCPM-V

MiniCPM-V 2.0: An Efficient End-side MLLM with Strong OCR and Understanding Capabilities

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MiniGPT4Qwen

Personal Project: MPP-Qwen14B (Multimodal Pipeline Parallel-Qwen14B). Don't let poverty limit your imagination! Train your own 14B LLaVA-like MLLM on an RTX 3090/4090 with 24 GB.

Stargazers: 0 | Issues: 0 | Issues: 0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

OneLLM

OneLLM: One Framework to Align All Modalities with Language

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning models, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation, and Keyword Spotting. Won the NAACL 2022 Best Demo Award.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

RWKV-Infer

A large-scale RWKV v6 inference wrapper using the CUDA backend. Easy to deploy with Docker. Supports multi-batch generation and dynamic State switching. Let's spread RWKV, which combines RNN technology with impressively low inference costs!

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

TikTokDownload

Watermark-free batch downloader for Douyin: downloads a user's profile posts, likes, favorites, image posts, and audio.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Valley

The official repository of "Video assistant towards large language model makes everything easy"

Stargazers: 0 | Issues: 0 | Issues: 0

Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0