蓋瑞王's repositories

anything-llm

A multi-user ChatGPT for any LLMs and vector database. Unlimited documents, messages, and storage in one privacy-focused app. Now available as a desktop application!

Language:JavaScriptLicense:MITStargazers:1Issues:1Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

gpu-jupyter

Leverage the flexibility of Jupyterlab through the power of your NVIDIA GPU to run your code from Tensorflow and Pytorch in collaborative notebooks on the GPU.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

AutoDetect

Official github repo for AutoDetect, an automated weakness detection framework for LLMs.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CoDeF

Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

delta-iris

Efficient World Models with Context-Aware Tokenization. ICML 2024

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

dice

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Stargazers:0Issues:0Issues:0

diffusion-forcing-transformer

Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Stargazers:0Issues:0Issues:0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

EvTexture

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
License:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

make-it-count

Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"

Language:PythonStargazers:0Issues:0Issues:0

MM-NIAH

This is the official implementation of the paper "Needle In A Multimodal Haystack"

Language:PythonStargazers:0Issues:0Issues:0

MotionBooth

The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"

Stargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

planetarium

Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

RVT

Official Code for RVT: Robotic View Transformer for 3D Object Manipulation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

SenseVoice

Multilingual Voice Understanding Model

License:NOASSERTIONStargazers:0Issues:0Issues:0

Taiwan-LLM

Traditional Mandarin LLMs for Taiwan

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

TalkTuner-chatbot-llm-dashboard

Designing a Dashboard for Transparency and Control of Conversational AI, https://arxiv.org/abs/2406.07882

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

xland-minigrid-datasets

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0