Beast code in Giters

Jason's Lab's repositories

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language:PythonApache-2.0000

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

000

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Apache-2.0000

CIF-HieraDist

[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation

Apache-2.0000

damaihelper

支持大麦网，淘票票、缤玩岛等多个平台，演唱会演出抢票脚本

AGPL-3.0000

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

MIT000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

MIT000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

NOASSERTION000

funNLP

NLP tips

000

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

BSD-3-Clause000

grok-1

Grok open release

Apache-2.0000

hello-algo

《Hello 算法》：动画图解、一键运行的数据结构与算法教程，支持 Java, C++, Python, Go, JS, TS, C#, Swift, Rust, Dart, Zig 等语言。

NOASSERTION000

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

MIT000

LaVIT

LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

NOASSERTION000

llama

Inference code for LLaMA models

NOASSERTION000

Llama2-Chinese

Llama中文社区，最好的中文Llama大模型，完全开源可商用

000

LLaSM

第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验，同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。

Apache-2.0000

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Apache-2.0000

LLM-Conversation-Safety

[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey

000

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

MIT000

MetaGPT

🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo

MIT000

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Apache-2.0000

patchelf

A small utility to modify the dynamic linker and RPATH of ELF executables

GPL-3.0000

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Apache-2.0000

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

NOASSERTION000

RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

000

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Apache-2.0000

stable-diffusion

A latent text-to-image diffusion model

NOASSERTION000

stable-diffusion-webui

Stable Diffusion web UI

AGPL-3.0000

VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks

000