Beast code in Giters

jason_li's starred repositories

stable-ts

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Language: Python | License: MIT | 1450 | 0 | 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | 66135 | 0 | 0

sd-webui-animatediff

AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI

Language: Python | License: NOASSERTION | 3004 | 0 | 0

barcode_detection_benchmark

Code for paper "New Benchmarks for Barcode Detection using both Synthetic and Real Data" https://link.springer.com/chapter/10.1007%2F978-3-030-57058-3_34

Language: Python | License: Apache-2.0 | 75 | 0 | 0

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | 90813 | 0 | 0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language: Python | License: Apache-2.0 | 1876 | 0 | 0

MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Language: Python | License: MIT | 2565 | 0 | 0

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language: Python | License: MIT | 4247 | 0 | 0

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | 6890 | 0 | 0

EditAnything

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Language: Python | License: Apache-2.0 | 3256 | 0 | 0

lora-scripts

LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.

Language: Python | License: AGPL-3.0 | 4232 | 0 | 0

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | 138098 | 0 | 0

sd-webui-segment-anything

Segment Anything for Stable Diffusion WebUI

Language: Python | 3345 | 0 | 0

awesome-ai-painting

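A collection of AI painting resources (including platforms available in China and abroad, usage tutorials, parameter tutorials, deployment tutorials, industry news, and more): Stable Diffusion, AnimateDiff, Stable Cascade, Stable SDXL Turbo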

11053 | 0 | 0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language: Python | 9826 | 0 | 0

awesome-chatgpt-prompts-zh

A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you ask.

License: MIT | 51863 | 0 | 0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License: CC0-1.0 | 16777 | 0 | 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

11187 | 0 | 0

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

Language: Python | License: NOASSERTION | 15667 | 0 | 0

MOSS

An open-source tool-augmented conversational language model from Fudan University

Language: Python | License: Apache-2.0 | 11901 | 0 | 0

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment

Language: Python | License: Apache-2.0 | 18073 | 0 | 0

ChatPLUG

A Chinese Open-Domain Dialogue System

Language: Python | License: Apache-2.0 | 308 | 0 | 0

BRIO

ACL 2022: BRIO: Bringing Order to Abstractive Summarization

Language: Python | 327 | 0 | 0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | 24225 | 0 | 0

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

Language: Python | License: GPL-3.0 | 7702 | 0 | 0

TCL

Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Language: Python | License: MIT | 257 | 0 | 0

nerf

Code release for NeRF (Neural Radiance Fields)

Language: Jupyter Notebook | License: MIT | 9715 | 0 | 0

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Language: Python | License: Apache-2.0 | 1966 | 0 | 0

X-modaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language: Python | License: NOASSERTION | 1017 | 0 | 0
