Suanyang's starred repositories
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
yolov9-improve
Integration of many innovative for YOLOV9
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Video-LLaVA
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
baipiaoOCR
convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino
PaddleOCRModelConvert
Convert the model in PaddleOCR to ONNX format
PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.