junedgar

junedgar

Geek Repo

Company:ustc

Location:hangzhou

Github PK Tool:Github PK Tool

junedgar's starred repositories

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language:PythonStargazers:1609Issues:0Issues:0

unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Language:PythonLicense:AGPL-3.0Stargazers:1317Issues:0Issues:0
Stargazers:1635Issues:0Issues:0

Yi

A series of large language models trained from scratch by developers @01-ai

Language:PythonLicense:Apache-2.0Stargazers:7372Issues:0Issues:0

MQ-Det

Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)

Language:PythonLicense:Apache-2.0Stargazers:238Issues:0Issues:0

Awesome-Video-Diffusion-Models

[Arxiv] A Survey on Video Diffusion Models

Stargazers:1431Issues:0Issues:0

video-generation-survey

A reading list of video generation

Stargazers:285Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:5556Issues:0Issues:0

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:60103Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:1213Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8852Issues:0Issues:0

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language:Jupyter NotebookLicense:MITStargazers:2208Issues:0Issues:0

aesthetic-predictor

A linear estimator on top of clip to predict the aesthetic quality of pictures

Language:Jupyter NotebookLicense:MITStargazers:398Issues:0Issues:0
Language:PythonLicense:MITStargazers:92Issues:0Issues:0

Prompt-Can-Anything

You can do anything by sota AI with prompt ,auto AI tools , VL larger model fine and project

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:175Issues:0Issues:0

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language:PythonLicense:MITStargazers:2777Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:23280Issues:0Issues:0

RobustVideoMatting

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Language:PythonLicense:GPL-3.0Stargazers:8278Issues:0Issues:0

OpenChineseLLaMA

Chinese large language model base generated through incremental pre-training on Chinese datasets

Language:PythonLicense:GPL-3.0Stargazers:231Issues:0Issues:0

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonLicense:NOASSERTIONStargazers:7987Issues:0Issues:0

InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

Language:PythonLicense:Apache-2.0Stargazers:3150Issues:0Issues:0

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:35287Issues:0Issues:0

Make-A-Protagonist

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

Language:PythonLicense:Apache-2.0Stargazers:314Issues:0Issues:0

VisualGLM-6B

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:4009Issues:0Issues:0

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:17724Issues:0Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonLicense:MITStargazers:86847Issues:0Issues:0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25057Issues:0Issues:0

gpt4all

gpt4all: run open-source LLMs anywhere

Language:C++License:MITStargazers:65715Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17347Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7650Issues:0Issues:0