Linxi (linxid)

linxid

Geek Repo

Company:alibaba

Location:hangzhou

Home Page:blog.csdn.net/linxid

Github PK Tool:Github PK Tool

Linxi's starred repositories

stable-diffusion-tutorial

全网最全Stable Diffusion全套教程,从入门到进阶,耗时三个月制作

Stargazers:1056Issues:0Issues:0

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:3236Issues:0Issues:0

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language:PythonLicense:Apache-2.0Stargazers:191Issues:0Issues:0

DB-GPT-Web

DB-GPT WebUI,LLM to vision.

Language:TypeScriptStargazers:175Issues:0Issues:0

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language:PythonLicense:MITStargazers:11312Issues:0Issues:0

chatllama

ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT

Language:PythonStargazers:1196Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3018Issues:0Issues:0

LLaVA-HR

LLaVA-HR: High-Resolution Large Language-Vision Assistant

Language:PythonLicense:Apache-2.0Stargazers:170Issues:0Issues:0

MoneyPrinterTurbo

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonLicense:MITStargazers:12852Issues:0Issues:0

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language:PythonLicense:Apache-2.0Stargazers:654Issues:0Issues:0

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language:PythonLicense:Apache-2.0Stargazers:1014Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:30541Issues:0Issues:0

How-to-use-Transformers

Transformers 库快速入门教程

Language:PythonLicense:Apache-2.0Stargazers:718Issues:0Issues:0

HPT

HPT - Open Multimodal LLMs from HyperGAI

Language:PythonLicense:Apache-2.0Stargazers:288Issues:0Issues:0

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language:PythonLicense:Apache-2.0Stargazers:818Issues:0Issues:0

DocBank

DocBank: A Benchmark Dataset for Document Layout Analysis

Language:PythonLicense:Apache-2.0Stargazers:529Issues:0Issues:0

MyArxiv

Arxiv个性化定制化模版,实现对特定领域的相关内容、作者与学术会议的有效跟进。

Language:CSSLicense:GPL-2.0Stargazers:208Issues:0Issues:0
Language:Jupyter NotebookStargazers:7Issues:0Issues:0

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Language:PythonLicense:Apache-2.0Stargazers:4265Issues:0Issues:0

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonLicense:MITStargazers:51996Issues:0Issues:0

SeeClick

The model, data and code for the visual GUI Agent SeeClick

Language:HTMLLicense:Apache-2.0Stargazers:115Issues:0Issues:0
Stargazers:1Issues:0Issues:0

FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

Stargazers:744Issues:0Issues:0

cv-arxiv-daily

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Language:PythonLicense:Apache-2.0Stargazers:762Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:19Issues:0Issues:0

Image-Caption-Quality-Dataset

A dataset of crowdsourced ratings for machine-generated image captions

License:NOASSERTIONStargazers:30Issues:0Issues:0
Language:PythonStargazers:149Issues:0Issues:0

MobileAgent

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

Language:PythonLicense:MITStargazers:1894Issues:0Issues:0

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonLicense:MITStargazers:4415Issues:0Issues:0