2132660698's repositories
DB-GPT-Hub
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
Agent4Rec
The implementation of paper "On Generative Agents in Recommendation"
Awesome-AIGC-3D
A curated list of awesome AIGC 3D papers
Awesome-Domain-LLM
收集和梳理垂直领域的开源模型、数据集及评测基准。
Awesome-LLM4AD
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, and more.
CFGPT
Chinese Financial Assistant with Large Language Model
Chat-UniVi
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
ControlLLM
ControlLLM: Augment Language Models with Tools by Searching on Graphs
CoT-Igniting-Agent
This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
CoVLM
Official implementation for CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
Crossformer
Official implementation of our ICLR 2023 paper "Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series Forecasting"
FinRL
FinRL: Financial Reinforcement Learning. 🔥
GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
groundingLMM
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
HumanTOMATO
[Arxiv-2023] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation
LanguageBind
Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
NeurIPS2023-One-Fits-All
The official code for "One Fits All: Power General Time Series Analysis by Pretrained LM (NeurIPS 2023 Spotlight)"
PicImageSearch
整合图片识别 API,用于以图搜源 / Aggregator for Reverse Image Search API
Qwen-Agent
Agent framework and applications built upon Qwen, featuring Code Interpreter and Chrome browser extension.
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
TransCore-M
Large Multimodal Model
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
vimGPT
Browse the web with GPT-4V and Vimium