uobinxiao's starred repositories

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python, License: Apache-2.0, Stargazers: 4044, Issues: 0

PMC-LLaMA

The official code for "PMC-LLaMA: Towards Building Open-source Language Models for Medicine"

Language: Python, Stargazers: 559, Issues: 0

CVinW_Readings

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

Stargazers: 1093, Issues: 0

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python, License: BSD-3-Clause, Stargazers: 5375, Issues: 0
Language: Python, License: MIT, Stargazers: 165, Issues: 0

llm-hallucination-survey

Reading list on hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

Stargazers: 862, Issues: 0

ERNIE-Layout-Pytorch

An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.

Language: Python, License: MIT, Stargazers: 95, Issues: 0

LLM4IR-Survey

This is the repo for the survey of LLM4IR.

License: MIT, Stargazers: 373, Issues: 0

CRATE

Code for CRATE (Coding RAte reduction TransformEr).

Language: Python, License: MIT, Stargazers: 1129, Issues: 0

CogVLM

A state-of-the-art open visual language model | Multimodal pretrained model

Language: Python, License: Apache-2.0, Stargazers: 5682, Issues: 0

LLM-Planning-Papers

Must-read Papers on Large Language Model (LLM) Planning.

Stargazers: 296, Issues: 0

dataless-model-merging

Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)

Language: Python, License: Apache-2.0, Stargazers: 76, Issues: 0

WikiTableSet

WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia

Language: Python, License: MIT, Stargazers: 20, Issues: 0

self-adaptive-ICL

self-adaptive in-context learning

Language: Python, Stargazers: 41, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 363, Issues: 0

Awesome-Code-LLM

A curated list of language modeling research for code and related datasets.

Stargazers: 1107, Issues: 0

segment-anything-fast

A batched, offline-inference-oriented version of segment-anything

Language: Python, License: Apache-2.0, Stargazers: 1150, Issues: 0

streamdocs

Documentation, notes, links, etc. for streams.

Stargazers: 70, Issues: 0

FreeSOLO

FreeSOLO for unsupervised instance segmentation, CVPR 2022

Language: Python, License: NOASSERTION, Stargazers: 312, Issues: 0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python, License: MIT, Stargazers: 20259, Issues: 0
Language: Python, License: MIT, Stargazers: 12, Issues: 0

IENet

Code for "Learning an Invariant and Equivariant Network for Weakly Supervised Object Detection"

Language: Python, Stargazers: 17, Issues: 0

bevfusion

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Language: Python, License: Apache-2.0, Stargazers: 2160, Issues: 0

Awesome-Weakly-Supervised-Semantic-Segmentation-Papers

Recent weakly supervised semantic segmentation papers

Stargazers: 226, Issues: 0

GPT-Fathom

GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under aligned settings.

Language: Python, License: MIT, Stargazers: 345, Issues: 0

ScienceQA

Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Language: Python, License: MIT, Stargazers: 574, Issues: 0

vpt

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Language: Python, License: NOASSERTION, Stargazers: 966, Issues: 0

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language: Python, License: MIT, Stargazers: 2367, Issues: 0

P2T

[TPAMI22] Pyramid Pooling Transformer for Scene Understanding

Language: Python, Stargazers: 191, Issues: 0

VPGTrans

Code for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.

Language: Python, License: BSD-3-Clause, Stargazers: 264, Issues: 0