uobinxiao's starred repositories
VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
ERNIE-Layout-Pytorch
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
LLM4IR-Survey
This is the repo for the survey of LLM4IR.
LLM-Planning-Papers
Must-read Papers on Large Language Model (LLM) Planning.
dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
WikiTableSet
WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia
self-adaptive-ICL
self-adaptive in-context learning
Awesome-Code-LLM
A curated list of language modeling researches for code and related datasets.
segment-anything-fast
A batched offline inference oriented version of segment-anything
streamdocs
Documentation, notes, links, etc for streams.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Awesome-Weakly-Supervised-Semantic-Segmentation-Papers
Recent weakly supervised semantic segmentation paper
GPT-Fathom
GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under aligned settings.
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"