Joez (joez17)

Location: Beijing

Joez's starred repositories

Event-Bench

Official code of *Towards Event-oriented Long Video Understanding*

Language: Python | Stargazers: 3 | Issues: 0

VideoNIAH

VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

Language: Python | Stargazers: 18 | Issues: 0

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: NOASSERTION | Stargazers: 27860 | Issues: 0

SC-Tune

Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

Awesome-Mamba-Papers

Awesome Papers related to Mamba.

Stargazers: 980 | Issues: 0

IVG

This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions", accepted by ACL 2024 (Findings).

License: Apache-2.0 | Stargazers: 15 | Issues: 0

fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 467 | Issues: 0

TransBTS

This repo provides the official code for: 1) TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/abs/2103.04430), accepted by MICCAI 2021; 2) TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images (https://arxiv.org/abs/2201.12785).

Language: Python | License: Apache-2.0 | Stargazers: 376 | Issues: 0

MRES

This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.

License: Apache-2.0 | Stargazers: 59 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 129396 | Issues: 0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | License: BSD-3-Clause | Stargazers: 25157 | Issues: 0

Heterformer

Heterformer: Transformer-based Deep Node Representation Learning on Heterogeneous Text-Rich Networks (KDD 2023)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 18 | Issues: 0

Edgeformers

Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks (ICLR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 53 | Issues: 0

Awesome-Language-Model-on-Graphs

A curated list of papers and resources based on "Large Language Models on Graphs: A Comprehensive Survey".

License: MIT | Stargazers: 632 | Issues: 0

OPT_Questioner

Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner"

Language: Python | License: MIT | Stargazers: 14 | Issues: 0

COSA

Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

Language: Python | License: MIT | Stargazers: 37 | Issues: 0

VAST

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Language: Jupyter Notebook | License: MIT | Stargazers: 219 | Issues: 0

ChatBridge

ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.

Language: Python | License: BSD-3-Clause | Stargazers: 42 | Issues: 0

VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Language: Python | License: MIT | Stargazers: 248 | Issues: 0

open_flamingo

An open-source framework for training large multimodal models.

Language: Python | License: MIT | Stargazers: 3571 | Issues: 0

Language: Python | Stargazers: 12 | Issues: 0

Language: Python | Stargazers: 23 | Issues: 0

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language: Python | License: Apache-2.0 | Stargazers: 5033 | Issues: 0