唐月明's starred repositories

Raspberry-PI-Smart-Home

Programmed in Python and built around a Raspberry Pi with hardware such as small electric motors, coordinating the software design with the hardware to keep the system running reliably. This project studies how to use a Raspberry Pi for smart-home control: Python programs, together with several other hardware devices, implement automatic curtain control and automatic temperature and humidity regulation, with manual control available alongside the automatic mode. The work explores the Raspberry Pi's application to smart-home control system design; by pairing it with other hardware, it also demonstrates an approach to adding smart-home control without substantially modifying existing household equipment.

Language: C++ · Stars: 5 · Issues: 0

acl-style-files

Official style files for papers submitted to venues of the Association for Computational Linguistics

Language: TeX · Stars: 628 · Issues: 0

awesome-llm-understanding-mechanism

Awesome papers on LLM interpretability

Stars: 187 · Issues: 0

MAIA-Data-Processing

This repository contains all of the multimodal data-processing scripts.

Language: Python · Stars: 4 · Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python · License: MIT · Stars: 29660 · Issues: 0

Neural-Network-Parameter-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters.

Language: Python · Stars: 781 · Issues: 0

MISA

MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Language: Python · License: MIT · Stars: 172 · Issues: 0

MMSA-FET

A tool for extracting multimodal features from videos.

Language: Python · License: GPL-3.0 · Stars: 121 · Issues: 0

MMSA

MMSA is a unified framework for Multimodal Sentiment Analysis.

Language: Python · License: MIT · Stars: 605 · Issues: 0

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language: Python · Stars: 1628 · Issues: 0

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language: Python · License: Apache-2.0 · Stars: 1079 · Issues: 0

UniSA

UniSA: Unified Generative Framework for Sentiment Analysis

Language: Python · License: MIT · Stars: 41 · Issues: 0

how-to-train-tokenizer

How to train an LLM tokenizer

Language: Python · Stars: 108 · Issues: 0

clip_text_span

Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"

Language: Jupyter Notebook · License: MIT · Stars: 122 · Issues: 0

ALMT

Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis

Language: Python · License: MIT · Stars: 50 · Issues: 0

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language: Python · License: Apache-2.0 · Stars: 711 · Issues: 0

img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.

Language: Python · License: MIT · Stars: 3403 · Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook · License: MIT · Stars: 23252 · Issues: 0

GroundingGPT

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Language: Python · License: Apache-2.0 · Stars: 257 · Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

Stars: 10179 · Issues: 0

Awesome-LLM-KG

Awesome papers about unifying LLMs and KGs

Stars: 1715 · Issues: 0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License: MIT · Stars: 5615 · Issues: 0

top-class-resource-library

Online resource library for the SCUT top-talent program class

Stars: 2 · Issues: 0

Tech_Aarticle

A collection of the good articles I read day to day, kept for my own convenience and shared with everyone. For articles I have already read, I add a brief interpretation; for those I have not, I list them first and add the interpretation after reading. Topics cover search, recommendation, and natural language processing.

Stars: 1722 · Issues: 0