Weoshin's starred repositories
MidJourney-Styles-and-Keywords-Reference
A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
LLMs_interview_notes
This repository mainly collects interview questions for large language model (LLM) algorithm engineers.
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
SAEval-Benchmark
SAEval: A sentiment analysis benchmark for evaluating model performance across various subtasks.
speechbrain
A PyTorch-based Speech Toolkit
torchscale
Foundation Architecture for (M)LLMs
ch-sims-v2
Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module
MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
DialogueCRN
Source code for ACL-IJCNLP 2021 paper "DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations"
Time-Series-Library
A Library for Advanced Deep Time Series Models.
NAACL-19-CIM
Multi-task Learning for Multi-modal Emotion Recognition and Sentiment Analysis
Multimodal-datasets
This repository is built in association with our position paper "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As part of this release, we share information about recent multimodal datasets that are available for research purposes. We found that although the literature describes 100+ multimodal language resources for various NLP tasks, publicly available multimodal datasets remain under-explored for re-use in subsequent problem domains.