Weoshin

Location: Tianjin

Weoshin's starred repositories

MidJourney-Styles-and-Keywords-Reference

A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!

Stargazers: 11775, Issues: 0

self-llm

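"A Guide to Using Open-Source Large Models": quickly deploy open-source large models in a Linux environment; a deployment tutorial better suited to ** beginners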

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 5902, Issues: 0

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python, License: Apache-2.0, Stargazers: 2654, Issues: 0

Awesome-Multimodal-Large-Language-Models

Latest Advances on Multimodal Large Language Models

Stargazers: 10311, Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python, License: Apache-2.0, Stargazers: 33606, Issues: 0

LLMs_interview_notes

This repository mainly collects interview questions for large language model (LLM) algorithm engineers

License: Apache-2.0, Stargazers: 1125, Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook, License: MIT, Stargazers: 23371, Issues: 0

ALMT

Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis

Language: Python, License: MIT, Stargazers: 49, Issues: 0

opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

Language: C++, License: NOASSERTION, Stargazers: 541, Issues: 0

OpenFace

OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

Language: MATLAB, License: NOASSERTION, Stargazers: 6714, Issues: 0

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python, License: MIT, Stargazers: 2846, Issues: 0
Language: Python, License: MIT, Stargazers: 35, Issues: 0

SAEval-Benchmark

SAEval: A benchmark for sentiment analysis to evaluate the model's performance on various subtasks.

Language: Python, License: MIT, Stargazers: 8, Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python, License: Apache-2.0, Stargazers: 8168, Issues: 0

torchscale

Foundation Architecture for (M)LLMs

Language: Python, License: MIT, Stargazers: 2966, Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python, License: MIT, Stargazers: 19079, Issues: 0

ch-sims-v2

Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module

Language: Python, Stargazers: 47, Issues: 0

SPCL

Code for "Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation" (EMNLP 2022)

Language: Python, Stargazers: 71, Issues: 0

MMSA

MMSA is a unified framework for Multimodal Sentiment Analysis.

Language: Python, License: MIT, Stargazers: 608, Issues: 0

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Language: HTML, License: MIT, Stargazers: 455, Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python, License: Apache-2.0, Stargazers: 2151, Issues: 0

DialogueCRN

Source code for ACL-IJCNLP 2021 paper "DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations"

Language: Python, License: MIT, Stargazers: 56, Issues: 0

Time-Series-Library

A Library for Advanced Deep Time Series Models.

Language: Python, License: MIT, Stargazers: 4992, Issues: 0

TimesNet

Code release for "TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis" (ICLR 2023), https://openreview.net/pdf?id=ju_Uqw384Oq

License: MIT, Stargazers: 622, Issues: 0

MM-DFN

Source code for ICASSP 2022 paper "MM-DFN: Multimodal Dynamic Fusion Network For Emotion Recognition in Conversations"

Language: Python, License: MIT, Stargazers: 68, Issues: 0

NAACL-19-CIM

Multi-task Learning for Multi-modal Emotion Recognition and Sentiment Analysis

Language: Python, License: MIT, Stargazers: 13, Issues: 0

Multimodal-datasets

This repository was built in association with our position paper, "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As part of this release, we share information about recent multimodal datasets that are available for research purposes. We found that although 100+ multimodal language resources are available in the literature for various NLP tasks, publicly available multimodal datasets remain under-explored for reuse in subsequent problem domains.

Stargazers: 200, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 739, Issues: 0
Language: Python, Stargazers: 160, Issues: 0