real-ljt

Company: Southeast University

Location: Melbourne, Australia

real-ljt's starred repositories

OpenFace

OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

Language: MATLAB | License: NOASSERTION | Stargazers: 6732 | Issues: 0

Exp-CLIP

[arXiv'24] Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer

Language: Python | License: MIT | Stargazers: 12 | Issues: 0

mmflow

OpenMMLab optical flow toolbox and benchmark

Language: Python | License: Apache-2.0 | Stargazers: 926 | Issues: 0

Awesome_Multimodel_LLM

Awesome_Multimodel is a curated GitHub repository that collects resources for Multimodal Large Language Models (MLLMs). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancements.

Stargazers: 213 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 18075 | Issues: 0

EmoVIT

[CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning

Language: Python | Stargazers: 8 | Issues: 0

livekit

End-to-end stack for WebRTC. SFU media server and SDKs.

Language: Go | License: Apache-2.0 | Stargazers: 8986 | Issues: 0

HSTA_MER

Hierarchical Space-Time Attention for Micro-Expression Recognition in PyTorch

Language: Python | Stargazers: 6 | Issues: 0

SPIGA

SPIGA: Shape Preserving Facial Landmarks with Graph Attention Networks.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 273 | Issues: 0

OpenGraphAU

A tool for facial action unit analysis

Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 0

XX-Net

A proxy tool to bypass GFW.

Language: Python | Stargazers: 32870 | Issues: 0

diffused-heads

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Language: Python | License: NOASSERTION | Stargazers: 447 | Issues: 0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | Stargazers: 30721 | Issues: 0

UCMT

[IJCAI 2023] Co-training with High-Confidence Pseudo Labels for Semi-supervised Medical Image Segmentation

Language: Python | Stargazers: 32 | Issues: 0

DiM-DiffusionMamba

The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Language: Python | Stargazers: 120 | Issues: 0

Llama-Chinese

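Llama Chinese community. An online Llama3 demo and fine-tuned models are now available, the latest Llama3 learning resources are aggregated in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use.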

Language: Python | Stargazers: 12884 | Issues: 0

Xshell-ColorScheme

250+ Xshell Color Schemes

License: MIT | Stargazers: 646 | Issues: 0

TVT

Code of TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation, WACV 2023

Language: Python | License: MIT | Stargazers: 65 | Issues: 0

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language: Python | License: Apache-2.0 | Stargazers: 1879 | Issues: 0

Awesome-CV

Awesome CV is a LaTeX template for your outstanding job application

Language: TeX | License: LPPL-1.3c | Stargazers: 22284 | Issues: 0

Language: Python | Stargazers: 2 | Issues: 0

pyCirclize

Circular visualization in Python (Circos Plot, Chord Diagram, Radar Chart)

Language: Python | License: MIT | Stargazers: 667 | Issues: 0

me_recognition

CapsuleNet for Micro-expression Recognition (IEEE FG 2019)

Language: Python | Stargazers: 99 | Issues: 0

awesome-resume-for-chinese

A collection of résumé templates suited to Chinese (LaTeX, HTML/JS, and so on), maintained by @hoochanlon

Stargazers: 4157 | Issues: 0

EAMM

Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model'

Language: Python | License: MIT | Stargazers: 183 | Issues: 0

BiGS

Official repository of Pretraining Without Attention (BiGS). BiGS is the first model to achieve BERT-level transfer learning on the GLUE benchmark with subquadratic complexity in length (or without attention).

Language: Python | License: Apache-2.0 | Stargazers: 109 | Issues: 0

2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Language: Python | License: NOASSERTION | Stargazers: 1625 | Issues: 0

EAT

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Language: Python | License: MIT | Stargazers: 87 | Issues: 0

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Language: Python | License: Apache-2.0 | Stargazers: 3711 | Issues: 0

ToMe

A method to increase the speed and lower the memory footprint of existing vision transformers.

Language: Python | License: NOASSERTION | Stargazers: 902 | Issues: 0