An Yan's repositories
MM-Navigator
GPT-4V in Wonderland: LMMs as Smartphone Agents
Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
awesome-radiology-report-generation
A curated list of radiology report generation (medical report generation) and related areas. :-)
clinicalBERT
Repository for Publicly Available Clinical BERT Embeddings
constrained_decoding
Lexically constrained decoding for sequence generation using Grid Beam Search
first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
fluent-python
Fluent Python, August 2015
gancaption_iccv2017
Towards Diverse and Natural Image Descriptions via a Conditional GAN
hello-world
Just a repository
Im2Text
Im2Text extension to OpenNMT
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
pytorch-sgns
Skipgram Negative Sampling in PyTorch
Realtime_Multi-Person_Pose_Estimation
Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)
SoM
Set-of-Mark Prompting for LMMs
state-spaces
Sequence Modeling with Structured State Spaces
zzxslp.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes