cshnai's repositories
a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
CLIP
Contrastive Language-Image Pretraining
ColossalAI
Making large AI models cheaper, faster and more accessible
COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
dalle-mini
DALL·E Mini - Generate images from a text prompt
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
gpt4all
gpt4all: an ecosystem of open-source chatbots trained on a massive collection of clean assistant data, including code, stories, and dialogue
HierVL
[CVPR 2023] HierVL: Learning Hierarchical Video-Language Embeddings
HIPT
Hierarchical Image Pyramid Transformer - CVPR 2022 (Oral)
KULLM
☁️ 구름(KULLM): an LLM specialized for Korean, developed at Korea University
live-streaming-demo
Use D-ID's live streaming API to stream a talking presenter
LLaVA-Med
Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities.
make-a-video-pytorch
Implementation of Make-A-Video, a new SOTA text-to-video generator from Meta AI, in PyTorch
pdffigures2
Given a scholarly PDF, extract figures, tables, captions, and section titles.
pubmed_parser
📋 A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
pytorch-zoom-in-network
Source code for "Efficient Classification of Very Large Images with Tiny Objects"
rltrader
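Deep learning and reinforcement learning stock investing with Python and Keras - an introduction to cutting-edge solutions for quant investing and algorithmic trading (revised edition)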
roop
one-click face swap
SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Self-Supervised-ViT-Path
Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology (LMRL Workshop, NeurIPS 2021)
stable-diffusion-webui
Stable Diffusion web UI
StableLM
StableLM: Stability AI Language Models
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
virtex
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
zoommil
ZoomMIL is a multiple instance learning (MIL) method that learns to perform multi-level zooming for efficient Whole-Slide Image (WSI) classification.