Aniki (aniki-ly)

aniki-ly

Geek Repo

Company:University of Technology Sydney

Location:Sydney

Home Page:yulu.net.cn

Github PK Tool:Github PK Tool

Aniki's repositories

FlowZero

FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax

Language:JavaScriptStargazers:1Issues:2Issues:0

CRIS.pytorch

An official PyTorch implementation of the CRIS paper

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

Awesome-Cross-Modal-Video-Moment-Retrieval

前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。

Stargazers:0Issues:0Issues:0

awesome-language-model-with-vision

Related about vision and language models

Stargazers:0Issues:1Issues:0

Awesome-Segment-Anything

Collect some resource about Segment Anything (SAM), including the latest papers and demo

Stargazers:0Issues:0Issues:0

awesome-source-free-test-time-adaptation

[2022] A curated list of papers in Test-time Adaptation, Test-time Training and Source-free Domain Adaptation

Stargazers:0Issues:0Issues:0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers:0Issues:0Issues:0

Awesome-Video-Diffusion-Models

[Arxiv] A Survey on Video Diffusion Models

Stargazers:0Issues:0Issues:0

datacomp

DataComp: In search of the next generation of multimodal datasets

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

LayoutGPT

Official repo for LayoutGPT

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MaskCLIP

Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

VideoX

VideoX: a collection of video cross-modal models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Gen-L-Video

The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Stargazers:0Issues:0Issues:0

MedSegDiff

Official implementation of paper "MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model"

Language:PythonStargazers:0Issues:0Issues:0

RPG-DiffusionMaster

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Stargazers:0Issues:0Issues:0

SciencePlots

Matplotlib styles for scientific plotting

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

visual-chatgpt

Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0