whut265107

whut265107

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

whut265107's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23880Issues:218Issues:3669

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18428Issues:158Issues:1418

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language:PythonLicense:Apache-2.0Stargazers:9121Issues:87Issues:711

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8818Issues:82Issues:36

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7422Issues:110Issues:150

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

annotated-transformer

An annotated implementation of the Transformer paper.

Language:Jupyter NotebookLicense:MITStargazers:5431Issues:64Issues:85

segment-geospatial

A Python package for segmenting geospatial data with the Segment Anything Model (SAM)

Language:PythonLicense:MITStargazers:2787Issues:53Issues:126

pytorch-toolbelt

PyTorch extensions for fast R&D prototyping and Kaggle farming

Language:PythonLicense:MITStargazers:1504Issues:26Issues:33

GlobalMLBuildingFootprints

Worldwide building footprints derived from satellite imagery

Language:PythonLicense:NOASSERTIONStargazers:1336Issues:63Issues:83

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language:PythonLicense:Apache-2.0Stargazers:732Issues:14Issues:36

HuggingFace-Download-Accelerator

利用HuggingFace的官方下载工具从镜像网站进行高速下载。

train-CLIP

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

Language:PythonLicense:MITStargazers:644Issues:16Issues:37

GeoSeg

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.

Language:PythonLicense:GPL-3.0Stargazers:618Issues:14Issues:0

TransNeXt

[CVPR 2024] Code release for TransNeXt model

Language:PythonLicense:Apache-2.0Stargazers:309Issues:5Issues:16

fc-clip

[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Language:PythonLicense:Apache-2.0Stargazers:267Issues:16Issues:33
Language:PythonLicense:NOASSERTIONStargazers:192Issues:4Issues:6

ViT-CoMer

Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

Language:PythonLicense:Apache-2.0Stargazers:177Issues:3Issues:20

PixelLM

PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding. PixelLM is accepted by CVPR 2024.

Language:PythonLicense:Apache-2.0Stargazers:157Issues:5Issues:19

MTP

The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"

Language:PythonLicense:MITStargazers:131Issues:3Issues:20
Language:PythonLicense:Apache-2.0Stargazers:97Issues:2Issues:9

RSBuilding

This is the pytorch implement of our paper "RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model"

Language:PythonLicense:Apache-2.0Stargazers:89Issues:3Issues:10

GABLE

A first Fine-grained 3D Building Model of China on a National Scale from Very High Resolution Satellite Imagery

Language:PythonLicense:MITStargazers:87Issues:4Issues:7

H2RSVLM

H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model

rschange

Change detection of remote sensing images

Language:PythonStargazers:41Issues:0Issues:0

LaSagnA

Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".

Language:PythonLicense:Apache-2.0Stargazers:38Issues:2Issues:3

LeMeViT

The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation"

Language:PythonStargazers:38Issues:3Issues:0

muno21

A map update dataset and benchmark

Language:Jupyter NotebookStargazers:20Issues:0Issues:1

SecViT

official code for "Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer"

Language:PythonLicense:MITStargazers:8Issues:0Issues:0