stdKonjac

stdKonjac

Geek Repo

Company:Tsinghua University

Location:Shenzhen, Guangdong, China

Home Page:https://www.stdkonjac.icu/

Twitter:@stdKonjac

Github PK Tool:Github PK Tool

stdKonjac's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66123Issues:556Issues:697

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:53828Issues:509Issues:923

ChatGPT

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

Language:RustLicense:AGPL-3.0Stargazers:51202Issues:426Issues:991

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:44924Issues:299Issues:646
Language:PythonLicense:NOASSERTIONStargazers:34529Issues:305Issues:350

ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Language:PythonLicense:NOASSERTIONStargazers:17839Issues:88Issues:214

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17349Issues:155Issues:1345

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:9488Issues:63Issues:102

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9028Issues:95Issues:619

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8853Issues:76Issues:441
Language:PythonLicense:NOASSERTIONStargazers:6036Issues:69Issues:115

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonLicense:GPL-3.0Stargazers:5572Issues:78Issues:141

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:5485Issues:36Issues:870

SS-SSR-V2ray

机场推荐与机场评测ssr/v2ray2023

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language:PythonLicense:MITStargazers:3488Issues:100Issues:159

fairscale

PyTorch extensions for high performance and large scale training.

Language:PythonLicense:NOASSERTIONStargazers:2962Issues:44Issues:357

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1528Issues:21Issues:84

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

SEED

Official implementation of SEED-LLaMA (ICLR 2024).

Language:PythonLicense:NOASSERTIONStargazers:500Issues:14Issues:41

XPretrain

Multi-modality pre-training

Language:PythonLicense:NOASSERTIONStargazers:448Issues:14Issues:33

flip

Official Open Source code for "Scaling Language-Image Pre-training via Masking"

Language:PythonLicense:NOASSERTIONStargazers:383Issues:8Issues:2

model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Language:PythonLicense:MITStargazers:281Issues:5Issues:0

SEED-Bench

(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Language:PythonLicense:NOASSERTIONStargazers:261Issues:4Issues:26

unmasked_teacher

[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Language:PythonLicense:MITStargazers:257Issues:14Issues:39

ICCV23-IDPT

The code for the paper "Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models" (ICCV'23).

HBI

[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

Language:PythonLicense:Apache-2.0Stargazers:95Issues:4Issues:7
Language:PythonLicense:MITStargazers:93Issues:4Issues:11

STAN

Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"

Language:PythonLicense:Apache-2.0Stargazers:86Issues:5Issues:18

Recformer

Codebase for KDD 2023 paper, Text Is All You Need: Learning Language Representations for Sequential Recommendation