SherryShall's starred repositories

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonLicense:MITStargazers:147726Issues:1566Issues:1938

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:136386Issues:1050Issues:7544

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonLicense:MITStargazers:55829Issues:537Issues:2889

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:45686Issues:303Issues:658

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40075Issues:391Issues:1290

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33941Issues:341Issues:2649

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:29142Issues:215Issues:530

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24190Issues:193Issues:3809

sd-webui-controlnet

WebUI extension for ControlNet

Language:PythonLicense:GPL-3.0Stargazers:16536Issues:150Issues:1467

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15115Issues:104Issues:968

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language:PythonLicense:NOASSERTIONStargazers:13743Issues:202Issues:2291

detr

End-to-End Object Detection with Transformers

Language:PythonLicense:Apache-2.0Stargazers:13152Issues:149Issues:526

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9268Issues:76Issues:454

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6820Issues:59Issues:137

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:5670Issues:64Issues:623

hnswlib

Header-only C++/python library for fast approximate nearest neighbors

Language:C++License:Apache-2.0Stargazers:4166Issues:64Issues:355

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language:PythonLicense:MITStargazers:4018Issues:35Issues:299

ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Language:PythonLicense:Apache-2.0Stargazers:3637Issues:32Issues:374

DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Language:PythonLicense:MITStargazers:3237Issues:42Issues:522

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonLicense:Apache-2.0Stargazers:3155Issues:39Issues:243

BERT-NER-Pytorch

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

Language:PythonLicense:MITStargazers:2035Issues:13Issues:104

SynthText

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Language:PythonLicense:Apache-2.0Stargazers:1994Issues:57Issues:273

ALBEF

Code for ALBEF: a new vision-language pre-training method

Language:PythonLicense:BSD-3-ClauseStargazers:1460Issues:11Issues:139

text_renderer

Generate text images for training deep learning ocr model

Language:PythonLicense:MITStargazers:1358Issues:43Issues:104

ONE-PEACE

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Language:PythonLicense:Apache-2.0Stargazers:898Issues:12Issues:51

DeepSolo

The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting"

Language:PythonLicense:NOASSERTIONStargazers:240Issues:7Issues:64

TCM

Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:163Issues:13Issues:18

ViTAE-Transformer-Scene-Text-Detection

A comprehensive list [I3CL@IJCV'22, DPText-DETR@AAAI'23, DeepSolo(++)@ CVPR'23] of our research works related to scene text detection and spotting, including papers, codes. Note: The official repo for "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped ..." has been moved to: https://github.com/ViTAE-Transformer/I3CL

Language:PythonLicense:Apache-2.0Stargazers:46Issues:3Issues:9

On-Mitigating-Hard-Clusters

Official implementation for "On Mitigating Hard Clusters for Face Clustering" (ECCV 2022 Oral).

Language:PythonLicense:MITStargazers:27Issues:2Issues:6