shimin (Young1993)

Young1993

Geek Repo

Company:Institute of Computing Innovation, Zhejiang University; Master of The Hong Kong Polytechnic University

Location:Hangzhou

Github PK Tool:Github PK Tool

shimin's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:NOASSERTIONStargazers:167989Issues:1549Issues:2773

gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonLicense:Apache-2.0Stargazers:37041Issues:431Issues:1641

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36842Issues:350Issues:1824

pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:28304Issues:250Issues:7113

flux

Official inference repo for FLUX.1 models

Language:PythonLicense:Apache-2.0Stargazers:15499Issues:136Issues:147

FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:13934Issues:256Issues:105

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language:PythonLicense:MITStargazers:12555Issues:74Issues:270

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:10018Issues:84Issues:248

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9849Issues:99Issues:663

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Language:PythonLicense:Apache-2.0Stargazers:6905Issues:73Issues:123

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:5860Issues:57Issues:593

ViT-pytorch

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Language:Jupyter NotebookLicense:MITStargazers:1929Issues:13Issues:55

mteb

MTEB: Massive Text Embedding Benchmark

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1920Issues:15Issues:444

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language:PythonLicense:Apache-2.0Stargazers:1858Issues:18Issues:110

Restormer

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Language:PythonLicense:NOASSERTIONStargazers:1773Issues:18Issues:101

CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Language:PythonLicense:Apache-2.0Stargazers:1718Issues:57Issues:63

Research

novel deep learning research works with PaddlePaddle

Language:PythonLicense:Apache-2.0Stargazers:1717Issues:48Issues:150

CrossWOZ

A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset

Language:PythonLicense:Apache-2.0Stargazers:641Issues:16Issues:32

CPT

CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

picard

PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project that was started at Element AI.

Language:HaskellLicense:Apache-2.0Stargazers:342Issues:11Issues:112

PLOME

Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021

Language:PythonLicense:Apache-2.0Stargazers:228Issues:3Issues:31

backpacks-flash-attn

The original Backpack Language Model implementation, a fork of FlashAttention

Language:PythonLicense:BSD-3-ClauseStargazers:63Issues:2Issues:4

g2g-transformer

Pytorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement”

Language:PythonLicense:GPL-2.0Stargazers:61Issues:7Issues:5

G2GTr

Pytorch implementation of Graph-to-Graph Transformer for Transition-based Dependency Parsing accepted to EMNLP 2020

Language:PythonLicense:GPL-2.0Stargazers:21Issues:2Issues:4

UGEN

Incorporating Instructional Prompts into A Unified Generative Framework for Joint Multiple Intent Detection and Slot Filling - Coling2022(Oral))

tlm

The public code of EMNLP2023 (main conference) paper "TLM: Token-Level Masking for Transformers"

Language:PythonLicense:MITStargazers:4Issues:1Issues:1