mst272

mst272

Geek Repo

Company:Southwest Jiaotong University

Github PK Tool:Github PK Tool

mst272's starred repositories

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:28539Issues:301Issues:92

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:12152Issues:99Issues:534

PhotoMaker

PhotoMaker [CVPR 2024]

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:9402Issues:104Issues:161

segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Language:PythonLicense:MITStargazers:9129Issues:79Issues:620

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8231Issues:72Issues:409

CodeGeeX

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Language:PythonLicense:Apache-2.0Stargazers:8158Issues:87Issues:217

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6970Issues:74Issues:205

pycorrector

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:5517Issues:84Issues:470

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4483Issues:47Issues:193

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:2566Issues:24Issues:27

baby-llama2-chinese

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Language:PythonLicense:MITStargazers:2475Issues:17Issues:75

CodeXGLUE

CodeXGLUE

mamba.py

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Language:PythonLicense:MITStargazers:933Issues:7Issues:39

makeMoE

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language:Jupyter NotebookLicense:MITStargazers:580Issues:7Issues:3

Machine-Mindset

An MBTI Exploration of Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:456Issues:7Issues:2

Focal-Loss-Pytorch

全中文注释.(The loss function of retinanet based on pytorch).(You can use it on one-stage detection task or classifical task, to solve data imbalance influence).用于one-stage目标检测算法,提升检测效果.你也可以在分类任务中使用该损失函数,解决数据不平衡问题.

Language:Jupyter NotebookStargazers:453Issues:5Issues:19

NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Language:PythonLicense:MITStargazers:373Issues:11Issues:14

SenseEarth2020-ChangeDetection

1st place solution to the Satellite Remote Sensing Image Change Detection Challenge hosted by SenseTime

Language:PythonLicense:MITStargazers:283Issues:4Issues:28

dora

Implementation of DoRA

Language:PythonLicense:MITStargazers:278Issues:10Issues:2

core-pytorch-utils

Yet another PyTorch Trainer and some core components for deep learning.

Language:PythonLicense:MITStargazers:202Issues:4Issues:4

grouped-query-attention-pytorch

(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)

Language:PythonLicense:MITStargazers:120Issues:3Issues:4

xCodeEval

xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval

Language:PythonLicense:MITStargazers:73Issues:4Issues:8

codefuse-evaluation

Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing.企业级代码大模型评测体系,持续开放中

Language:PythonLicense:NOASSERTIONStargazers:72Issues:3Issues:4

Semantic-Change-Detection

This repository is for semantic change detection.

dfc2021-msd-baseline

A baseline for the "Multitemporal Semantic Change Detection" track of the 2021 IEEE GRSS Data Fusion Competition.

Language:Jupyter NotebookStargazers:39Issues:3Issues:3

ClearSCD

A remote sensing semantic change detection model

baby-llama2-chinese_fix

使用单个24G显卡,从0开始训练LLM

Language:PythonLicense:MITStargazers:31Issues:1Issues:0

transformer-pytorch

A pytorch Implementation of the Transformer: Attention Is All You Need

Language:PythonStargazers:9Issues:2Issues:0

KoNEFTune

Random Noisy Embeddings with fine-tuning 방법론을 한국어 LLM에 간단히 적용할 수 있는 Kosy🍵llama

Language:PythonLicense:MITStargazers:8Issues:2Issues:0