Fengxi ZHANG (XCJinggai)

Company: Shanghai Jiaotong University

Location: Shanghai

Fengxi ZHANG's starred repositories

Language: Python | License: MIT | Stargazers: 136 | Issues: 0

visqol

Perceptual Quality Estimator for speech and audio

Language: C++ | License: Apache-2.0 | Stargazers: 653 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8893 | Issues: 0

FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis and music generation.

Language: Python | License: MIT | Stargazers: 323 | Issues: 0

USLM

Unified Speech Language Model for the paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024)

Language: Python | Stargazers: 124 | Issues: 0

1d-tokenizer

This repo contains the code for our paper "An Image is Worth 32 Tokens for Reconstruction and Generation".

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 298 | Issues: 0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in PyTorch

Language: Python | License: MIT | Stargazers: 2238 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 18419 | Issues: 0

zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Language: Jupyter Notebook | License: MIT | Stargazers: 2728 | Issues: 0

MaMMUT-pytorch

Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in PyTorch

Language: Python | License: MIT | Stargazers: 95 | Issues: 0

EMPatches

Extract and merge image patches for easy, fast, and self-contained digital image processing and deep learning model training.

Language: Jupyter Notebook | Stargazers: 45 | Issues: 0

patchify.py

A library that helps you split an image into small, overlapping patches and merge the patches back into the original image.

Language: Python | License: MIT | Stargazers: 201 | Issues: 0

LM4LV

🔥Official PyTorch implementation for "LM4LV: A Frozen Large Language Model for Low-level Vision Tasks".

Language: Python | License: Apache-2.0 | Stargazers: 30 | Issues: 0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python | License: MIT | Stargazers: 11329 | Issues: 0

libjxl

JPEG XL image format reference implementation

Language: C++ | License: BSD-3-Clause | Stargazers: 2388 | Issues: 0

L3C-PyTorch

PyTorch implementation of the CVPR'19 paper "Practical Full Resolution Learned Lossless Image Compression"

Language: Python | License: GPL-3.0 | Stargazers: 392 | Issues: 0

libbpg-py

A pure Python binding for BPG (Better Portable Graphics)

Language: Python | Stargazers: 22 | Issues: 0

imageio-flif

imageio plugin with FLIF wrapper for Python

License: AGPL-3.0 | Stargazers: 1 | Issues: 0

pyFLIF

ctypes-based Python wrapper for the FLIF library

Language: Python | License: LGPL-3.0 | Stargazers: 2 | Issues: 0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8816 | Issues: 0

llama3-from-scratch

Llama 3 implementation, one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 11569 | Issues: 0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Language: Python | License: MIT | Stargazers: 19062 | Issues: 0

CompressAI-Vision

CompressAI-Vision helps you design, test, and compare Video Compression for Machines pipelines. Compression methods can be either custom AI-based modules pulled from CompressAI or traditional codecs such as H.266/VVC.

Language: Python | License: BSD-3-Clause-Clear | Stargazers: 80 | Issues: 0

CompressAI

A PyTorch library and evaluation platform for end-to-end compression research

Language: Python | License: BSD-3-Clause-Clear | Stargazers: 1115 | Issues: 0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++ | License: Apache-2.0 | Stargazers: 9892 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9822 | Issues: 0

SpeechTokenizer

This is the code for the SpeechTokenizer presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on

Language: Python | License: Apache-2.0 | Stargazers: 380 | Issues: 0

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language: Python | License: Apache-2.0 | Stargazers: 916 | Issues: 0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in PyTorch

Language: Python | License: MIT | Stargazers: 502 | Issues: 0

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | License: Apache-2.0 | Stargazers: 3962 | Issues: 0