ZxShen (FrankZxShen)

FrankZxShen

Geek Repo

Company:Southwest Jiaotong University

Location:Saturn

Github PK Tool:Github PK Tool

ZxShen's repositories

visual-chatgpt-zh-vits

visual-chatgpt支持中文的windows版本,融合vits推断模块

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

Amadeus

アマデウスver 1.0.4

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

ATLA-Demo

Source code for "Adversarial Training for Layout-Aware Text-VQA".

Language:PythonStargazers:1Issues:0Issues:0

EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. Optimize the residual module

Language:PythonLicense:GPL-3.0Stargazers:1Issues:0Issues:0

GameAudioCrawler

A script used to climb the wiki audios for some common games.

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

latr

Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

MNlexNet

This is the PyTorch version repository for MNIST dataset identification.

Language:PythonStargazers:1Issues:0Issues:0

so-vits-svc-audio2audio

Replace the song vocals to get the target audio.

Language:PythonStargazers:1Issues:0Issues:0

ATS-Demo

A ''demo'' of ATS (Adversarial Training with OCR-Level Perturbation Incorporation for Scene-Text Visual Question Answering)

Stargazers:0Issues:0Issues:0

Attention-Efficientzero-Alpaca-Lora-Webui

The Webui based on Alpaca-Lora+ChatGLM aims to visualize Atari game results of Efficientzero.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ChatGLM-webui

A WebUI for ChatGLM-6B

Language:PythonStargazers:0Issues:0Issues:0

depth_yolo

combination of darknet_ros and iai_kinect2

License:GPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

echarts

Apache ECharts is a powerful, interactive charting and data visualization library for browser

License:Apache-2.0Stargazers:0Issues:0Issues:0

efficient-vits-finetuning

Finetuning VITS Efficiently (Lora)

License:MITStargazers:0Issues:0Issues:0
Language:CSSStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

FrankZxshen.github.io

blog,随便创的

Language:StylusStargazers:0Issues:0Issues:0

genshin-gacha-export

原神抽卡记录导出

Stargazers:0Issues:0Issues:0

Grasscutter

A server software reimplementation for a certain anime game.

License:Apache-2.0Stargazers:0Issues:0Issues:0

LATLA

LLM portion of ATLA. Used to bring llama2 external knowledge into Text-VQA.

Language:PythonStargazers:0Issues:0Issues:0

Machine-Learning-Assignments

This project is only for SWJTU's students providing their assignments.

Stargazers:0Issues:0Issues:0

sklearn

from mofan python

Language:PythonStargazers:0Issues:0Issues:0

TAP

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral):Add prompt for LLM.

License:MITStargazers:0Issues:0Issues:0

terminal

The new Windows Terminal and the original Windows console host, all in the same place!

License:MITStargazers:0Issues:0Issues:0

visual-chatgpt-zh

visual-chatgpt支持中文版本

License:Apache-2.0Stargazers:0Issues:0Issues:0

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

License:Apache-2.0Stargazers:0Issues:0Issues:0

vits-fast-fineturing-infer

For vits fine-tuning inference.

Language:PythonStargazers:0Issues:0Issues:0