Qingsong Liu's repositories
3ddfav2_cpp
The C++ version of 3ddfav2
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
CTCWordBeamSearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
FasterTransformer
Transformer-related optimization, including BERT and GPT
GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
NeuralVoicePuppetry
This GitHub repository contains the network architectures of NeuralVoicePuppetry.
ocrevalUAtion
OCR evaluation brought to you by University of Alicante
pdf-to-markdown
A PDF to Markdown converter
pycorrector
pycorrector is a toolkit for text error correction. It implements models including Kenlm, ConvSeq2Seq, BERT, MacBERT, ELECTRA, ERNIE, Transformer, and T5, and works out of the box.
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
text-generation-inference
Large Language Model Text Generation Inference
Yet-Another-OCR
Flask website integrated with Tesseract-OCR for reading multiple images, extracting their text, and saving it to a Word, PDF, or txt file 🖼🡆🆎 [finished]