Haider Asad's starred repositories
Audio-and-text-based-emotion-recognition
A multimodal approach to emotion recognition using audio and text.
mistral-inference
Official inference library for Mistral models
tensorrtllm_backend
The Triton TensorRT-LLM Backend
WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
ctranslate2_triton_backend
Triton backend for https://github.com/OpenNMT/CTranslate2
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
dedoc
Dedoc is a library (service) for automated document parsing and conversion to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser)
Table-Detection-Extraction
Detect the tables in a form and extract the tables as well as the cells of the tables.
deepdoctection
A Repo For Document AI
CascadeTabNet
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition
OCR_tablenet
TableNet implementation in PyTorch
awesome-faceReenactment
Papers about Face Reenactment/Talking Face Generation
Wav2Lip-GFPGAN
High-quality lip sync
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Auto-Synced-Translated-Dubs
Automatically translates the text of a video based on a subtitle file, dubs the video with an AI voice, and syncs the dub using the subtitle's timings
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild