Haider Asad (haiderasad)

haiderasad

Geek Repo

Location:Pakistan

Github PK Tool:Github PK Tool

Haider Asad's starred repositories

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:3934Issues:0Issues:0

Audio-and-text-based-emotion-recognition

A multimodal approach on emotion recognition using audio and text.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:142Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++License:MITStargazers:28693Issues:0Issues:0

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8853Issues:0Issues:0

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonLicense:Apache-2.0Stargazers:530Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:11661Issues:0Issues:0
Language:PythonLicense:MITStargazers:509Issues:0Issues:0

WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

Language:Jupyter NotebookLicense:MITStargazers:200Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:6945Issues:0Issues:0

ctranslate2_triton_backend

Triton backend for https://github.com/OpenNMT/CTranslate2

Language:C++License:MITStargazers:28Issues:0Issues:0

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language:Jupyter NotebookLicense:MITStargazers:3465Issues:0Issues:0

camelot

Camelot: PDF Table Extraction for Humans

Language:PythonLicense:NOASSERTIONStargazers:3585Issues:0Issues:0

dedoc

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

Language:PythonLicense:Apache-2.0Stargazers:89Issues:0Issues:0

Table-Detection-Extraction

Detect the tables in a form and extract the tables as well as the cells of the tables.

Language:PythonLicense:MITStargazers:55Issues:0Issues:0

doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Language:PythonLicense:Apache-2.0Stargazers:3177Issues:0Issues:0

deepdoctection

A Repo For Document AI

Language:PythonLicense:Apache-2.0Stargazers:2269Issues:0Issues:0

CascadeTabNet

This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"

Language:PythonLicense:MITStargazers:1448Issues:0Issues:0

Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

Language:Jupyter NotebookLicense:MITStargazers:242Issues:0Issues:0

OCR_tablenet

TableNet Implementation on Pytorch

Language:PythonStargazers:145Issues:0Issues:0

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonLicense:BSD-3-ClauseStargazers:7516Issues:0Issues:0

awesome-faceReenactment

papers about Face Reenactment/Talking Face Generation

Stargazers:427Issues:0Issues:0

Wav2Lip-GFPGAN

High quality Lip sync

Language:PythonStargazers:934Issues:0Issues:0

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonLicense:NOASSERTIONStargazers:13694Issues:0Issues:0

Lip_Wise

Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.

Language:PythonLicense:Apache-2.0Stargazers:42Issues:0Issues:0

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Language:PythonStargazers:865Issues:0Issues:0

Auto-Synced-Translated-Dubs

Automatically translates the text of a video based on a subtitle file, and also uses AI voice to dub the video, and synced using the subtitle's timings

Language:PythonLicense:GPL-3.0Stargazers:1510Issues:0Issues:0

pydub

Manipulate audio with a simple and easy high level interface

Language:PythonLicense:MITStargazers:8443Issues:0Issues:0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:47Issues:0Issues:0

DPE

[CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing

Language:PythonLicense:MITStargazers:405Issues:0Issues:0

T2M-GPT

(CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”

Language:PythonLicense:Apache-2.0Stargazers:536Issues:0Issues:0