MachineLP's starred repositories

klaam

Arabic speech recognition, classification and text-to-speech.

Language:Jupyter NotebookLicense:MITStargazers:314Issues:0Issues:0

MASR

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

Language:PythonLicense:Apache-2.0Stargazers:548Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10322Issues:0Issues:0

Face-Recognition-SDK-Javascript

Face Recognition Javascript SDK using ONNX Runtime Web and OpenCV.js (Face Detection, Face Landmarks, Face Liveness, Face Pose, Face Expression, Eye Closeness, Age, Gender and Face Recognition)

Language:JavaScriptLicense:MITStargazers:55Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:29407Issues:0Issues:0

PassportEye

Extraction of machine-readable zone information from passports, visas and id-cards via OCR

Language:PythonLicense:MITStargazers:366Issues:0Issues:0

pytriton

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Language:PythonLicense:Apache-2.0Stargazers:672Issues:0Issues:0

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language:PythonLicense:Apache-2.0Stargazers:5645Issues:0Issues:0

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language:PythonLicense:NOASSERTIONStargazers:3499Issues:0Issues:0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10237Issues:0Issues:0

backgroundremover

Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.

Language:PythonLicense:MITStargazers:6316Issues:0Issues:0

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language:Jupyter NotebookLicense:MITStargazers:3436Issues:0Issues:0

KuiperInfer

带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

Language:C++License:MITStargazers:2031Issues:0Issues:0

opencv-mobile

The minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, WebAssembly

Language:C++License:Apache-2.0Stargazers:2171Issues:0Issues:0

Dummy-Robot

我的超迷你机械臂机器人项目。

Language:CStargazers:11432Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:44532Issues:0Issues:0

torchdistill

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.

Language:PythonLicense:MITStargazers:1287Issues:0Issues:0

milvus

A cloud-native vector database, storage for next generation AI applications

Language:GoLicense:Apache-2.0Stargazers:27263Issues:0Issues:0

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonLicense:Apache-2.0Stargazers:22257Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7602Issues:0Issues:0

Real-World-Masked-Face-Dataset

Real-World Masked Face Dataset,口罩人脸数据集

Language:PythonStargazers:1920Issues:0Issues:0

MaskTheFace

Convert face dataset to masked dataset

Language:PythonLicense:MITStargazers:561Issues:0Issues:0

pytorch_face_landmark

Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up to 100 FPS landmark inference speed with SOTA face detector on CPU.

Language:PythonStargazers:812Issues:0Issues:0

FaceDataSets_Manual_Annotation_Tools

Face attributes manual annotatior python tool projects

Language:PythonLicense:Apache-2.0Stargazers:7Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:28926Issues:0Issues:0

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:28219Issues:0Issues:0

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonLicense:NOASSERTIONStargazers:13627Issues:0Issues:0

CodeFormer-ncnn

ncnn version of CodeFormer

Language:C++Stargazers:96Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:53542Issues:0Issues:0

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonLicense:BSD-3-ClauseStargazers:7452Issues:0Issues:0