Beast code in Giters

王恒's repositories

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

GPL-3.0 | 0 | 0 | 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

BSD-3-Clause | 0 | 0 | 0

ChatGLM-Finetuning

Fine-tuning of the ChatGLM-6B and ChatGLM2-6B models on specific downstream tasks, covering Freeze, LoRA, P-tuning, and full-parameter fine-tuning.

Language: Python | 0 | 0 | 0

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

NOASSERTION | 0 | 0 | 0

CogVLM

A state-of-the-art open visual language model | Multimodal pre-trained model

Apache-2.0 | 0 | 0 | 0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

MPL-2.0 | 0 | 0 | 0

Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Apache-2.0 | 0 | 0 | 0

Efficient-3DCNNs

PyTorch implementation of "Resource Efficient 3D Convolutional Neural Networks", with code and pretrained models.

Language: Python | MIT | 0 | 1 | 0

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Apache-2.0 | 0 | 0 | 0

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs

Apache-2.0 | 0 | 0 | 0

head-pose-estimation

Realtime human head pose estimation with ONNXRuntime and OpenCV.

MIT | 0 | 0 | 0

LLaMA-Efficient-Tuning

Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)

Language: Python | Apache-2.0 | 0 | 0 | 0

MedicalGPT

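MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains a medical large language model, covering continued pre-training, supervised fine-tuning, reward modeling, and reinforcement learning.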

Apache-2.0 | 0 | 0 | 0

openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

Apache-2.0 | 0 | 0 | 0

PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Language: Python | Apache-2.0 | 0 | 1 | 0

PaddleSpeech

An Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Apache-2.0 | 0 | 0 | 0

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language: Python | MIT | 0 | 0 | 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | Apache-2.0 | 0 | 0 | 0

tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

Language: Python | MIT | 0 | 0 | 0

TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Language: Python | Apache-2.0 | 0 | 1 | 0

tensorrtx

Implementation of popular deep learning networks with TensorRT network definition API

MIT | 0 | 0 | 0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language: Python | MIT | 0 | 0 | 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | MPL-2.0 | 0 | 1 | 0

ultralytics

YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite

Language: Python | GPL-3.0 | 0 | 1 | 0

visual-chatgpt

Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

MIT | 0 | 0 | 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | MIT | 0 | 0 | 0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

GPL-3.0 | 0 | 0 | 0

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

GPL-3.0 | 0 | 0 | 0

jiyuxuan926
