王恒 (jiyuxuan926)

Location: CN

王恒's repositories

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

ChatGLM-Finetuning

Fine-tuning of the ChatGLM-6B and ChatGLM2-6B models for specific downstream tasks, covering Freeze, LoRA, P-tuning, and full-parameter fine-tuning

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

CogVLM

A state-of-the-art open visual language model | multimodal pretrained model

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Efficient-3DCNNs

PyTorch implementation of "Resource Efficient 3D Convolutional Neural Networks", with code and pretrained models.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

head-pose-estimation

Realtime human head pose estimation with ONNXRuntime and OpenCV.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

LLaMA-Efficient-Tuning

Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MedicalGPT

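MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains a medical large language model, covering continued pretraining, supervised fine-tuning, reward modeling, and reinforcement learning.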

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

PaddleSpeech

An Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

tensorrtx

Implementation of popular deep learning networks with TensorRT network definition API

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

ultralytics

YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

visual-chatgpt

Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0