shaohua.zhang (BeyondYourself)

Location: Hangzhou, China

shaohua.zhang's starred repositories

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language: Python | License: AGPL-3.0 | Stargazers: 27563 | Issues: 154 | Issues: 8413

DeepFaceLive

Real-time face swap for PC streaming or video calls

Language: Python | License: GPL-3.0 | Stargazers: 25574 | Issues: 361 | Issues: 144

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | Stargazers: 22746 | Issues: 223 | Issues: 129

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python | License: Apache-2.0 | Stargazers: 18459 | Issues: 169 | Issues: 1291

sing-box

The universal proxy platform

Language: Go | License: NOASSERTION | Stargazers: 17831 | Issues: 137 | Issues: 1614

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language: Python | License: Apache-2.0 | Stargazers: 9441 | Issues: 88 | Issues: 723

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Language: Python | License: Apache-2.0 | Stargazers: 9263 | Issues: 77 | Issues: 1479

FreeAskInternet

FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator and answer generator using MULTI LLMs, with no GPU needed. The user asks a question, and the system runs a multi-engine search, passes the combined search results to an LLM, and generates an answer based on those results. It's all FREE to use.

Language: Python | License: Apache-2.0 | Stargazers: 8407 | Issues: 55 | Issues: 78

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 7359 | Issues: 88 | Issues: 121

awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 6059 | Issues: 37 | Issues: 292

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 4440 | Issues: 62 | Issues: 177

llm-universe

This project is a tutorial on large-model application development for beginner developers. Read it online at https://datawhalechina.github.io/llm-universe/

Language: Jupyter Notebook | Stargazers: 4173 | Issues: 20 | Issues: 44

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python | License: GPL-3.0 | Stargazers: 4171 | Issues: 38 | Issues: 414

poster-design

A beautiful and powerful online poster designer and image editor, modeled on Gaoding Design, suitable for many scenarios: poster generation, e-commerce product images, long article images, video and WeChat official account covers, and more. It aims to make design easier.

Language: Vue | License: MIT | Stargazers: 3439 | Issues: 21 | Issues: 71

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language: Python | License: GPL-3.0 | Stargazers: 3423 | Issues: 29 | Issues: 560

DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python | License: Apache-2.0 | Stargazers: 2285 | Issues: 31 | Issues: 118

PyTorch-Tutorial-2nd

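"A Practical PyTorch Tutorial" (2nd edition). Whether you are starting from zero, applying PyTorch to CV, NLP, or LLM projects, or moving on to production engineering and deployment, it is all covered here. With the help of this book, readers should be able to master PyTorch with ease and become excellent deep learning engineers.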

Language: Jupyter Notebook | Stargazers: 2148 | Issues: 9 | Issues: 20

MagicClothing

Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis

Language: Python | License: NOASSERTION | Stargazers: 1304 | Issues: 39 | Issues: 89

BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Language: Python | License: NOASSERTION | Stargazers: 1299 | Issues: 42 | Issues: 63

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language: C++ | License: Apache-2.0 | Stargazers: 1281 | Issues: 31 | Issues: 153

Tutorial

LLM&VLM Tutorial

FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 700 | Issues: 10 | Issues: 41

OCR-SAM

Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with several downstream tasks, e.g., Text Removal and Text Inpainting

label-studio-ml-backend

Configs and boilerplates for Label Studio's Machine Learning backend

Language: Python | License: Apache-2.0 | Stargazers: 501 | Issues: 15 | Issues: 221

mlc-MiniCPM

MiniCPM on Android platform.

Language: Python | License: Apache-2.0 | Stargazers: 485 | Issues: 6 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 288 | Issues: 3 | Issues: 26

language-models

Pre-trained Language Models

Language: Jupyter Notebook | Stargazers: 275 | Issues: 20 | Issues: 12

VLE

VLE: Vision-Language Encoder (a vision-language multimodal pre-trained model)

Language: Python | License: Apache-2.0 | Stargazers: 176 | Issues: 6 | Issues: 8

FineControlNet

Official PyTorch Implementation of "FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection", 2023

Language: Python | License: NOASSERTION | Stargazers: 174 | Issues: 8 | Issues: 3