imloama

Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper & ChatBot - Automatically Detect, Segment and Generate Anything with Image, Text, and Speech Inputs

Language: Jupyter Notebook · Apache-2.0 · 0 · 0 · 0

lenis

How smooth scroll should be

Language: JavaScript · 0 · 0 · 0

libs

0 · 1 · 0

lizzie

Lizzie - Leela Zero Interface

GPL-3.0 · 0 · 0 · 0

MeloTTS

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.

Language: Python · MIT · 0 · 0 · 0

MINI_LLM

A repository for individuals to experiment with and reproduce the LLM pre-training process.

0 · 0 · 0

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

0 · 0 · 0

PlayEdu

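PlayEdu is an open-source system for building internal training platforms, designed to help companies and organizations create an internal training platform under their own brand.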

Language: Java · Apache-2.0 · 0 · 0 · 0

pyvideotrans

Translate a video from one language to another and add dubbing.

Language: Python · GPL-3.0 · 0 · 0 · 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook · Apache-2.0 · 0 · 0 · 0

stt

Voice Recognition to Text Tool / A local speech-to-text service that runs offline and outputs JSON, SRT subtitles with timestamps, and plain text.

Language: Python · GPL-3.0 · 0 · 0 · 0

Umi-OCR

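Fully offline OCR software that converts images to text. Supports screenshots and batch image import, multiple languages, paragraph merging, and vertical text. Watermark regions can be excluded to extract clean text. Based on PaddleOCR.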

Language: Python · MIT · 0 · 0 · 0

video-edit-demo

A simple web-based video editor built on ffmpeg.js.

Language: Vue · 0 · 0 · 0

wire-pod

Fully-featured server software for the Anki (now Digital Dream Labs) Vector robot.

Language: Go · MIT · 0 · 0 · 0

XunFeiTTS

XunFei text-to-speech integration for Unreal Engine 5.

Language: C++ · 0 · 0 · 0