hok's repositories

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CLicense:MITStargazers:0Issues:0Issues:0

Awesome-Anything

AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask

Stargazers:0Issues:0Issues:0

backend

PlayEdu 后台管理前端程序

License:Apache-2.0Stargazers:0Issues:0Issues:0

buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Dango-Translator

团子翻译器 —— 个人兴趣制作的一款基于OCR技术的翻译器

Language:PythonLicense:LGPL-2.1Stargazers:0Issues:0Issues:0

FastChat

The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

License:MITStargazers:0Issues:0Issues:0

frontend

PlayEdu PC前端项目

License:Apache-2.0Stargazers:0Issues:0Issues:0

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect , Segment and Generate Anything with Image and Text Inputs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

immersive-translate

Immersive Dual Web Page Translation Extension - 沉浸式双语网页翻译扩展

License:NOASSERTIONStargazers:0Issues:0Issues:0

kuroshiro

Japanese language library for converting Japanese sentence to Hiragana, Katakana or Romaji with furigana and okurigana modes supported.

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

License:Apache-2.0Stargazers:0Issues:0Issues:0

OCR-SAM

Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting

Stargazers:0Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

License:Apache-2.0Stargazers:0Issues:0Issues:0

PlayEdu

PlayEdu 是一款适用于搭建内部培训平台的开源系统,旨在为企业/机构打造自己品牌的内部培训平台。

License:Apache-2.0Stargazers:0Issues:0Issues:0

playground

A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.

License:Apache-2.0Stargazers:0Issues:0Issues:0

pytesseract

A Python wrapper for Google Tesseract

License:Apache-2.0Stargazers:0Issues:0Issues:0

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

License:AGPL-3.0Stargazers:0Issues:0Issues:0

STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

License:MPL-2.0Stargazers:0Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

textshot

Python tool for grabbing text via screenshot

License:MITStargazers:0Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

License:MITStargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0
License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Umi-OCR

OCR图片转文字识别软件,完全离线。截屏/批量导入图片,支持多国语言、合并段落、竖排文字。可排除水印区域,提取干净的文本。基于 PaddleOCR 。

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

unit-minions

《AI 研发提效研究:自己动手训练 LoRA》,包含 Llama (Alpaca LoRA)模型、ChatGLM (ChatGLM Tuning)相关 Lora 的训练。训练内容:用户故事生成、测试代码生成、代码辅助生成、文本转 SQL、文本生成代码……

Stargazers:0Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

whispercpp

Pybind11 bindings for Whisper.cpp

License:Apache-2.0Stargazers:0Issues:0Issues:0

yuzu

Nintendo Switch Emulator

License:GPL-3.0Stargazers:0Issues:0Issues:0