leafiy's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:65104Issues:543Issues:0

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonLicense:MITStargazers:55075Issues:319Issues:274

nerd-fonts

Iconic font aggregator, collection, & patcher. 3,600+ icons, 50+ patched fonts: Hack, Source Code Pro, more. Glyph collections: Font Awesome, Material Design Icons, Octicons, & more

Language:CSSLicense:NOASSERTIONStargazers:52984Issues:390Issues:968

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:45760Issues:303Issues:658

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:28690Issues:371Issues:8231

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language:C++License:Apache-2.0Stargazers:26341Issues:494Issues:5036

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20299Issues:197Issues:367

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonLicense:NOASSERTIONStargazers:15654Issues:134Issues:615

auto-animate

A zero-config, drop-in animation utility that adds smooth transitions to your web app. You can use it with React, Vue, or any other JavaScript application.

Language:TypeScriptLicense:MITStargazers:12034Issues:22Issues:139

SafeLine

serve as a reverse proxy to protect your websites from attacks and exploits.

Language:GoLicense:GPL-3.0Stargazers:11259Issues:64Issues:787

firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

Language:TypeScriptLicense:AGPL-3.0Stargazers:8252Issues:52Issues:208

giscus

A comment system powered by GitHub Discussions. :octocat: :speech_balloon: :gem:

Language:TypeScriptLicense:MITStargazers:7778Issues:26Issues:314

cog

Containers for machine learning

Language:PythonLicense:Apache-2.0Stargazers:7529Issues:67Issues:709

gotenberg

A developer-friendly API for converting numerous document formats into PDF files, and more!

google-indexing-script

Script to get your site indexed on Google in less than 48 hours

Language:TypeScriptLicense:MITStargazers:6794Issues:26Issues:35

ChatTTS-ui

一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Language:PythonLicense:NOASSERTIONStargazers:5386Issues:37Issues:175

manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/

Language:PythonLicense:GPL-3.0Stargazers:4771Issues:42Issues:514

XHS-Downloader

小红书链接提取/作品采集工具:提取账号发布、收藏、点赞作品链接;提取搜索结果作品、用户链接;采集小红书作品信息;提取小红书作品下载地址;下载小红书无水印作品文件!

Language:PythonLicense:GPL-3.0Stargazers:4592Issues:22Issues:106

awesome-macos-screensavers

🍎 🖥 🎆 A curated list of screensavers for Mac OS X

talk

Group video call for the web. No signups. No downloads.

Language:JavaScriptLicense:MITStargazers:3881Issues:52Issues:35

biomes-game

Biomes is an open source sandbox MMORPG built for the web using web technologies such as Next.js, Typescript, React and WebAssembly.

Language:TypeScriptLicense:MITStargazers:2479Issues:25Issues:33

ChatTTS_colab

🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

Language:PythonLicense:MITStargazers:1486Issues:23Issues:65

metahuman-stream

Real time interactive streaming digital human

Language:PythonLicense:MITStargazers:1054Issues:21Issues:125

PowerSwitcher

Power plan switcher for Windows 10. Heavily inspired by EarTrumpet.

Language:C#License:MITStargazers:313Issues:13Issues:36

FIFO-Diffusion_public

Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training

comfyui-instantId-faceswap

Implementation of faceswap based on InstantID for ComfyUI.

Language:PythonLicense:Apache-2.0Stargazers:179Issues:4Issues:41

yara

A terminal-based companion program for ComfyUI.