youtaya's starred repositories

Chital

A native macOS app for chatting with local LLMs

Language:SwiftStargazers:243Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:12228Issues:0Issues:0

Catch2

A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)

Language:C++License:BSL-1.0Stargazers:18670Issues:0Issues:0

LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:31654Issues:0Issues:0

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:13789Issues:0Issues:0

mlx

MLX: An array framework for Apple silicon

Language:C++License:MITStargazers:17086Issues:0Issues:0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonLicense:MITStargazers:18873Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10917Issues:0Issues:0

CasaOS

CasaOS - A simple, easy-to-use, elegant open-source Personal Cloud system.

Language:GoLicense:Apache-2.0Stargazers:25875Issues:0Issues:0

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:587Issues:0Issues:0

Modern-CPP-Programming

Modern C++ Programming Course (C++03/11/14/17/20/23/26)

Language:HTMLStargazers:12073Issues:0Issues:0

generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:64921Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:9171Issues:0Issues:0

infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

Language:C++License:Apache-2.0Stargazers:2604Issues:0Issues:0

devv

An AI-powered search engine for developers.

Stargazers:1428Issues:0Issues:0

gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL

Language:TypeScriptLicense:ISCStargazers:18821Issues:0Issues:0

ant

Ant game engine

Language:LuaLicense:MITStargazers:3823Issues:0Issues:0

fully-local-pdf-chatbot

Yes, it's another chat over documents implementation... but this one is entirely local!

Language:TypeScriptLicense:MITStargazers:1676Issues:0Issues:0

flask-extension-status

Let's make Flask ecosystem better together!

Language:PythonLicense:MITStargazers:56Issues:0Issues:0

ChatAnything

Official Repo for the Paper: CHATANYTHING: FACETIME CHAT WITH LLM-ENHANCED PERSONAS

Language:PythonStargazers:379Issues:0Issues:0

chisel

Chisel is a collection of LLDB commands to assist debugging iOS apps.

Language:PythonLicense:MITStargazers:9125Issues:0Issues:0

system-design-101

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

License:NOASSERTIONStargazers:64187Issues:0Issues:0

awesome-bbs-signature

A curated list of awesome bbs signature specifications, libraries, software and resources

Stargazers:35Issues:0Issues:0

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10700Issues:0Issues:0

slam_in_autonomous_driving

《自动驾驶中的SLAM技术》对应开源代码

Language:C++Stargazers:1966Issues:0Issues:0

buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Language:PythonLicense:MITStargazers:12477Issues:0Issues:0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:NOASSERTIONStargazers:168140Issues:0Issues:0

video-subtitle-extractor

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Language:PythonLicense:Apache-2.0Stargazers:6060Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++License:MITStargazers:35535Issues:0Issues:0

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:36585Issues:0Issues:0