cduk's starred repositories

docling

Get your documents ready for gen AI

Language: Python | License: MIT | Stargazers: 18207 | Issues: 84 | Issues: 327

gpt-researcher

LLM-based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.

Language: Python | License: Apache-2.0 | Stargazers: 15705 | Issues: 131 | Issues: 413

anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Language: Jupyter Notebook | License: MIT | Stargazers: 9309 | Issues: 264 | Issues: 44

Language: TypeScript | License: Apache-2.0 | Stargazers: 8960 | Issues: 60 | Issues: 96

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8681 | Issues: 175 | Issues: 2391

moondream

tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6683 | Issues: 60 | Issues: 140

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language: C++ | License: Apache-2.0 | Stargazers: 6085 | Issues: 41 | Issues: 88

ms-swift

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).

Language: Python | License: Apache-2.0 | Stargazers: 5038 | Issues: 23 | Issues: 1541

smol-course

A course on aligning smol models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4454 | Issues: 28 | Issues: 31

OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Language: Jupyter Notebook | License: MIT | Stargazers: 3370 | Issues: 81 | Issues: 136

cloudflare-ddns

🎉🌩️ Dynamic DNS (DDNS) service based on Cloudflare! Access your home network remotely via a custom domain name without a static IP!

Language: Python | License: GPL-3.0 | Stargazers: 3238 | Issues: 23 | Issues: 101

Automated-AI-Web-Researcher-Ollama

A Python program that turns an LLM running on Ollama into an automated researcher: from a single query it determines focus areas to investigate, runs web searches, scrapes content from relevant websites, and conducts the research on its own. It can also save the findings for you, among other features.

Language: Python | License: MIT | Stargazers: 2496 | Issues: 30 | Issues: 37

modded-nanogpt

NanoGPT (124M) in 3.4 minutes

Language: Python | License: MIT | Stargazers: 2064 | Issues: 30 | Issues: 29

hertz-dev

first base model for full-duplex conversational audio

Language: Python | License: Apache-2.0 | Stargazers: 1668 | Issues: 19 | Issues: 26

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch

Language: Python | License: MIT | Stargazers: 1457 | Issues: 51 | Issues: 22

hlb-CIFAR10

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Language: Python | License: Apache-2.0 | Stargazers: 1236 | Issues: 19 | Issues: 3

ChatterUI

Simple frontend for LLMs built in React Native.

Language: TypeScript | License: AGPL-3.0 | Stargazers: 755 | Issues: 12 | Issues: 142

inswapper

One-click Face Swapper and Restoration powered by insightface 🔥

e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch

Language: Python | License: MIT | Stargazers: 404 | Issues: 26 | Issues: 25

pve-backup-server-dockerfiles

Unofficial and unmaintained build of proxmox-backup-server

openai-edge-tts

Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs

Language: Python | License: GPL-3.0 | Stargazers: 232 | Issues: 4 | Issues: 10

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 195 | Issues: 7 | Issues: 1

Language: Python | License: Apache-2.0 | Stargazers: 79 | Issues: 4 | Issues: 2

Language: Python | License: MIT | Stargazers: 76 | Issues: 7 | Issues: 3

BlahST

Input text from speech in any Linux window in a lean, fast, and accurate way, using whisper.cpp offline. Speak with local LLMs.

Language: Shell | License: BSD-3-Clause | Stargazers: 52 | Issues: 4 | Issues: 5

Lucid_Vision

This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized conversations about images with their favorite language models and to communicate directly with vision models.

pascal-pkgs-ci

The main repository for building Pascal-compatible versions of ML applications and libraries.

Language: Shell | License: MIT | Stargazers: 34 | Issues: 2 | Issues: 3

PBQA

Pattern Based Question and Answer

Language: Python | License: MIT | Stargazers: 3 | Issues: 0 | Issues: 0