Marcin Wielgus's starred repositories
LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
postgresml
Postgres with GPUs for ML/AI apps.
sveltekit-sse
Server Sent Events with SvelteKit
localcache
Local file-based atomic cache manager
augmentoolkit
Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.
llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
Awesome-LLMs-Datasets
Summarize existing representative LLMs text datasets.
neural-speed
An innovative library for efficient LLM inference via low-bit quantization
quivr
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
guardrails
Adding guardrails to large language models.
docker-on-top
Docker volume driver: mount host directory with copy-on-write
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation