sirus20x6's starred repositories

data-is-better-together

Let's build better datasets, together!

Language:Jupyter NotebookStargazers:191Issues:0Issues:0

Vodalus-Expert-LLM-Forge

Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation editor Gradio UI.

Language:Jupyter NotebookStargazers:144Issues:0Issues:0

optuna-dashboard

Real-time Web Dashboard for Optuna.

Language:TypeScriptLicense:NOASSERTIONStargazers:491Issues:0Issues:0

graph-studio-next

GraphStudioNext is a tool for developers to build and test DirectShow Graphs

Language:C++Stargazers:350Issues:0Issues:0

nanoXLSTM

The simplest, fastest repository for training/finetuning medium-sized xLSTMs.

Language:PythonLicense:MITStargazers:38Issues:0Issues:0

llm

My attempt at an LLM from complete scratch!

Language:PythonStargazers:10Issues:0Issues:0

nox

Efficient fine-tuning for ko-llm models

Language:PythonLicense:Apache-2.0Stargazers:185Issues:0Issues:0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License:MITStargazers:3266Issues:0Issues:0

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:15650Issues:0Issues:0

simdjson

Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

Language:C++License:Apache-2.0Stargazers:18964Issues:0Issues:0

dynet

DyNet: The Dynamic Neural Network Toolkit

Language:C++License:Apache-2.0Stargazers:3416Issues:0Issues:0

vkvg

Vulkan 2D graphics library

Language:CLicense:MITStargazers:750Issues:0Issues:0

awesome-electron-alternatives

A curated list of awesome Electron alternatives.

License:MITStargazers:1568Issues:0Issues:0

nnstreamer

:twisted_rightwards_arrows: Neural Network (NN) Streamer, Stream Processing Paradigm for Neural Network Apps/Devices.

Language:C++License:LGPL-2.1Stargazers:692Issues:0Issues:0

phidata

Build AI Assistants with memory, knowledge and tools.

Language:PythonLicense:MPL-2.0Stargazers:10980Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:14628Issues:0Issues:0

Cyberpunk-Neon

Cyberpunk Neon Themes for KDE Plasma, GTK, Telegram, Tilix, Vim, Zim and more.

Language:CSSLicense:CC-BY-SA-4.0Stargazers:667Issues:0Issues:0
Language:CSSStargazers:89Issues:0Issues:0

llm-chain

`llm-chain` is a powerful rust crate for building chains in large language models allowing you to summarise text and complete complex tasks

Language:RustLicense:MITStargazers:1292Issues:0Issues:0

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonLicense:MITStargazers:18136Issues:0Issues:0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:12692Issues:0Issues:0

How-To-Secure-A-Linux-Server

An evolving how-to guide for securing a Linux server.

License:CC-BY-SA-4.0Stargazers:17174Issues:0Issues:0

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4352Issues:0Issues:0

fbpca

Fast Randomized PCA/SVD

Language:PythonLicense:NOASSERTIONStargazers:491Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:10551Issues:0Issues:0

neurallambda

Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.

Language:PythonLicense:NOASSERTIONStargazers:180Issues:0Issues:0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7349Issues:0Issues:0

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Language:PythonLicense:NOASSERTIONStargazers:2954Issues:0Issues:0

Cadetwriter

Developed as a by-product of the IBM 1620 Jr. project, the Cadetwriter is a general-purpose ASCII terminal. It can be connected via RS-232 or USB to any mini, micro, mainframe, or replica computer as a terminal device. A commercial quality IBM/Lexmark Wheelwriter 1000 was adapted by interposing a circuit board containing a Teensy 3.5 microcontroller between the typewriter's keyboard and motherboard. Custom firmware controls the typewriter and communicates with the host computer. To support the full ASCII character set, characters not on the printwheel are synthesized using overprinted characters and "period graphics". The Cadetwriter can print up to 16cps and is a reliable, low-maintenance, low-cost substitute for Teletype, DECwriter, Diablo, Spinwriter, Imagewriter, etc. teleprinters.

Language:C++Stargazers:53Issues:0Issues:0