ReactiveThings's starred repositories

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:20674Issues:0Issues:0

proposal-signals

A proposal to add signals to JavaScript.

License:MITStargazers:2995Issues:0Issues:0

awesome-lego-machine-learning

A curated list of resources dedicated to Machine Learning applications to LEGO bricks

Stargazers:307Issues:0Issues:0
Language:Jupyter NotebookStargazers:11Issues:0Issues:0

supervision

We write your reusable computer vision tools. 💜

Language:PythonLicense:MITStargazers:14994Issues:0Issues:0

jscpd

Copy/paste detector for programming source code.

Language:TypeScriptLicense:MITStargazers:4587Issues:0Issues:0

linak-controller

A Python script to control Linak standing desks.

Language:PythonLicense:MITStargazers:343Issues:0Issues:0

textra

A command-line application to convert images, PDFs, and audio files to text using Apple's APIs

Language:SwiftLicense:MITStargazers:587Issues:0Issues:0

LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Language:PythonLicense:MITStargazers:329Issues:0Issues:0

edit-distance-js

JavaScript library to compute the edit distance for strings and trees.

Language:CoffeeScriptLicense:Apache-2.0Stargazers:28Issues:0Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18314Issues:0Issues:0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptLicense:Apache-2.0Stargazers:17040Issues:0Issues:0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:8276Issues:0Issues:0

tsgm

Generation and evaluation of synthetic time series datasets (also, augmentations, visualizations, a collection of popular datasets)

Language:PythonLicense:Apache-2.0Stargazers:102Issues:0Issues:0

system-design-101

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

License:NOASSERTIONStargazers:59672Issues:0Issues:0

slides

Terminal based presentation tool

Language:GoLicense:MITStargazers:9340Issues:0Issues:0

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:27132Issues:0Issues:0

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonLicense:Apache-2.0Stargazers:1434Issues:0Issues:0

postgresml

The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.

Language:RustLicense:MITStargazers:5611Issues:0Issues:0

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Stargazers:5582Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:39710Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:562Issues:0Issues:0
Language:Jupyter NotebookStargazers:11Issues:0Issues:0

TypeChat

TypeChat is a library that makes it easy to build natural language interfaces using types.

Language:TypeScriptLicense:MITStargazers:7981Issues:0Issues:0

SlideVQA

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Language:PythonLicense:NOASSERTIONStargazers:57Issues:0Issues:0

deepdoctection

A Repo For Document AI

Language:PythonLicense:Apache-2.0Stargazers:2304Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:18911Issues:0Issues:0

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonLicense:MITStargazers:5446Issues:0Issues:0

llama2_aided_tesseract

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections, complete with options for text validation and hallucination filtering.

Language:PythonStargazers:215Issues:0Issues:0

OCR-Form-Tools

A set of tools to use in Microsoft Azure Form Recognizer and OCR services.

Language:TypeScriptLicense:MITStargazers:506Issues:0Issues:0