RealNLP's starred repositories

PythonNumericalDemos

Well-documented Python demonstrations for spatial data analytics, geostatistics, and machine learning to support my courses.

Language: Jupyter Notebook | License: MIT | Stargazers: 1374 | Issues: 0

minsearch

Minimalistic text search engine that uses sklearn and pandas

Language: Jupyter Notebook | Stargazers: 13 | Issues: 0

llm-zoomcamp

LLM Zoomcamp - a free online course about building a Q&A system

Language: Jupyter Notebook | Stargazers: 2479 | Issues: 0

pytriton

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Language: Python | License: Apache-2.0 | Stargazers: 687 | Issues: 0

serve

Serve, optimize and scale PyTorch models in production

Language: Java | License: Apache-2.0 | Stargazers: 4066 | Issues: 0

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language: Python | License: Apache-2.0 | Stargazers: 601 | Issues: 0

DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Language: Jupyter Notebook | Stargazers: 13011 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 23042 | Issues: 0

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

License: MIT | Stargazers: 1361 | Issues: 0

AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

Language: TypeScript | License: GPL-3.0 | Stargazers: 30695 | Issues: 0

interpret

Fit interpretable models. Explain blackbox machine learning.

Language: C++ | License: MIT | Stargazers: 6147 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 1094 | Issues: 0

otdd

Optimal Transport Dataset Distance

Language: Python | License: MIT | Stargazers: 147 | Issues: 0

wimbd

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Language: Python | License: Apache-2.0 | Stargazers: 155 | Issues: 0

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language: Python | License: Apache-2.0 | Stargazers: 852 | Issues: 0

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 4225 | Issues: 0
Language: Python | License: MIT | Stargazers: 4055 | Issues: 0

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python | License: Apache-2.0 | Stargazers: 17801 | Issues: 0

social-media-profile-scrapers

Fetch a user's data across social media

Language: Python | License: Apache-2.0 | Stargazers: 422 | Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 9203 | Issues: 0

simpsons-scripts

Find out how much the Simpsons characters like each other with text and audio analysis.

Language: Python | License: Apache-2.0 | Stargazers: 24 | Issues: 0

scan

Score essays automatically with an easy web interface.

Language: Python | License: AGPL-3.0 | Stargazers: 41 | Issues: 0

classified

Score LLM pretraining data with classifiers

Language: Python | License: MIT | Stargazers: 39 | Issues: 0

scribe

Simple speech recognition using your microphone.

Language: Python | Stargazers: 123 | Issues: 0

apartment-finder

A Slack bot that helps you find an apartment.

Language: Python | License: MIT | Stargazers: 1060 | Issues: 0

texify

Math OCR model that outputs LaTeX and markdown

Language: Python | License: GPL-3.0 | Stargazers: 631 | Issues: 0

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language: Python | License: MIT | Stargazers: 467 | Issues: 0

pdftext

Extract structured text from PDFs quickly

Language: Python | License: Apache-2.0 | Stargazers: 248 | Issues: 0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 8584 | Issues: 0

marker

Convert PDF to markdown quickly with high accuracy

Language: Python | License: GPL-3.0 | Stargazers: 14422 | Issues: 0