Bryan Romas's starred repositories

Language: TypeScript | License: MIT | Stargazers: 6771 | Issues: 0

NeMo-Curator

Scalable data preprocessing and curation toolkit for LLMs

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 475 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 8 | Issues: 0

brickflow

Pythonic Programming Framework to orchestrate jobs in Databricks Workflow

Language: Python | License: Apache-2.0 | Stargazers: 186 | Issues: 0

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | Stargazers: 18763 | Issues: 0

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language: Python | License: Apache-2.0 | Stargazers: 5306 | Issues: 0

autolabel

Label, clean and enrich text datasets with LLMs.

Language: Python | License: MIT | Stargazers: 2027 | Issues: 0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 17273 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 27476 | Issues: 0

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | Stargazers: 8333 | Issues: 0

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Language: Go | License: MIT | Stargazers: 90984 | Issues: 0

litellm

Python SDK and Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format, including Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, and Groq

Language: Python | License: NOASSERTION | Stargazers: 12478 | Issues: 0

opentelemetry-python-contrib

OpenTelemetry instrumentation for Python modules

Language: Python | License: Apache-2.0 | Stargazers: 702 | Issues: 0

phoenix

AI Observability & Evaluation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 3516 | Issues: 0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language: Python | License: Apache-2.0 | Stargazers: 10812 | Issues: 0

human-learn

Natural Intelligence is still a pretty good idea.

Language: Jupyter Notebook | License: MIT | Stargazers: 792 | Issues: 0

bulk

A Simple Bulk Labelling Tool

Language: Python | License: MIT | Stargazers: 538 | Issues: 0

tpot

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Language: Python | License: LGPL-3.0 | Stargazers: 9679 | Issues: 0

bricks

Open-source natural language enrichments at your fingertips.

Language: Python | License: Apache-2.0 | Stargazers: 447 | Issues: 0

ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Language: Python | License: Apache-2.0 | Stargazers: 11101 | Issues: 0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python | License: Apache-2.0 | Stargazers: 36971 | Issues: 0

d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Language: Python | License: NOASSERTION | Stargazers: 23290 | Issues: 0

scrubadub_spacy

Clean personally identifiable information from dirty dirty text using spaCy.

Language: Python | License: Apache-2.0 | Stargazers: 40 | Issues: 0

project-menu

See the issue board for the current status of active and prospective projects!

Stargazers: 65 | Issues: 0

statcheck

A spellchecker for statistics

Language: R | Stargazers: 174 | Issues: 0

Data-Whisperer

An NLP text-to-visualization builder for Tableau.

Language: Python | Stargazers: 15 | Issues: 0

ecco

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 1970 | Issues: 0

Palmetto

Palmetto is a quality-measuring tool for topics

Language: Java | License: AGPL-3.0 | Stargazers: 213 | Issues: 0

dejavu

Audio fingerprinting and recognition in Python

Language: Python | License: MIT | Stargazers: 6405 | Issues: 0

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language: Python | License: MIT | Stargazers: 6023 | Issues: 0