nilutz's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:13051Issues:0Issues:0

gitta

Grammar Induction using a Template Tree Approach

Language:PythonStargazers:43Issues:0Issues:0

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonLicense:Apache-2.0Stargazers:2558Issues:0Issues:0

mamba.np

A pure NumPy implementation of Mamba.

Language:PythonLicense:MITStargazers:200Issues:0Issues:0

koheesio

Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.

Language:PythonLicense:Apache-2.0Stargazers:565Issues:0Issues:0

tantivy

Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

Language:RustLicense:MITStargazers:11304Issues:0Issues:0

contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Language:PythonLicense:MITStargazers:1183Issues:0Issues:0

chatgpt_corpus

ChatGPT-generated texts for automated ChatGPT detection

Language:PythonStargazers:2Issues:0Issues:0

dataherald

Interact with your SQL database, Natural Language to SQL using LLMs

Language:PythonLicense:Apache-2.0Stargazers:3200Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:10861Issues:0Issues:0

llama3.np

llama3.np is a pure NumPy implementation for Llama 3 model.

Language:PythonLicense:MITStargazers:924Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13586Issues:0Issues:0

drl-zh

Deep Reinforcement Learning: Zero to Hero!

Language:Jupyter NotebookLicense:MITStargazers:1956Issues:0Issues:0

magika

Detect file content types with deep learning

Language:PythonLicense:Apache-2.0Stargazers:7530Issues:0Issues:0

StarSpace

Learning embeddings for classification, retrieval and ranking.

Language:C++License:MITStargazers:3923Issues:0Issues:0

maxtext

A simple, performant and scalable Jax LLM!

Language:PythonLicense:Apache-2.0Stargazers:1360Issues:0Issues:0

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language:PythonLicense:Apache-2.0Stargazers:1533Issues:0Issues:0

zoekt

Fast trigram based code search

Language:GoLicense:Apache-2.0Stargazers:537Issues:0Issues:0

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1539Issues:0Issues:0

fasthx

FastAPI and HTMX, the right way.

Language:PythonLicense:MITStargazers:330Issues:0Issues:0

unstract

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

Language:PythonLicense:AGPL-3.0Stargazers:348Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:10711Issues:0Issues:0

pgprolog

PostgreSQL Prolog language handler

Language:RustLicense:BSD-3-ClauseStargazers:126Issues:0Issues:0

frigate

NVR with realtime local object detection for IP cameras

Language:PythonLicense:MITStargazers:15963Issues:0Issues:0

dora

DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.

Language:RustLicense:Apache-2.0Stargazers:1345Issues:0Issues:0

GLiNER

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Language:PythonLicense:Apache-2.0Stargazers:928Issues:0Issues:0

deptry

Find unused, missing and transitive dependencies in a Python project.

Language:PythonLicense:MITStargazers:799Issues:0Issues:0

Brick-by-Brick

Official repository of Brick-by-Brick, presented at NeurIPS-2021

Language:PythonLicense:MITStargazers:15Issues:0Issues:0

KataGo

GTP engine and self-play learning in Go

Language:C++License:NOASSERTIONStargazers:3333Issues:0Issues:0

based

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Language:PythonLicense:Apache-2.0Stargazers:185Issues:0Issues:0