Satheesh K's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:63720Issues:0Issues:0

Python

All Algorithms implemented in Python

Language:PythonLicense:MITStargazers:183369Issues:0Issues:0

Plastic-Bottles-Dataset

A dataset of 5,592 plastic bottles swimming in rivers and some attempts to build a model on that.

Stargazers:22Issues:0Issues:0

optical

A collection of utilities related to various computer vision tasks

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Language:PythonLicense:Apache-2.0Stargazers:3331Issues:0Issues:0

semantic-kernel

Integrate cutting-edge LLM technology quickly and easily into your apps

Language:C#License:MITStargazers:21042Issues:0Issues:0

TransformerPrograms

[NeurIPS 2023] Learning Transformer Programs

Language:PythonStargazers:154Issues:0Issues:0

chatnoir-resiliparse

A robust web archive analytics toolkit

Language:CythonLicense:Apache-2.0Stargazers:68Issues:0Issues:0

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonLicense:MITStargazers:22480Issues:0Issues:0

JointBERT

Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"

Language:PythonLicense:Apache-2.0Stargazers:640Issues:0Issues:0

kgi-slot-filling

This is the code for our KILT leaderboard submissions (KGI + Re2G models).

Language:PythonLicense:Apache-2.0Stargazers:143Issues:0Issues:0

pyllms

Minimal Python library to connect to LLMs (OpenAI, Anthropic, Google, Groq, Reka, Together, AI21, Cohere, Aleph Alpha, HuggingfaceHub), with a built-in model performance benchmark.

Language:PythonLicense:MITStargazers:698Issues:0Issues:0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:15851Issues:0Issues:0

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

Stargazers:28345Issues:0Issues:0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4490Issues:0Issues:0

cleantext

An open-source package for python to clean raw text data

Language:PythonLicense:MITStargazers:67Issues:0Issues:0

python-xz

Pure Python implementation of the XZ file format with random access support

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

jusText

Heuristic based boilerplate removal tool

Language:PythonLicense:BSD-2-ClauseStargazers:712Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:34534Issues:0Issues:0

chat-langchain

Quarto version of chat-langchain

Language:PythonStargazers:41Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:1658Issues:0Issues:0

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:15183Issues:0Issues:0

empirical-philosophy

A collection of empirical experiments using large language models and other neural network architectures to test the usefulness of metaphysical constructs.

Language:TypeScriptStargazers:142Issues:0Issues:0

HDC_TUBerlin_version_1

This is the submission of the TU Berlin Team to the Helsinki Deblur Challenge 2021.

Language:PythonStargazers:16Issues:0Issues:0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:8094Issues:0Issues:0

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:34527Issues:0Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:26155Issues:0Issues:0

gen-invoice

Template-based invoice generator

Language:PythonLicense:MITStargazers:6Issues:0Issues:0
Language:CLicense:NOASSERTIONStargazers:9328Issues:0Issues:0
License:MITStargazers:236Issues:0Issues:0