sirus20x6's starred repositories

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30032Issues:425Issues:4178

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:29759Issues:363Issues:1562

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19383Issues:298Issues:1344

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:15852Issues:132Issues:666

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:15184Issues:131Issues:3459

instructor

structured outputs for llms

Language:PythonLicense:MITStargazers:7176Issues:51Issues:270

introtodeeplearning

Lab Materials for MIT 6.S191: Introduction to Deep Learning

Language:Jupyter NotebookLicense:MITStargazers:7153Issues:293Issues:99

skywater-pdk

Open source process design kit for usage with SkyWater Technology Foundry's 130nm node.

Language:PythonLicense:Apache-2.0Stargazers:2930Issues:152Issues:269

Devon

Devon: An open-source pair programmer

Language:PythonLicense:AGPL-3.0Stargazers:2920Issues:30Issues:70
Language:PythonLicense:Apache-2.0Stargazers:2615Issues:33Issues:30

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language:PythonLicense:Apache-2.0Stargazers:2562Issues:24Issues:162

langroid

Harness LLMs with Multi-Agent Programming

Language:PythonLicense:MITStargazers:2170Issues:17Issues:151

jailbreak_llms

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Language:Jupyter NotebookLicense:MITStargazers:1849Issues:25Issues:7

HippoRAG

HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.

Language:PythonLicense:MITStargazers:1168Issues:14Issues:24

burr

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

Language:PythonLicense:BSD-3-Clause-ClearStargazers:1033Issues:8Issues:84

llama3.np

llama3.np is a pure NumPy implementation for Llama 3 model.

Language:PythonLicense:MITStargazers:946Issues:13Issues:4

YaFSDP

YaFSDP: Yet another Fully Sharded Data Parallel

Language:PythonLicense:Apache-2.0Stargazers:811Issues:15Issues:3

augmentoolkit

Convert Compute And Books Into Instruct-Tuning Datasets (or classifiers)!

Language:PythonLicense:MITStargazers:745Issues:17Issues:32

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonLicense:Apache-2.0Stargazers:413Issues:33Issues:5

omnichain

Efficient visual programming for AI language models

Language:TypeScriptLicense:MITStargazers:258Issues:7Issues:5

llama-zip

LLM-powered lossless compression tool

Language:PythonLicense:NOASSERTIONStargazers:237Issues:6Issues:16

autogenstudio-skills

Repo of skills for autogenstudio

Language:PythonLicense:MITStargazers:205Issues:14Issues:4

Yuan2.0-M32

Mixture-of-Experts (MoE) Language Model

Language:PythonLicense:Apache-2.0Stargazers:173Issues:3Issues:7

Mitten

Infinite canvas drawing application.

Language:C#License:MITStargazers:81Issues:3Issues:12

Note

Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch

Language:PythonLicense:Apache-2.0Stargazers:52Issues:6Issues:1

autogen_skills

python skills for autogen

Language:PythonLicense:MITStargazers:22Issues:0Issues:0

caravel_fulgor_opamp

Test Chip General Purpose OpAmp using Skywater SKY130 PDK

Language:VerilogLicense:Apache-2.0Stargazers:17Issues:4Issues:0

blaz

Blaz: a library for frugal matrix computations. Blaz provides compression/uncompression functions for matrices of floating-point numbers and makes it possible to perform basic linear algebra on the compressed matrices, without uncompressing them.