sensitiveanalyst's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:25364Issues:218Issues:4096

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:25044Issues:277Issues:77

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonLicense:MITStargazers:18155Issues:205Issues:382

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:14272Issues:110Issues:338

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:13087Issues:96Issues:357

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:12738Issues:90Issues:16

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8651Issues:64Issues:204

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4494Issues:76Issues:88

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Language:PythonLicense:Apache-2.0Stargazers:4294Issues:31Issues:438

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Devon

Devon: An open-source pair programmer

Language:PythonLicense:AGPL-3.0Stargazers:3035Issues:32Issues:70

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language:PythonLicense:Apache-2.0Stargazers:1615Issues:18Issues:15

textgrad

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Language:PythonLicense:MITStargazers:1421Issues:19Issues:56

AutoCoder

We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.

Language:PythonLicense:Apache-2.0Stargazers:778Issues:14Issues:12

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Agentless

Agentless🐱: an agentless approach to automatically solve software development problems

Language:PythonLicense:MITStargazers:620Issues:6Issues:16

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonLicense:MITStargazers:609Issues:8Issues:59

gpu-alpha

High Quality Resources on GPU Programming/Architecture

merge-models

Merges two latent diffusion models at a user-defined ratio

kanrl

Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments

KAN

Implementation on how to use Kolmogorov-Arnold Networks (KANs) for classification and regression tasks.

Language:Jupyter NotebookStargazers:173Issues:3Issues:5

LLM-Merging

LLM-Merging: Building LLMs Efficiently through Merging

distillm

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Composio-Function-Calling-Benchmark

Function Calling Benchmark & Testing

Language:Jupyter NotebookLicense:MITStargazers:70Issues:7Issues:0

Deep-KAN

This repository contains a better implementation of Kolmogorov-Arnold networks

Language:Jupyter NotebookLicense:MITStargazers:59Issues:2Issues:0

MACM

MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:30Issues:9Issues:2