sensitiveanalyst's starred repositories

textgrad

Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Language:PythonLicense:MITStargazers:602Issues:0Issues:0

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Language:PythonLicense:Apache-2.0Stargazers:3049Issues:0Issues:0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:5442Issues:0Issues:0

LLM-Merging

LLM-Merging: Building LLMs Efficiently through Merging

Language:PythonStargazers:100Issues:0Issues:0

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonLicense:MITStargazers:17751Issues:0Issues:0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:11777Issues:0Issues:0

MACM

MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems

Language:PythonStargazers:33Issues:0Issues:0

AutoCoder

We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.

Language:PythonLicense:Apache-2.0Stargazers:723Issues:0Issues:0

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonStargazers:458Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:23Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:10615Issues:0Issues:0

Devon

Devon: An open-source pair programmer

Language:PythonLicense:AGPL-3.0Stargazers:2103Issues:0Issues:0

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8375Issues:0Issues:0

kanrl

Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments

Language:PythonStargazers:221Issues:0Issues:0
Language:PythonLicense:MITStargazers:13Issues:0Issues:0

Composio-Function-Calling-Benchmark

Function Calling Benchmark & Testing

Language:Jupyter NotebookLicense:MITStargazers:57Issues:0Issues:0

KAN

Implementation on how to use Kolmogorov-Arnold Networks (KANs) for classification and regression tasks.

Language:Jupyter NotebookStargazers:125Issues:0Issues:0

Deep-KAN

This repository contains a better implementation of Kolmogorov-Arnold networks

Language:Jupyter NotebookLicense:MITStargazers:56Issues:0Issues:0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License:MITStargazers:2709Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13415Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:20261Issues:0Issues:0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4422Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:21516Issues:0Issues:0

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language:PythonLicense:Apache-2.0Stargazers:1515Issues:0Issues:0
Language:PythonStargazers:94Issues:0Issues:0

merge-models

Merges two latent diffusion models at a user-defined ratio

Language:PythonStargazers:263Issues:0Issues:0

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:3927Issues:0Issues:0

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language:PythonLicense:Apache-2.0Stargazers:1062Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:29663Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33513Issues:0Issues:0