shenfe

Shen's starred repositories

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

Apache-2.0724400

meta-prompting

Official implementation of paper "Meta Prompting for AGI Systems" (https://arxiv.org/abs/2311.11482)

Language:Python4900

ArXivQA

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

Language:Python25200

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.0708700

Awesome-LLM-KG

Awesome papers about unifying LLMs and KGs

171500

dl4math

Resources of deep learning for mathematical reasoning (DL4MATH).

MIT30800

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

MIT311600

SafeNLP

Safety Score for Pre-Trained Language Models

Language:PythonNOASSERTION9100

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonMIT1250700

facefusion

Next generation face swapper and enhancer

Language:PythonNOASSERTION1572500

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Language:Python125400

seqio

Task-based datasets, preprocessing, and evaluation for sequence models.

Language:PythonApache-2.053800

multipack_sampler

Multipack distributed sampler for fast padding-free training of LLMs

Language:PythonMIT15700

galai

Model API for GALACTICA

Language:Jupyter NotebookApache-2.0265800

GAP

[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization

Language:PythonApache-2.02600

olm-datasets

Pipeline for pulling and processing online language model pretraining data from the web

Language:PythonApache-2.016900

anything-llm

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

Language:JavaScriptMIT1570100

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

914800

mteb

MTEB: Massive Text Embedding Benchmark

Language:PythonApache-2.0153800

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language:PythonMIT1117000

Awasome-Pruning

Awasome Papers and Resources in Deep Neural Network Pruning with Source Code.

CC0-1.09200

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

167300

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonApache-2.01772100

python-searchengine

Language:PythonMIT34900

DecomP

Repository for Decomposed Prompting

Language:PythonApache-2.07900

AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

Language:PythonMIT249100