Shen (shenfe)

Company: ByteDance

Location: Beijing

Shen's starred repositories

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

License: Apache-2.0 | Stargazers: 7244 | Issues: 0

meta-prompting

Official implementation of paper "Meta Prompting for AGI Systems" (https://arxiv.org/abs/2311.11482)

Language: Python | Stargazers: 49 | Issues: 0

ArXivQA

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

Language: Python | Stargazers: 252 | Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7087 | Issues: 0

Awesome-LLM-KG

Awesome papers about unifying LLMs and KGs

Stargazers: 1715 | Issues: 0

dl4math

Resources of deep learning for mathematical reasoning (DL4MATH).

License: MIT | Stargazers: 308 | Issues: 0

MNBVC

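MNBVC (Massive Never-ending BT Vast Chinese corpus) is an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, and even text written in "Martian script" (火星文). It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts and dialogue lines, forum posts, wiki pages, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.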

License: MIT | Stargazers: 3116 | Issues: 0

SafeNLP

Safety Score for Pre-Trained Language Models

Language: Python | License: NOASSERTION | Stargazers: 91 | Issues: 0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 12507 | Issues: 0

facefusion

Next generation face swapper and enhancer

Language: Python | License: NOASSERTION | Stargazers: 15725 | Issues: 0

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Language: Python | Stargazers: 1254 | Issues: 0

seqio

Task-based datasets, preprocessing, and evaluation for sequence models.

Language: Python | License: Apache-2.0 | Stargazers: 538 | Issues: 0

multipack_sampler

Multipack distributed sampler for fast padding-free training of LLMs

Language: Python | License: MIT | Stargazers: 157 | Issues: 0

galai

Model API for GALACTICA

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2658 | Issues: 0

Language: Jupyter Notebook | Stargazers: 34 | Issues: 0

GAP

[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization

Language: Python | License: Apache-2.0 | Stargazers: 26 | Issues: 0

olm-datasets

Pipeline for pulling and processing online language model pretraining data from the web

Language: Python | License: Apache-2.0 | Stargazers: 169 | Issues: 0

anything-llm

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

Language: JavaScript | License: MIT | Stargazers: 15701 | Issues: 0

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

Stargazers: 9148 | Issues: 0

mteb

MTEB: Massive Text Embedding Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 1538 | Issues: 0

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language: Python | License: MIT | Stargazers: 11170 | Issues: 0

Awasome-Pruning

Awesome papers and resources on deep neural network pruning, with source code.

License: CC0-1.0 | Stargazers: 92 | Issues: 0

awesome-model-quantization

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and the project is continuously being improved. PRs adding works (papers, repositories) missing from the repo are welcome.

Stargazers: 1673 | Issues: 0

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Language: Python | License: Apache-2.0 | Stargazers: 17721 | Issues: 0

Language: Python | License: MIT | Stargazers: 349 | Issues: 0

DecomP

Repository for Decomposed Prompting

Language: Python | License: Apache-2.0 | Stargazers: 79 | Issues: 0

AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

Language: Python | License: MIT | Stargazers: 2491 | Issues: 0

knowledge-graph-from-GPT

Using GPT to organize and access information, and to generate questions. The long-term goal is to build an agent-like research assistant.

Language: Jupyter Notebook | License: MIT | Stargazers: 602 | Issues: 0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python | License: Apache-2.0 | Stargazers: 1197 | Issues: 0

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language: Jupyter Notebook | License: MIT | Stargazers: 2412 | Issues: 0