nzjin

nzjin

Geek Repo

Github PK Tool:Github PK Tool

nzjin's starred repositories

LoRAMoE

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Language:PythonStargazers:179Issues:0Issues:0

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language:PythonLicense:MITStargazers:946Issues:0Issues:0

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Language:PythonStargazers:1328Issues:0Issues:0

AdaMix

This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.12410).

Language:PythonLicense:Apache-2.0Stargazers:121Issues:0Issues:0

EMoE

Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]

Language:PythonLicense:MITStargazers:21Issues:0Issues:0
Language:PythonStargazers:237Issues:0Issues:0

Switch-NeRF

Codes for Switch-NeRF (ICLR 2023)

Language:PythonLicense:MITStargazers:194Issues:0Issues:0

awesome_moe

The collections of MOE (Mixture Of Expert) papers, code and tools, etc.

Stargazers:11Issues:0Issues:0

ODTQA

The data and processing code of the paper "Enhancing Open-Domain Table Question Answering via Syntax- and Structure-aware Dense Retrieval"

Language:PythonStargazers:3Issues:0Issues:0

Chinese-Mixtral-8x7B

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

Language:PythonLicense:Apache-2.0Stargazers:635Issues:0Issues:0

mixture-of-experts

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Language:PythonLicense:MITStargazers:588Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:12983Issues:0Issues:0

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language:PythonLicense:Apache-2.0Stargazers:826Issues:0Issues:0
Language:Jupyter NotebookStargazers:92Issues:0Issues:0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:29137Issues:0Issues:0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language:PythonStargazers:9829Issues:0Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:14232Issues:0Issues:0

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Stargazers:5970Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:297Issues:0Issues:0

Open_WikiTable

Open-WikiTable :Dataset for Open Domain Question Answering with Complex Reasoning over Table

Language:PythonLicense:CC-BY-4.0Stargazers:16Issues:0Issues:0

GTR

[SIGIR 2021] Retrieving Complex Tables with Multi-Granular Graph Representation Learning.

Language:PythonLicense:Apache-2.0Stargazers:42Issues:0Issues:0
Language:PythonStargazers:8Issues:0Issues:0

trec_eval

Evaluation software used in the Text Retrieval Conference

Language:CStargazers:230Issues:0Issues:0

TestSuiteEval

"Semantic Evaluation for Text-to-SQL with Distilled Test Suite", EMNLP2020

Language:PythonStargazers:31Issues:0Issues:0

RESDSQL

The Pytorch implementation of RESDSQL (AAAI 2023).

Language:PythonLicense:MITStargazers:233Issues:0Issues:0

rasat

The official implementation of the paper "RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL"(EMNLP 2022)

Language:PythonLicense:Apache-2.0Stargazers:63Issues:0Issues:0