Yingfei(Jeremy) Xiang's repositories
AttnPacker
Code and Pre-Trained Models for "AttnPacker: An end-to-end deep learning method for protein side-chain packing"
ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs)
efficient-evolution
Efficient evolution from protein language models
Fengshenbang-LM
Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at IDEA, serving as infrastructure for Chinese AIGC and cognitive intelligence.
how-to-train-tokenizer
How to train an LLM tokenizer
long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
mixture-of-experts
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
PaddleFleetX
Paddle Distributed Training Examples (飞桨分布式训练示例): ResNet, BERT, GPT, MoE; DataParallel, ModelParallel, PipelineParallel, HybridParallel, AutoParallel, ZeRO Sharding, Recompute, GradientMerge, Offload, AMP, DGC, LocalSGD, Wide&Deep
RFdiffusion
Code for running RFdiffusion
sentencepiece_chinese_bpe
Train a Chinese vocabulary with SentencePiece BPE and use it with transformers.
sk-iterative-planner
Iterative Planner for Semantic Kernel
Stable-Alignment
Multi-agent social simulation and an efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
mesh
Mesh TensorFlow: Model Parallelism Made Easier
MT-LLaMA
Multi-Task instruction-tuned LLaMA
PdfGptIndexer
An efficient tool for indexing and searching PDF text data using the OpenAI API and a FAISS (Facebook AI Similarity Search) index, designed for rapid and accurate information retrieval.
PLSC
Paddle Large Scale Classification Tools; supports ArcFace, CosFace, PartialFC, and Data Parallel + Model Parallel. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, and CAE.