Yingfei (Jeremy) Xiang (SuperXiang)

User data from GitHub: https://github.com/SuperXiang

Company: Sangfor

Location: Shenzhen, Guangdong, China

Home Page: https://scholar.google.com/citations?user=7n2td58AAAAJ

GitHub: @SuperXiang

Twitter: @YingfeiX

Yingfei (Jeremy) Xiang's repositories

AttnPacker

Code and Pre-Trained Models for "AttnPacker: An end-to-end deep learning method for protein side-chain packing"

Language: Python | Stargazers: 1 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0

ContinualLM

An Extensible Continual Learning Framework Focused on Language Models (LMs)

Language: Python | Stargazers: 1 | Issues: 0

DIFFormer

The official implementation of the ICLR 2023 spotlight paper "DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion"

Language: Python | Stargazers: 1 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0

efficient-evolution

Efficient evolution from protein language models

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

Fengshenbang-LM

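Fengshenbang-LM (封神榜大模型) is an open-source system of large models led by the Center for Cognitive Computing and Natural Language at the IDEA research institute, intended as infrastructure for Chinese AIGC and cognitive intelligence.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0
Stargazers: 1 | Issues: 0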

how-to-train-tokenizer

How to train an LLM tokenizer

Language: Python | Stargazers: 1 | Issues: 0

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1 | Issues: 0

mdtraj

An open library for the analysis of molecular dynamics trajectories

Language: C | License: LGPL-2.1 | Stargazers: 1 | Issues: 0

mixture-of-experts

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 0

mlmc

Code for fine-tuning transformers (XLNet, BERT, and GPT-2) on binary, multi-class, and multi-label sequence classification tasks.

Language: Python | Stargazers: 1 | Issues: 0

MULTICOM3

A software system for predicting protein tertiary and quaternary structures, prepared for CASP15 by BMLab.

Language: Python | Stargazers: 1 | Issues: 0

multipack_sampler

Multipack distributed sampler for fast padding-free training of LLMs

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

PaddleFleetX

Paddle (飞桨) distributed training examples: ResNet, BERT, GPT, MoE, DataParallel, ModelParallel, PipelineParallel, HybridParallel, AutoParallel, Zero, Sharding, Recompute, GradientMerge, Offload, AMP, DGC, LocalSGD, Wide&Deep

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0

RFdiffusion

Code for running RFdiffusion

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 1 | Issues: 0

SecBERT

Pretrained BERT model for cybersecurity text that has learned cybersecurity knowledge

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

sentencepiece_chinese_bpe

Train a Chinese vocabulary with the BPE algorithm in sentencepiece and use it in transformers.

Language: Python | Stargazers: 1 | Issues: 0

sk-iterative-planner

Iterative Planner for Semantic Kernel

Language: C# | License: MIT | Stargazers: 1 | Issues: 0

SpeedPPI

Rapid protein-protein interaction network creation from multiple sequence alignments with Deep Learning

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0

Stable-Alignment

Multi-agent social simulation plus an efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0

VardaGPT

Associative memory-enhanced GPT-2 model

Language: Python | Stargazers: 1 | Issues: 0

mesh

Mesh TensorFlow: Model Parallelism Made Easier

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

MT-LLaMA

Multi-Task instruction-tuned LLaMA

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

PdfGptIndexer

An efficient tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for rapid information retrieval and superior search accuracy.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

PLSC

Paddle Large Scale Classification Tools. Supports ArcFace, CosFace, PartialFC, and Data Parallel + Model Parallel. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, and CAE.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0