Shuai Zeng's starred repositories

Language:PythonLicense:NOASSERTIONStargazers:456Issues:0Issues:0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonLicense:MITStargazers:28054Issues:0Issues:0

prot2token

This is the official repository of Prot2Token paper.

Language:PythonLicense:Apache-2.0Stargazers:18Issues:0Issues:0

MAPE-PPI

Code for ICLR 2024 (Spotlight) paper "MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding"

Language:PythonLicense:MITStargazers:180Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13489Issues:0Issues:0

Awesome-Graph-LLM

A collection of AWESOME things about Graph-Related LLMs.

License:MITStargazers:1425Issues:0Issues:0
Language:RStargazers:17Issues:0Issues:0

scBSP

scBSP is a specialized package designed for processing biological data, specifically in the analysis of gene expression and cell coordinates. It efficiently computes p-values for a given set of genes based on input matrices representing cell coordinates and gene expression data.

Language:PythonLicense:GPL-3.0Stargazers:22Issues:0Issues:0

ProLLaMA

A Protein Large Language Model for Multi-Task Protein Language Processing

Language:PythonLicense:Apache-2.0Stargazers:94Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1391Issues:0Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18367Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29075Issues:0Issues:0

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonLicense:Apache-2.0Stargazers:3908Issues:0Issues:0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookStargazers:10280Issues:0Issues:0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:25134Issues:0Issues:0

Instruction-Tuning-Papers

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

Stargazers:736Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:5798Issues:0Issues:0
Language:PythonLicense:MITStargazers:10Issues:0Issues:0

evo

DNA foundation modeling from molecular to genome scale

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:860Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:21709Issues:0Issues:0

gemma

Open weights LLM from Google DeepMind.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2182Issues:0Issues:0

S-PLM

S-PLM: Structure-aware Protein Language Model via Contrastive Learning between Sequence and Structure

Language:PythonLicense:MITStargazers:36Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7249Issues:0Issues:0

PMC-LLaMA

The official codes for "PMC-LLaMA: Towards Building Open-source Language Models for Medicine"

Language:PythonStargazers:548Issues:0Issues:0
Language:PythonLicense:MITStargazers:2Issues:0Issues:0

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

Stargazers:3976Issues:0Issues:0

MedQA

Code and data for MedQA

Language:PythonLicense:MITStargazers:177Issues:0Issues:0

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonLicense:NOASSERTIONStargazers:504Issues:0Issues:0