EagleW

Qingyun Wang's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | 33957 341 2650

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | 14662 129 605

MemGPT

Create LLM agents with long-term memory and custom tools 📚🦙

Language: Python | License: Apache-2.0 | 10888 112 663

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps showcase Llama2 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | 7850 68 227

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language: Python | License: MIT | 3278 33 366

Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Language: Jupyter Notebook | License: Apache-2.0 | 3132 47 67

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language: Python | License: MIT | 3023 65 318

meditron

Meditron is a suite of open-source medical Large Language Models (LLMs).

Language: Python | License: Apache-2.0 | 1779 30 29

ProtTrans

ProtTrans provides state-of-the-art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit and hundreds of Google TPUs using Transformer models.

Language: Jupyter Notebook | License: AFL-3.0 | 1061 32 145

lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language: Python | License: Apache-2.0 | 1031 12 77

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language: Python | License: Apache-2.0 | 857 17 66

KnowledgeEditingPapers

[Knowledge Editing] Must-read Papers on Knowledge Editing for Large Language Models.

License: MIT | 735 23 6

progen

Official release of the ProGen models

Language: Python | License: BSD-3-Clause | 590 18 43

Megatron-LLM

distributed trainer for LLMs

Language: Python | License: NOASSERTION | 503 18 57

gt4sd-core

GT4SD, an open-source library to accelerate hypothesis generation in the scientific discovery process.

Language: Jupyter Notebook | License: MIT | 325 17 99

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language: Python | License: MIT | 284 10 13

hetionet

Hetionet: an integrative network of disease

Language: HTML | 248 14 46

MetaICL

An original implementation of "MetaICL: Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Language: Python | License: NOASSERTION | 245 10 21

enzynet

EnzyNet: enzyme classification using 3D convolutional neural networks on spatial representation

Language: Python | License: MIT | 199 15 15

tart

Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.

Language: Python | License: NOASSERTION | 156 8 12

gpqa

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Language: Jupyter Notebook | License: MIT | 117 4 12

Generative_KG_Construction_Papers

[EMNLP 2022] Generative Knowledge Graph Construction: A Review

License: MIT | 101 60

SPECTER2

Language: Python | License: Apache-2.0 | 71 5 4

group-selfies

Language: Jupyter Notebook | License: Apache-2.0 | 48 6 6

CODA-19

This is the Github repo of "CODA-19: Using a Non-Expert Crowd to Annotate Research Aspects on 10,000+ Abstracts in the COVID-19 Open Research Dataset" (https://arxiv.org/abs/2005.02367)

Language: Python | 36 50

csfaculty.github.io

Interview questions for Computer Science faculty jobs

Language: CSS | 34 2 1

enzyme-datasets

Enzyme datasets used to benchmark enzyme-substrate promiscuity models

Language: Python | 28 50

BioREx

Language: Python | 23 12 5

Megatron-LLM

distributed trainer for LLMs

Language: Python | License: NOASSERTION | 300

rufes

Scripts for supporting TAC KBP Recognizing Ultra Fine-grained EntitieS Task (RUFES)

Language: Python | 200