whaleloops's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:32530Issues:325Issues:2492

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:28743Issues:336Issues:266

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9360Issues:84Issues:240

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8008Issues:72Issues:857

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:5641Issues:72Issues:516

llm-foundry

LLM training code for Databricks foundation models

Language:PythonLicense:Apache-2.0Stargazers:3691Issues:45Issues:347

ChatGLM-Tuning

基于ChatGLM-6B + LoRA的Fintune方案

Language:PythonLicense:MITStargazers:3656Issues:31Issues:247

semantra

Multi-tool for semantic search

Language:PythonLicense:MITStargazers:2264Issues:32Issues:56

graph4nlp

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html) for various learning resources!

Language:PythonLicense:Apache-2.0Stargazers:1654Issues:30Issues:170

awesome-information-retrieval

A curated list of awesome information retrieval resources

HuatuoGPT

HuatuoGPT, Towards Taming Language Models To Be a Doctor. (An Open Medical GPT)

Language:PythonLicense:Apache-2.0Stargazers:921Issues:19Issues:48

GraphWriter

Code for "Text Generation from Knowledge Graphs with Graph Transformers"

OpenGPT

A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:312Issues:8Issues:6

cumulative-reasoning

Official implementation of BGPT @ ICLR 2024 paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)

MixPoet

Source codes of MixPoet: Diverse Poetry Generation via Learning Controllable Mixed Latent Space (AAAI 2020)

MindMap

MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models

Awesome-medical-coding-NLP

A collection of papers on automated medical coding from free-texts

NYUTron

public code repository for paper "Health system scale language models are general purpose clinical prediction engines"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:94Issues:5Issues:1

EventStreamGPT

Dataset and modelling infrastructure for modelling "event streams": sequences of continuous time, multivariate events with complex internal dependencies.

Language:Jupyter NotebookLicense:MITStargazers:83Issues:5Issues:39

umassthesis

Unofficial UMass thesis style files for use with LaTeX

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:46Issues:4Issues:1

KEPT

auto icd coding with prompt

Language:Jupyter NotebookLicense:MITStargazers:42Issues:4Issues:8

interpoetry

Interpoetry: Generating Classical Chinese Poems from Vernacular Chinese.

Language:PythonLicense:NOASSERTIONStargazers:38Issues:5Issues:4

SequenceLabelingWithMultiTaskLEarning

Multi Task Learning for sequence labeling tasks (e.g. NER).

Language:PythonStargazers:8Issues:0Issues:0
Language:PythonStargazers:6Issues:0Issues:0

Dr.NoteAid

ACL Workshop 2023

Language:PythonStargazers:5Issues:0Issues:0

long-biomedical-model

How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling

Language:PythonLicense:Apache-2.0Stargazers:3Issues:5Issues:0

ehr_section_prediction

EMNLP paper release for EHR section prediction

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0