mzthhy's starred repositories

DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Language:PythonLicense:MITStargazers:3436Issues:0Issues:0

DataTager

Fine-Tune LLM Synthetic-Data application and "From Data to AGI: Unlocking the Secrets of Large Language Model"

Language:PythonLicense:GPL-3.0Stargazers:12Issues:0Issues:0

LLM4Chem

Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset"

Language:PythonLicense:MITStargazers:61Issues:0Issues:0

EasyInstruct

[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.

Language:PythonLicense:MITStargazers:357Issues:0Issues:0

Mol-Instructions

[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models

Language:PythonLicense:MITStargazers:234Issues:0Issues:0

Darwin

An open-source project dedicated to build foundational large language model for natural science, mainly in physics, chemistry and material science.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:187Issues:0Issues:0

SciCrawler

Web-Scarping tool for downloading the content of the following publishers: Elsevier, RSC, Web of Science, Springer Nature , Wiley.

Language:PythonLicense:MITStargazers:14Issues:0Issues:0

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8818Issues:0Issues:0

BLOOM-LORA

Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:184Issues:0Issues:0

bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Language:ShellLicense:NOASSERTIONStargazers:975Issues:0Issues:0

LegalQA-bloomz-560m

Finetuning a small BLOOMZ model (bloomz-560m) on a small dataset and with limited resources.

Language:Jupyter NotebookStargazers:17Issues:0Issues:0

TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Language:PythonLicense:NOASSERTIONStargazers:1019Issues:0Issues:0

TianGong-AI-Unstructure

TianGong-AI-Unstructure

Language:PythonLicense:MITStargazers:49Issues:0Issues:0

nltk_data

NLTK Data

Language:PythonStargazers:1444Issues:0Issues:0

KnowLM

An Open-sourced Knowledgable Large Language Model Framework.

Language:PythonLicense:MITStargazers:1209Issues:0Issues:0

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonLicense:NOASSERTIONStargazers:22300Issues:0Issues:0
Stargazers:2Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:23536Issues:0Issues:0

reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

Language:TypeScriptLicense:Apache-2.0Stargazers:6493Issues:0Issues:0

Awesome-RAG-Evaluation

The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.

License:MITStargazers:75Issues:0Issues:0

MarineGPT

The official implementation of MarineGPT

Language:PythonLicense:NOASSERTIONStargazers:24Issues:0Issues:0

multi-llm-chat

An application allowing for interaction with different LLM models. With the option to provide PDF, web and CSV links for context.

Language:PythonLicense:Apache-2.0Stargazers:15Issues:0Issues:0

HalluQA

Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:107Issues:0Issues:0

Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

Stargazers:687Issues:0Issues:0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language:PythonStargazers:10076Issues:0Issues:0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1757Issues:0Issues:0

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

Stargazers:9321Issues:0Issues:0

eRAG

Codes and packages for the paper titled Evaluating Retrieval Quality in Retrieval-Augmented Generation.

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language:PythonLicense:MITStargazers:1160Issues:0Issues:0

llm-continual-learning-survey

Continual Learning of Large Language Models: A Comprehensive Survey

Stargazers:215Issues:0Issues:0