hugq (Daemon-ser)

Location: Hefei

hugq's starred repositories

WildBench

Benchmarking LLMs with Challenging Tasks from Real Users

Language: Python | License: Apache-2.0 | Stargazers: 161 | Issues: 0
Language: Python | License: MIT | Stargazers: 1440 | Issues: 0
Language: Python | License: MIT | Stargazers: 34 | Issues: 0

llm_interview_note

Notes covering the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

Language: HTML | Stargazers: 1782 | Issues: 0

MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Language: Jupyter Notebook | Stargazers: 304 | Issues: 0

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language: Python | License: MIT | Stargazers: 1101 | Issues: 0

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

License: Apache-2.0 | Stargazers: 759 | Issues: 0

math

The MATH Dataset (NeurIPS 2021)

Language: Python | License: MIT | Stargazers: 789 | Issues: 0

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

License: MIT | Stargazers: 1593 | Issues: 0

LiveBench

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Language: Python | License: NOASSERTION | Stargazers: 160 | Issues: 0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 28383 | Issues: 0

Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

Stargazers: 659 | Issues: 0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | Stargazers: 6672 | Issues: 0

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language: Python | License: Apache-2.0 | Stargazers: 1809 | Issues: 0

awesome-auto-alignment

Collection of papers for scalable automated alignment.

Stargazers: 39 | Issues: 0

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Language: Python | Stargazers: 157 | Issues: 0

DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Language: Python | Stargazers: 384 | Issues: 0

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language: Makefile | License: MIT | Stargazers: 1353 | Issues: 0

llms_tool

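A training and testing tool for large language models built on HuggingFace. It supports web UI and terminal inference for each model, parameter-efficient and full-parameter training (pre-training, SFT, RM, PPO, DPO), and model merging and quantization.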

Language: Python | License: Apache-2.0 | Stargazers: 194 | Issues: 0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | Stargazers: 1923 | Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 1908 | Issues: 0

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

License: MIT | Stargazers: 684 | Issues: 0

AlignBench

A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)

Language: Python | Stargazers: 273 | Issues: 0
Language: Python | Stargazers: 864 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34127 | Issues: 0

chatbot-ui

AI chat for every model.

Language: TypeScript | License: MIT | Stargazers: 27718 | Issues: 0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4411 | Issues: 0

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Language: Python | License: MIT | Stargazers: 177 | Issues: 0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++ | License: Apache-2.0 | Stargazers: 9912 | Issues: 0

sd-webui-EasyPhoto

📷 EasyPhoto | Your Smart AI Photo Generator.

Language: Python | License: Apache-2.0 | Stargazers: 4840 | Issues: 0