Shen Huang (huangshenno1)

Company: Alibaba

Location: China

Shen Huang's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language: HTML | License: CC0-1.0 | Stargazers: 110891 | Issues: 1430 | Issues: 0

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | Stargazers: 92573 | Issues: 680 | Issues: 7604

streamlit

Streamlit — A faster way to build and share data apps.

Language: Python | License: Apache-2.0 | Stargazers: 34694 | Issues: 318 | Issues: 4518

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 27158 | Issues: 226 | Issues: 4528

one-api

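An OpenAI API management and distribution system that supports Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (文心一言), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to redistribute and manage keys, ships as a single executable with a prebuilt Docker image, and deploys in one click, ready to use out of the box. It exposes a single API for all LLMs and features an English UI.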

Language: JavaScript | License: MIT | Stargazers: 18026 | Issues: 99 | Issues: 1403

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 13443 | Issues: 100 | Issues: 1041

hugo-PaperMod

A fast, clean, responsive Hugo theme.

Language: HTML | License: MIT | Stargazers: 9644 | Issues: 42 | Issues: 537

XAgent

An Autonomous LLM Agent for Complex Task Solving

Language: Python | License: Apache-2.0 | Stargazers: 8045 | Issues: 75 | Issues: 305

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | License: MIT | Stargazers: 6889 | Issues: 43 | Issues: 984

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 6483 | Issues: 36 | Issues: 1065

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

dedupe

A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Language: Python | License: MIT | Stargazers: 4101 | Issues: 120 | Issues: 810

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python | License: Apache-2.0 | Stargazers: 3786 | Issues: 23 | Issues: 510

Qwen-Agent

Agent framework and applications built upon Qwen2.x, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python | License: NOASSERTION | Stargazers: 3167 | Issues: 30 | Issues: 339

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language: Python | License: Apache-2.0 | Stargazers: 2128 | Issues: 29 | Issues: 138

lagent

A lightweight framework for building LLM-based agents

Language: Python | License: Apache-2.0 | Stargazers: 1755 | Issues: 17 | Issues: 62

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language: Python | License: MIT | Stargazers: 1751 | Issues: 18 | Issues: 79

ceval

Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Language: Python | License: MIT | Stargazers: 1609 | Issues: 15 | Issues: 81

sql-eval

Evaluate the accuracy of LLM generated outputs

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 533 | Issues: 9 | Issues: 18
Language: Python | License: NOASSERTION | Stargazers: 262 | Issues: 1 | Issues: 22

EcomGPT

An Instruction-tuned Large Language Model for E-commerce

T-Eval

[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step

Language: Python | License: Apache-2.0 | Stargazers: 210 | Issues: 3 | Issues: 49

SeqGPT

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

Language: Python | License: Apache-2.0 | Stargazers: 207 | Issues: 4 | Issues: 14

awesome-llm-attributions

A Survey of Attributions for Large Language Models

tinyBenchmarks

Evaluating LLMs with fewer examples

Language: Jupyter Notebook | License: MIT | Stargazers: 131 | Issues: 3 | Issues: 9

ToolTalk

Evaluating tool-augmented LLMs in conversation settings

Language: Python | License: MIT | Stargazers: 72 | Issues: 4 | Issues: 4

KCA

EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud

Language: Python | License: Apache-2.0 | Stargazers: 18 | Issues: 0 | Issues: 0

CDQA

CDQA: Chinese Dynamic Question Answering Benchmark