Shen Huang (huangshenno1)

huangshenno1

Geek Repo

Company:Alibaba

Location:China

Github PK Tool:Github PK Tool

Shen Huang's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLLicense:CC0-1.0Stargazers:112332Issues:1447Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:94388Issues:690Issues:7822

streamlit

Streamlit — A faster way to build and share data apps.

Language:PythonLicense:Apache-2.0Stargazers:35516Issues:320Issues:4619

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:29606Issues:242Issues:5127

one-api

OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.

Language:JavaScriptLicense:MITStargazers:18919Issues:105Issues:1467

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:13918Issues:104Issues:1051

hugo-PaperMod

A fast, clean, responsive Hugo theme.

Language:HTMLLicense:MITStargazers:9904Issues:42Issues:547

XAgent

An Autonomous LLM Agent for Complex Task Solving

Language:PythonLicense:Apache-2.0Stargazers:8126Issues:75Issues:305

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:7423Issues:46Issues:1046

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:6880Issues:38Issues:1129

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

dedupe

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Language:PythonLicense:MITStargazers:4137Issues:120Issues:811

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:4029Issues:24Issues:539

Qwen-Agent

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language:PythonLicense:NOASSERTIONStargazers:3398Issues:29Issues:362

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonLicense:Apache-2.0Stargazers:2195Issues:28Issues:141

lagent

A lightweight framework for building LLM-based agents

Language:PythonLicense:Apache-2.0Stargazers:1840Issues:18Issues:64

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1813Issues:17Issues:82

ceval

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Language:PythonLicense:MITStargazers:1630Issues:15Issues:83

sql-eval

Evaluate the accuracy of LLM generated outputs

Language:PythonLicense:Apache-2.0Stargazers:552Issues:9Issues:19
Language:PythonLicense:NOASSERTIONStargazers:274Issues:1Issues:22

T-Eval

[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step

Language:PythonLicense:Apache-2.0Stargazers:227Issues:3Issues:50

EcomGPT

An Instruction-tuned Large Language Model for E-commerce

SeqGPT

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

Language:PythonLicense:Apache-2.0Stargazers:210Issues:4Issues:14

awesome-llm-attributions

A Survey of Attributions for Large Language Models

tinyBenchmarks

Evaluating LLMs with fewer examples

Language:Jupyter NotebookLicense:MITStargazers:133Issues:3Issues:10

ToolTalk

Evaluating tool-augmented LLMs in conversation settings

Language:PythonLicense:MITStargazers:72Issues:4Issues:4

KCA

EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud

Language:PythonLicense:Apache-2.0Stargazers:19Issues:0Issues:0

CDQA

CDQA: Chinese Dynamic Question Answering Benchmark