Maybewuss's starred repositories

openai-cookbook

Examples and guides for using the OpenAI API

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:18536Issues:116Issues:518

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:13072Issues:98Issues:1039

Best-websites-a-programmer-should-visit-zh

程序员应该访问的最佳网站中文版

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8652Issues:64Issues:204

XAgent

An Autonomous LLM Agent for Complex Task Solving

Language:PythonLicense:Apache-2.0Stargazers:7988Issues:75Issues:305

chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

Language:HTMLLicense:MITStargazers:7899Issues:88Issues:9

instructor

structured outputs for llms

Language:PythonLicense:MITStargazers:7248Issues:51Issues:271

WechatExporter

Wechat Chat History Exporter 微信聊天记录导出备份程序

Language:C++License:GPL-2.0Stargazers:6086Issues:44Issues:174

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3600Issues:23Issues:473

presidio

Context aware, pluggable and customizable data protection and de-identification SDK for text and images

Language:PythonLicense:MITStargazers:3560Issues:67Issues:404

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Language:PythonLicense:Apache-2.0Stargazers:3356Issues:30Issues:355

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language:PythonLicense:NOASSERTIONStargazers:2946Issues:30Issues:296

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Language:PythonLicense:Apache-2.0Stargazers:2868Issues:35Issues:37

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Language:PythonLicense:Apache-2.0Stargazers:2547Issues:36Issues:197

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonLicense:Apache-2.0Stargazers:2091Issues:29Issues:137

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileLicense:MITStargazers:1372Issues:24Issues:32

AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

cc_net

Tools to download and cleanup Common Crawl data

Language:PythonLicense:MITStargazers:950Issues:24Issues:44

llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

chatgpt-corpus

ChatGPT 中文语料库 对话语料 小说语料 客服语料 用于训练大模型

z-bench

Z-Bench 1.0 by 真格基金:一个麻瓜的大语言模型中文测试集。Z-Bench is a LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team in Zhenfund.

h2o-wizardlm

Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning

Language:PythonLicense:Apache-2.0Stargazers:292Issues:66Issues:3

byzer-llm

Easy, fast, and cheap pretrain,finetune, serving for everyone

Language:PythonLicense:Apache-2.0Stargazers:234Issues:4Issues:12

LLMBar

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Language:PythonLicense:MITStargazers:102Issues:6Issues:4

CELLO

Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)

CCL2022-DQAB

CCL2022 领域问答库构建测评

WikiHowQAExtractor-mnbvc

Extract Chinese/English QA Data from WikiHow pages.

Language:PythonLicense:MITStargazers:14Issues:1Issues:0