THUDM / AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

https://thudm.github.io/AgentTuning/

THUDM/AgentTuning Issues

Can you open source the unfiltered dataset
Updated 2 months ago
Instructions and model behavior do not match in the training data
Updated 3 months ago
Local model
Closed 4 months ago
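Where can I find the database-related training data used in this work?
Closed 5 months ago, 1 comment
Is the weight decay really 0.1?
Closed 5 months ago, 1 comment
The conversation fields of the AgentInstruct dataset on ModelScope are all empty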
Updated 5 months ago
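Inference errors when deploying with FastChat
Updated 6 months ago, 3 comments
The HotpotQA test script does not seem to run?
Updated 6 months ago, 1 comment
How is the training data sampled?
Closed 6 months ago, 3 comments
How is the general-domain data filtered?
Updated 6 months ago, 7 comments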
Can you point to the ShareGPT filtered/cleaned data used?
Closed 6 months ago, 1 comment
Is it possible to conduct RLHF from env?
Updated 6 months ago, 1 comment
Can I run AgentInstruct data on the AgentBench?
Updated 7 months ago, 1 comment
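Could you provide a simpler tool-calling example?
Updated 7 months ago, 1 comment
Looking forward to a model trained on Qwen72B.
Closed 7 months ago, 1 comment
Is there any way to run AgentLM other than with Docker?
Closed 8 months ago, 6 comments
Question about trajectory filtering
Closed 8 months ago, 3 comments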
AgentTuning 7B evaluated on HH does not match the paper's results
Updated 8 months ago, 13 comments
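What is the minimum GPU memory needed to run inference with agentlm-7b?
Closed 8 months ago, 5 comments
Question about the loss function E in the paper
Closed 9 months ago, 7 comments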
Adding Contributors Section in readme.md file.
Closed 8 months ago
Finetuning with Mistral or Yi?
Closed 8 months ago, 1 comment
About the dataset
Closed 9 months ago
Number of training steps
Closed 8 months ago, 1 comment
Cannot find how the reward is computed in Dataset details
Closed 8 months ago, 5 comments
About dataset statistics and download
Closed 9 months ago, 3 comments
About the reward
Closed 9 months ago, 2 comments
Difference between AgentTuning and ToolBench
Closed 9 months ago, 1 comment
GPU memory for fine-tuning
Closed 9 months ago, 1 comment
Start TGI worker
Closed 9 months ago, 1 comment
Question about the different cases of reward scores
Closed 9 months ago, 1 comment
requests.exceptions.MissingSchema: Invalid URL '127.0.0.123332/generate': No scheme supplied. Perhaps you meant https://127.0.0.123332/generate?
Closed 9 months ago, 1 comment
Inference with `vllm`
Closed 9 months ago, 1 comment
Meaning and calculation of the numbers in Table 2 of the paper
Closed 9 months ago, 2 comments
Add license
Closed 9 months ago
Grammar mistake in readme
Closed 9 months ago, 1 comment
Auto comment
Closed 9 months ago, 1 comment
When will it be available on the ModelScope community?
Closed 9 months ago, 1 comment
Questions about the paper
Closed 9 months ago, 1 comment
Model Output Length
Closed 9 months ago, 2 comments
Can it be run outside Docker?
Closed 9 months ago, 2 comments
The base model is Llama 2; does it support Chinese?
Closed 9 months ago, 1 comment
fine-tune code
Closed 9 months ago, 1 comment
Fine-tuning
Closed 9 months ago, 2 comments
An open question: What's the difference between Agents and Tools
Closed 9 months ago
How is the reward for interaction trajectories obtained?
Closed 9 months ago, 1 comment
Can AgentLM be deployed locally with an OpenAI-API-style interface?
Closed 9 months ago, 1 comment
train code
Closed 9 months ago, 3 comments