QwenLM / Qwen2

Qwen2 is the large language model series developed by the Qwen team at Alibaba Cloud.

QwenLM/Qwen2 Issues

Failed to run Qwen2 inference on multiple GPUs using Accelerate
Updated in 2 hours · 2 comments
Can a single machine with 8x A100-80G do full-parameter training of qwen72b?
Closed a day ago · 2 comments
I deployed the 32B model on 8 T4 GPUs; why does processing feel so slow?
Updated a day ago · 1 comment
Qwen2-72B-ins-gptq-int4 strongly prefers answering in English, even when Chinese is specified.
Updated a day ago · 2 comments
Output keeps repeating until it reaches the max_new_tokens length
Updated a day ago · 4 comments
The official fine-tuning example cannot handle batched data and wastes a lot of GPU memory.
Updated a day ago · 3 comments
Error when generating content with past_key_values
Updated 2 days ago
Is it feasible to use Function Calling without Qwen-Agent?
Closed 2 days ago
How Can I Use Run Manager to Stream Response on RetrievalQA?
Closed 3 days ago · 2 comments
Fine-tuning Qwen2-7B-Instruct in bf16: after training for a while, loss=nan. Is there a way to fix this?
Updated 4 days ago
Qwen2-7b-instruct with SFT-FT: loss drops to 0. How can this be fixed?
Updated 5 days ago · 3 comments
Qwen2-7B vllm openai server outputs nothing but repeated exclamation marks
Closed 15 days ago · 2 comments
Question about the annotation data format for function calls
Closed 6 days ago · 2 comments
Output is not reproducible even with a fixed seed
Updated 5 days ago · 2 comments
Are there plans to support JSON MODE?
Updated 5 days ago
Qwen2-7B output keeps repeating until it reaches the max_new_tokens length
Updated 5 days ago · 3 comments
Question about target_ids in finetune.py
Updated 5 days ago · 4 comments
sft 7B model_max_length=90000 24 A00 OOM
Updated 6 days ago · 2 comments
The sample code in the documentation section "Quantize your model with AutoAWQ" is incorrect
Closed 6 days ago · 1 comment
Fine-tuning the Qwen2-1.5B-Instruct-GPTQ-In4 model with the Qwen2 sft script fails with: AttributeError: 'BitsAndBytesConfig' object has no attribute 'get_loading_attributes'
Updated 6 days ago
How do I fine-tune the base model?
Updated 6 days ago · 2 comments
Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4 takes too long to load (nearly 2 hours)
Updated 7 days ago
Does the Qwen2-0.5B model lack accuracy?
Closed 7 days ago · 2 comments
What is the difference between the instruct and base models? Which one is better to use for fine-tuning?
Closed 7 days ago · 3 comments
Version conflict between gptq and vllm
Updated 7 days ago · 4 comments
Why does qwen2 no longer offer a 14B version?
Closed 7 days ago · 1 comment
Questions about padding some of the qwen2-72b weight parameters during model quantization.
Updated 7 days ago
Terminology cannot be translated correctly
Closed 7 days ago · 2 comments
Qwen2-1.5B-Instruct inference latency issue
Closed 7 days ago · 5 comments
The training speed of 2 nodes with total 4 A10 GPUs is much slower than that of single node with 2 A10 GPUs.
Updated 7 days ago · 2 comments
What is the difference between id_to_token() and decode() in tokenizer?
Updated 7 days ago
Qwen2-72B cannot be loaded after quantization
Closed 8 days ago · 3 comments
What gpus can finetune qwen2 72B by using qlora?
Updated 8 days ago · 1 comment
When will the QWEN2 technical report be released?
Closed 8 days ago · 1 comment
Question about deploying multi-node distributed training
Closed 8 days ago · 2 comments
For continued pre-training of qwen2, does the rope_theta value need to be adjusted?
Closed 8 days ago · 1 comment
Issue with qwen2-72b-gptq-int4
Closed 9 days ago · 3 comments
Qwen2-72B-Instruct raises "RuntimeError: CUDA error: device-side assert triggered"
Closed 9 days ago · 1 comment
qwen2-7b (or instruct): the generate method outputs only exclamation marks in float16 precision; other precisions work normally
Closed 15 days ago · 3 comments
Qwen2-72B-Instruct output gets truncated; the issue reproduces every time.
Closed 11 days ago · 2 comments
English text appears for no reason during inference; how can this be handled?
Updated 12 days ago · 4 comments
Quantizing a LoRA-fine-tuned qwen2-72b into a 4-bit model
Updated 12 days ago · 3 comments
After executing 'qwen2', I interacted with it by posing several queries. Upon exiting the session and attempting to relaunch 'qwen2', I encountered an error
Updated 14 days ago · 2 comments
How to return response in Japanese?
Closed 14 days ago · 4 comments
Inference performance issue when deploying on 4x 4090 GPUs
Updated 14 days ago · 1 comment
Qwen2 How to use fastapi to encapsulate the stream output interface
Updated 15 days ago · 1 comment
Is there an officially Marlin-quantized model?
Updated 15 days ago · 1 comment
GGUF int4 (qwen2:72b) model issue
Updated 15 days ago · 1 comment
Incorrect output when using a quantized model.
Closed 15 days ago · 2 comments
RuntimeError: at::cuda::blas::gemm: not implemented for struct c10::BFloat16
Updated 15 days ago · 1 comment