li-plus / chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4

li-plus/chatglm.cpp Issues

How to set a longer context length
Updated 5 months ago, 5 comments
Running cli_demo.py on an M1 Mac throws an "incompatible architecture" exception
Closed 6 months ago, 1 comment
Why can't the 4090 GPU be used? Inference still runs on the CPU even after a successful build
Closed 6 months ago, 5 comments
GPU memory usage increases
Updated 7 months ago
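[Solved, solution included] Cannot run the LangChain API; the command fails with an error. Could you provide a Python version and the list of installed packages with which this command runs successfully?
Updated 6 months ago, 3 comments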
Error when running after updating: No module named 'pydantic._internal'
Updated 6 months ago, 4 comments
How to construct a ToolCallMessage?
Updated 7 months ago
check failed (data != MAP_FAILED) Invalid argument
Updated 5 months ago, 1 comment
ChatGLM3 fails with examples/cli_demo.py: 'TypeError: chat() got an unexpected keyword argument 'max_new_tokens''
Updated 7 months ago, 1 comment
CodeGeeX2 model conversion error
Updated 7 months ago, 2 comments
How to release an old loaded model file using Python.
Updated 7 months ago
Q4_0 + CPU conversion: a guide to avoiding pitfalls
Updated 3 months ago, 9 comments
Failed to install python binding for Windows
Updated 2 months ago, 2 comments
openai_api does not support function calling
Closed 6 months ago
Could support for the P100 GPU be added?
Updated 6 months ago, 2 comments
Are there plans to merge chatglm.cpp into llama.cpp?
Updated 2 months ago, 7 comments
Problems after updating the CMake version
Updated 3 months ago, 2 comments
After starting the OpenAI API server, test requests cannot be processed in parallel
Updated 7 months ago
Does chatglm_cpp.openai_api:app support serving bin models converted from Qwen?
Updated 7 months ago
The llama.cpp main command has an -ngl N option; can chatglm.cpp support hybrid CPU+GPU processing?
Updated 6 months ago, 3 comments
Cmake error on Windows
Closed 7 months ago, 4 comments
Kudos 👍🏻 The documentation is beautifully written; I followed it from start to finish without hitting a single obstacle 👍🏻
Updated 7 months ago
Conversion fails on a MacBook
Updated 7 months ago, 1 comment
pip installation fails
Closed 7 months ago, 1 comment
Cannot call tools when using the q4_0 chatglm3 model
Updated 7 months ago
Error: CUDA error when release memory!
Updated 7 months ago, 2 comments
The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
Updated 7 months ago, 3 comments
Baichuan2-7b-chat tokenizer and outputs differ between your repo and the official Hugging Face example.
Closed 7 months ago, 2 comments
build docker error
Updated 7 months ago, 1 comment
How to specify which device to use when running on GPU?
Updated 6 months ago, 3 comments
After building the container, running the API endpoint raises an error
Closed 7 months ago, 1 comment
./build/bin/main -m chatglm3-ggml-q8.bin -i zsh: illegal hardware instruction ./build/bin/main -m chatglm3-ggml-q8.bin -i
Updated 6 months ago, 5 comments
Quantizing baichuan2-13b fails on an M1 MacBook Pro
Updated 7 months ago, 2 comments
Beginner question: chatglm3-ggml.bin raises an error when used through the Python pipeline
Closed 7 months ago, 2 comments
Can this run on an iPhone?
Updated 7 months ago, 1 comment
Deploying a LoRA fine-tuned model
Closed 4 months ago, 7 comments
Is it possible to change how the API handles messages?
Updated 7 months ago, 2 comments
About C++ deployment
Updated 8 months ago, 2 comments
How can chatglm.cpp support ChatGLM3 function calling?
Updated 7 months ago, 7 comments
no CUDA-capable device is detected current device:
Updated 7 months ago, 2 comments
device
Closed 8 months ago, 1 comment
Cannot convert codegeex2-6b when transformers>=4.34.0
Closed 8 months ago, 5 comments
Why does a P-tuned chatglm3 model forget all of its fine-tuned knowledge after quantization?
Closed 3 months ago, 5 comments
A workable solution
Closed 8 months ago
Running web_demo.py from any directory fails with No module named 'chatglm_cpp'
Closed 8 months ago, 3 comments
cmake -B build error
Updated 4 months ago, 6 comments
chatglm3
Updated 7 months ago, 1 comment
chatglm-ggml_q4_0.bin GGML_ASSERT ggml-metal.m:1453: false
Updated 7 months ago, 7 comments
How to increase CPU utilization
Updated 8 months ago, 1 comment
Running ./build/bin/main -m chatglm-ggml.bin hangs
Updated 8 months ago