CVI-SZU / Linly

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集

Repository from Github https://github.comCVI-SZU/Linly

CVI-SZU/Linly Issues

问下大佬们有没有训练3B的打算？场景需要时延不能太高
Updated a month ago1
llama3增量预训练冻结哪些层训练哪些层效果比较好？
Closed 10 months ago
请问有没有性别年龄检测模型？
Updated a year ago
请问70B的模型要如何使用，抱脸上的模型看着文件和其他模型不一样
Updated a year ago
pretrain.py的示例似乎有点错误
Updated a year ago
请问，deepspeed 微调时，CPU的内存需要多大？
Updated a year ago1
在线地址无法使用
Updated a year ago
服务器最低配置要求是什么？
Updated a year ago
有人有pile的数据集吗？22个来源，825G的那个版本
Updated a year ago
readme上的加群二维码过期了
Updated 2 years ago
关于平行语料的预处理
Updated 2 years ago3
Chinese-LLaMA-2-13B-hf样本模板prompt到底是什么样的？
Updated 2 years ago
关于Chinese-LLaMA-2-13B (hf格式)
Updated 2 years ago
Please clarify the License for Chinese-LLaMA-2
Updated 2 years ago1
微信满员了，请重新上传新的微信图片我可以免费做管理员
Closed 2 years ago3
多轮对话问问题之后直接报错
Updated 2 years ago
python3 llama_server.py结果乱码
Updated 2 years ago
ChatFlow-13B.bin只有136字节
Updated 2 years ago1
关于33B模型预训练语料长度
Updated 2 years ago2
huggingface上openllama-13b的模型大小为26.4G,转换为huggingface那种模型格式之后模型大小为24.7G，这也就是大概是以fp16或者是bf16保存的模型
Updated 2 years ago
Are the tokenizer.model the same with the one in llama-7b?
Updated 2 years ago
Chinese-LLaMA-33B在多少块gpu上训了多长时间？
Updated 2 years ago
是否考虑通过位置插值来扩展大语言模型的上下文窗口，将上下文窗口提升至32K
Updated 2 years ago
请问在指令微调时损失函数与预训练有什么区别吗
Updated 2 years ago
open-llama13B做推理时，结果是英文
Updated 2 years ago
使用openllama13B + openmodel进行推理时，结果都是数字？这个需要做其他操作？
Updated 2 years ago1
关于openllama的两个相关问题
Updated 2 years ago1
falcon的使用中文预料进行增量训练
Updated 2 years ago2
readme上的加群二维码过期了
Updated 2 years ago4
额，是我用错了吗?简单推理都不行吗
Updated 2 years ago1
Multi machine pre-training hung
Updated 2 years ago1
请问有中文falcon的下载地址嘛
Updated 2 years ago2
Wrong argments
Updated 2 years ago
chatflow模型推理的时候，prompt需要加类似“human: {query}\n assistant: \n” 前后缀嘛？
Updated 2 years ago
openllama 13b base model生成内容比较奇怪
Updated 2 years ago
请问是否有增量预训练的基础模型13B的评测结果？
Updated 2 years ago
如何cite？
Updated 2 years ago1
增量预训练的时候报错exits with return code = -9 ，单卡80G显存的A100
Updated 2 years ago2
请问OpenLLaMA-13B在转换为hf模型时，convert_llama_from_tencentpretrain_to_hf.py直接复制了词表tokenizer.model，open_llama.model没有用到，是正常的吗？
Updated 2 years ago1
HF在线崩溃了
Updated 2 years ago1
Pretraining corpus formatting
Updated 2 years ago
7b模型性能和billa对比
Updated 2 years ago
请问大佬65B的模型何时能够放出
Updated 2 years ago
请问openllama 13b怎么转成HF格式
Updated 2 years ago
博主群二维码过期了,可以更新一个新的二维码吗
Closed 2 years ago1
Is it possible to support OPT models
Updated 2 years ago1
openllama性能评估
Updated 2 years ago
请问模型在tencentpretrain框架下预训练时选择的是bpe tokenizer吗？是否有对应的预训练的merge.txt呢？
Updated 2 years ago
Chinese-LLaMA-33B (hf格式)的模型如何部署，进行推理？
Updated 2 years ago1
33b Huggingface 格式怎么转成TencentPretrain 格式
Closed 2 years ago2