Leadblock / ChatGLM-6B-Int4-Web-Demo

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ChatGLM-6B-Int4-Web-Demo

Launch In Colab <-- press here to launch the web int4 demo

Launch In Colab <-- press here to launch the web int8 demo 测试失败, colab无法使用int8,所以作者开发int4是有原因的。

About ChatGLM

ChatGLM-6B is an open bilingual language model based on General Language Model (GLM) framework, with 6.2 billion parameters. With the quantization technique, users can deploy locally on consumer-grade graphics cards (only 6GB of GPU memory is required at the INT4 quantization level).

ChatGLM-6B uses technology similar to ChatGPT, optimized for Chinese QA and dialogue. The model is trained for about 1T tokens of Chinese and English corpus, supplemented by supervised fine-tuning, feedback bootstrap, and reinforcement learning wit human feedback. With only about 6.2 billion parameters, the model is able to generate answers that are in line with human preference.

Links

About

License:Apache License 2.0


Languages

Language:Jupyter Notebook 91.1%Language:Python 8.9%