OfficiallyMelon / ChatGLM-6B-Int4-Web-Demo

Less memory

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ChatGLM-6B-Int4-Web-Demo

Launch In Colab <-- press here to launch the web demo

About ChatGLM

ChatGLM-6B is an open bilingual language model based on General Language Model (GLM) framework, with 6.2 billion parameters. With the quantization technique, users can deploy locally on consumer-grade graphics cards (only 6GB of GPU memory is required at the INT4 quantization level).

ChatGLM-6B uses technology similar to ChatGPT, optimized for Chinese QA and dialogue. The model is trained for about 1T tokens of Chinese and English corpus, supplemented by supervised fine-tuning, feedback bootstrap, and reinforcement learning wit human feedback. With only about 6.2 billion parameters, the model is able to generate answers that are in line with human preference.

Links

About

Less memory

License:Apache License 2.0


Languages

Language:Jupyter Notebook 91.0%Language:Python 9.0%