jiaohuix / RecurrentYiYan

Recurrent YiYan: Interactively use YiYan to generate Chinese long text

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RecurrentYiYan

[📄 Paper] | [🤗 Demo - Writing Assistant] | [🤗 Demo - Interactive Fiction] | [📺 Video] | [🔥 Discord]


RecurrentYiYan🔥

1 介绍:

本项目修改自 RecurrentGPT,使用文心一言进行中文的长文本生成(需要能访问网页版一言)。

pip installl -r requirements.txt
# 指定谷歌浏览器打开的端口,使用selenium连接,从而使用python获取一言的输出
./chrome.exe --remote-debugging-port=9222  --user-data-dir=D:\\selenium
# 命令行
python main.py --type '科幻小说' --topic 'AI克隆人、意识上传、机械飞升' --iter 3
# gradio 
python gradio_server.py

p9zIkE4.png

2 未来计划:

1.修改prompt,避免超出max_token(或者多轮提示),并生成更高质量的故事

2.使用langchain管理长期记忆

3.使用其他的语言模型在小说上微调,如ChatGlm,RWKV

4.写沙雕小说哈哈√

3 Bug:

Framework Illustration

struct

RecurrentGPT replaces the vectorized elements (i.e., cell state, hidden state, input, and output) in a Long-short Term Memory RNN (LSTM) with natural language (i.e., paragraphs of texts), and simulates the recurrence mechanism with prompt engineering.

At each timestep t, RecurrentGPT receives a paragraph of text and a brief plan of the next paragraph, which are both generated in step t − 1. It then attends to the long-term memory, which contains the summaries of all previously generated paragraphs and can be stored on hard drives, and relevant paragraphs can be retrieved with semantic search.

RecurrentGPT also maintains a short-term memory that summarizes key information within recent timesteps in natural language and is updated at each time step. RecurrentGPT combines all aforementioned inputs in a prompt and asks the backbone LLM to generate a new paragraph, a short plan for the next paragraph, and updates the long-short term memory by rewriting the short-term memory and appending the summary of the output paragraph to the long-term memory.

Example

struct

Deployment

You can change the configurations given in the recurrent.sh script

iteration: 10                       #(int) the number of rounds you would like it to roll.
outfile: response.txt               #(str) the output file path.
init_prompt: init_prompt.json       #(str) the path to the prompt used for initialization.
topic: Aliens                       #(str) the topic that you wish your novel is about.
type: science-fiction               #(str) the type of novel you would like to write.

Then after specify your OPENAI_API_KEY in the recurrent.sh file, you can run

sh recurrent.sh

NOTE: If your local internet is not allowed to access OpenAI's API, you might need to first export your HTTP proxy in the recurrent.sh file as well.

export http_proxy='your_proxy'

Showcases

Prompt Engineering

struct

Iterations

struct

Human writer starts by choosing the topic he/she wants to write and writes a short paragraph describing the background and the outline of the book. Then RECURRENTGPT automatically generates the first paragraphs and provides a few possible options for the writer to continue the story. The writer may select one from them and edit it if needed. He or she can also write a short plan for the next few paragraphs by him/herself if generated plans are all inappropriate, which makes human-AI co-writing process more flexible

web demo

You can directly use our online demo at: https://www.aiwaves.org/recurrentgpt and https://www.aiwaves.org/interactivefiction

Or you can run it on your local machine by:

export OPENAI_API_KEY="your_api_key"
python gradio_server.py

web-demo

Citation

@misc{zhou2023recurrentgpt,
      title={RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text}, 
      author={Wangchunshu Zhou and Yuchen Eleanor Jiang and Peng Cui and Tiannan Wang and Zhenxin Xiao and Yifan Hou and Ryan Cotterell and Mrinmaya Sachan},
      year={2023},
      eprint={2305.13304},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

About

Recurrent YiYan: Interactively use YiYan to generate Chinese long text

License:Apache License 2.0


Languages

Language:Python 99.1%Language:Shell 0.9%