ROADMAP 2024

Question

ROADMAP 2024

writinwaters opened this issue 3 months ago · comments

writinwaters commented 3 months ago

v0.8.0

Support RAG graph orchestration and workflow #918
Support Self RAG #1069
Support several operators for initial agent/workflow
Moving files #785

v0.7.0

Implements RAPTOR for better chunking. #882
Supports ARM platform. #842
Supports HTML file.
Integrates reranker.

v0.6.0

Print version or commit-id when RAGFlow is started. Or showing these information on UI. #643
Chunks retrieval APIs #821
Files in knowledge base should also be found in file manager. #800
System components monitoring. #848
Supports simple document layout to speed up file parsing.#799
Streaming conversation output. #709
Default language will be given according to the browse setting and also can be configured. #801

Long-term plan

RAGFlow documents #720
APIs #1102

ptdaxiake · Answer 1 · Sun Apr 07 2024 11:39:29 GMT+0800 (China Standard Time)

                                                 INFO:werkzeug:WARNING: This is a development server. Do not use it in a production deployment. Use a production WSGI server instead.

Running on all addresses (0.0.0.0) [00:00<?, ?B/s]
Running on http://127.0.0.1:9380
Running on http://172.18.0.6:9380
INFO:werkzeug:Press CTRL+C to quit
tsr.onnx: 100%|██████████| 12.2M/12.2M [00:01<00:00, 11.3MB/s]
layout.manual.onnx: 100%|██████████| 12.2M/12.2M [00:01<00:00, 8.83MB/s]
layout.paper.onnx: 100%|██████████| 12.2M/12.2M [00:01<00:00, 11.6MB/s]]
layout.onnx: 100%|██████████| 12.2M/12.2M [00:01<00:00, 7.65MB/s]3MB/s]
Fetching 9 files: 100%|██████████| 9/9 [00:06<00:00, 1.42it/s]s]
Fetching 9 files: 100%|██████████| 9/9 [00:06<00:00, 1.39it/s]
INFO:werkzeug:172.18.0.1 - - [07/Apr/2024 11:35:22] "GET / HTTP/1.1" 200 -
INFO:werkzeug:172.18.0.1 - - [07/Apr/2024 11:35:22] "GET /favicon.ico HTTP/1.1" 200 -
WARNING:root:Realtime synonym is disabled, since no redis connection.

Arno.Edwards · Answer 2 · Wed Apr 10 2024 10:25:42 GMT+0800 (China Standard Time)

v0.1.0

URL support: Capable of web crawling and the corresponding content extration. @KevinHuSh

@KevinHuSh Hi, I wonder what's the current status? Maybe i can collaborate on it. My tech Stack is python backend and i think web crawling is essential for workflow of software development.

KevinHuSh · Answer 3 · Thu Apr 11 2024 13:39:26 GMT+0800 (China Standard Time)

v0.1.0

URL support: Capable of web crawling and the corresponding content extration. @KevinHuSh

@KevinHuSh Hi, I wonder what's the current status? Maybe i can collaborate on it. My tech Stack is python backend and i think web crawling is essential for workflow of software development.

Crawling web page is big thing in my understanding. We do not have a clear picture for this. Here are two important points to note:
Crawling Task Dispaching
The way to execute JS on the page
Page classification which is related the structure of the data we store
The extraction of the main parts of the page

If you have any good solution to these points, please let me know....

Arno.Edwards · Answer 4 · Thu Apr 11 2024 14:48:33 GMT+0800 (China Standard Time)

Crawling web page is big thing in my understanding. We do not have a clear picture for this. Here are two important points to note: Crawling Task Dispaching The way to execute JS on the page Page classification which is related the structure of the data we store The extraction of the main parts of the page

If you have any good solution to these points, please let me know....

@KevinHuSh Please refer to #315

Maybe i can start with AWS Bedrock models to contribute to the project, then Support x-inference as model provider. Feel free to contact me via here or wx.

Vignesh T.V. · Answer 5 · Sat Apr 13 2024 20:35:40 GMT+0800 (China Standard Time)

@writinwaters @KevinHuSh Hi. Requesting to look at the issue I created: #345

Maybe fixing these issues would help us adopt ragflow better.

dashi6174 · Answer 6 · Fri May 10 2024 10:34:05 GMT+0800 (China Standard Time)

Not supporting streaming really affects the user experience. I hope it can be supported soon, as the implementation is not complicated.

ZJUT_miki · Answer 7 · Fri May 10 2024 20:50:15 GMT+0800 (China Standard Time)

v0.6.0

Print version or commit-id when RAGFlow is started. Or showing these information on UI. [Feature Request]: Print version or commit-id when RAGFlow is started. #643

APIs for knowledge base and file manager. [Feature Request]: File manager & API #345

Files in knowledge base should also be found in file manager. let file in knowledgebases visible in file manager #714

System components monitoring.

Supports simple document layout to speed up file parsing.

Supports user configured reranker.

Streaming conversation output. [Feature Request]: Support for conversational streaming #709

Whether it can provide users with accurate answers and quick answers, one is subject to accuracy and the other is subject to quick response

Jin Hai · Answer 8 · Fri May 10 2024 21:04:40 GMT+0800 (China Standard Time)

v0.6.0

Print version or commit-id when RAGFlow is started. Or showing these information on UI. [Feature Request]: Print version or commit-id when RAGFlow is started. #643

APIs for knowledge base and file manager. [Feature Request]: File manager & API #345

Files in knowledge base should also be found in file manager. let file in knowledgebases visible in file manager #714

System components monitoring.

Supports simple document layout to speed up file parsing.

Supports user configured reranker.

Streaming conversation output. [Feature Request]: Support for conversational streaming #709

Whether it can provide users with accurate answers and quick answers, one is subject to accuracy and the other is subject to quick response

You can file a new issue, so we can discuss in that issue.

xs818818 · Answer 9 · Sat May 11 2024 11:20:58 GMT+0800 (China Standard Time)

Are there any plans to use big language models for knowledge graphs?

Jin Hai · Answer 10 · Sat May 11 2024 12:10:41 GMT+0800 (China Standard Time)

Are there any plans to use big language models for knowledge graphs?

We've been thinking about this for a while, but haven't figured out how to implement it in RAGFlow. If you have any good issues, feel free to create a new issue and we'll discuss it!

Ding Jiatong · Answer 11 · Sat May 11 2024 19:35:30 GMT+0800 (China Standard Time)

Can it automatically continue when it says 'Due to length...' ? The current handling of length issues feels very rudimentary.

Jin Hai · Answer 12 · Mon May 20 2024 19:49:47 GMT+0800 (China Standard Time)

Postpone feature request of reranker configuration to 0.7.0.