microsoft / LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

https://llmlingua.com/

microsoft/LLMLingua Issues

Experiments[Question]:
Closed a month ago2
[Question]: Difficulty Reproducing Results in CoT.ipynb
Updated a month ago4
[Question]: Token indices sequence length is longer than the specified maximum sequence length for this model (614 > 512). Running this sequence through the model will result in indexing errors
Updated a month ago1
Issues with reproducing LongLLMLingua on the LongBench dataset.
Updated a month ago5
[Question]: longbench数据集测试对context和question的压缩
Updated 2 months ago2
[Question]: How to combine longllmlingua and llmlingua2 ?
Updated 2 months ago1
[Question]: LongBench BM25 reproduce
Updated 2 months ago3
[Question]: How to use a manually downloaded model
Updated 2 months ago5
[Question]: Reproducing the score of official microsoft/llmlingua-2-xlm-roberta-large-meetingbank
Updated 2 months ago6
[Feature Request]: How to improve the accuracy of compressor for large SFT models through training
Updated 2 months ago1
[Question]: Reproduce LLMLingua-2 results with Mistral-7B
Updated 2 months ago2
[Bug]: 返回中文会出现乱码
Updated 2 months ago2
Troubleshooting Issues in LlamaIndex RAG Demo after Updating to Version 0.10
Closed 2 months ago
[bug]: Can not use local model as input
Closed 2 months ago2
[Question]: LLMLingua requires too much GPU memory, and it takes a lot of time to compress long text, such as 16k, etc. How to make it and LLM work at the same time
Updated 2 months ago2
[Question]: Reproduce LLMLingua-2 on the LongBench SingleDoc dataset
Updated 2 months ago2
Meaningless tokens generation
Updated 2 months ago3
[Feature Request]: Lingua2 can discards tokens based on a probability threshold
Updated 2 months ago1
[Question]: How does the token-level question-aware compression work?
Updated 3 months ago1
[Bug]: `n_original_token` might not be set correctly for `compress_prompt_llmlingua2`
Closed 3 months ago1
[Bug]: structured_compress_prompt not working correctly with LLMLingua2
Updated 3 months ago4
[Question]: Reproduce LongLLMLingua on the LongBench MultiDoc dataset
Updated 3 months ago3
[Bug]: AssertionError when executing Code.ipynb
Closed 3 months ago3
[Question]: How to compress a simple prompt on mac
Closed 3 months ago1
[Question]: When I changed the target token, the code reported an error
Updated 3 months ago1
[Question]: Markdown table compression
Closed 3 months ago1
[Bug]: When I use Chinese prompt, the compressed prompt has extra spaces.
Updated 3 months ago1
[Feature Request]: Docker service support
Closed 3 months ago1
[Question]: LLMLingua2 query condition and dynamic ratio
Updated 4 months ago1
[Question]: reproducing LongLLMLingua on the LongBench dataset.
Updated 4 months ago1
[Question]: access to public storage https://openaipublic.blob.core.windows.net/ is prohibited in secure environments ,
Updated 4 months ago3
[Question]: is it possible to use PromptCompressor without GPU?
Closed 4 months ago1
[Question]: LLMLingua1 code
Updated 4 months ago1
[Question]: Support for Aleph Alpha Luminous Models via API
Updated 4 months ago2
[Question] Compressor fine-tune
Updated 4 months ago1
[Question]: LongLLMLingua vs. LLMLingua2 for chatbot history compression
Closed 4 months ago1
version 0.2.1 iteration plan
Closed 4 months ago1
[Question]: How to only compress documents in the RAG setting?
Closed 4 months ago1
[Bug]: Compression truncates words and sentences
Updated 4 months ago3
[Feature Request]: Token compression using GPT-3.5-turbo
Updated 4 months ago3
[Question]: Incorrect `condition_mode` parameter value in `get_condition_ppl` function
Closed 4 months ago2
[Question]: Running LLMLingua with GGUF models
Updated 5 months ago1
[Question]: Why set "cache_dir" to "/tmp/cache" on macOS when passing mps as device_map?
Updated 5 months ago3
How to reproduce Multidocument QA results under 9th？
Updated 5 months ago5
Experiments with Alphanumeric Entities
Updated 5 months ago3
Enhancing quality - Recovery settings
Updated 5 months ago1
Index error for small token amounts
Closed 5 months ago3
PromptCompressor -- Missing Package Accelerate but it is installed
Closed 5 months ago1
error on chatglm3-6b-32k
Closed 5 months ago1
Compatible models
Updated 6 months ago1