There are 1 repository under text-compression topic.
AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning with advanced compression strategies.
LLMA = LLM + Arithmetic coder, which use LLM to do insane text data compression. LLMA=大模型+算术编码,它能使用LLM对文本数据进行暴力的压缩,达到极高的压缩率。
Extreme inline text compression for HTML / JS. A custom pipeline that generates stand-alone HTML or JS files which embed competitively compressed self-extracting text, with file sizes of 25% - 40% the original.
This project is about the implementation of the LZ77 and Deflate COMPRESSION algorithm.
🍶 llm-distillery ⇢ use LLMs to run map-reduce summarization tasks on large documents until a target token size is met.
A simple visualization for huffman tree, which is used in encoding text.
Cross platform diff engine for .net core used to compare string values and generate a comparison output
Easy data compression algorithm.
Using Huffman code to encode and compress the secret text data along with using histogram shifting to encode the data into the image.
Huffman code
LZ77 algorithm, providing efficient lossless data compression and decompression capabilities. It offers a reliable and flexible solution for reducing file sizes in various programming languages.
Data driven approach for compressing text using an encoder-decoder architecture and word-level embeddings.
PAQ8PX compression archiver
A Java program that implements (4) four lossless text compression techniques for ASCII Text and compares compression ratio.
Official Implementation of Paper "A Novel Metric for Evaluating Semantics Preservation"
A simple python implementation for the well-known compression algorithm LZW
Tokex is a Toki Pona dialect/script that uses hexadecimal digits to spell out words.
The DCW stands for Damn Compression Wizard and is basically what it sounds like, a compression app that can compress a text file using Huffman Code (Lossless))
Highlights the key matches between your Given PDF and the description text
This code provides an implementation of Huffman Coding using C++. It enables efficient text compression and decompression through frequency-based encoding, constructing a Huffman Tree, assigning variable-length codes, and translating text to and from its binary representation.
In this term project we as a group, tried to revise the Huffman Algorithm of compression and implement it for encoding and decoding data by using the frequency of letters on the binary tree data structure.
🧾 Algoritmos de compressão de texto em C
This is a simple text compression utility using the lzw algorithm in c++
Deliverables relating to the Deep Learning Approaches to Natural Language Processing University Unit (Notes Courtesy to Dr Marc Tanti)
Proyecto en ensamblador que permite aplicar una compresión y descompresión a un texto almacenado en memoria. Estructura de Computadores - UPM
Javascript implementation of the Huffman Encoding algorithm (lossless data compression).
Store compressed text in Firestore