There are 0 repository under nf4 topic.
An innovative library for efficient LLM inference via low-bit quantization
This repository wraps the flux fill model as ComfyUI nodes. Compared to the flux fill dev model, these nodes can use the flux fill model to perform inpainting and outpainting work under lower VRM conditions
This project implements a classic Retrieval-Augmented Generation (RAG) system using HuggingFace models with quantization techniques. The system processes PDF documents, extracts their content, and enables interactive question-answering through a Streamlit web application.