aahmed-se / indexify

A scalable realtime and continuous indexing engine for Unstructured Data to build Generative AI Applications

Home Page:https://getindexify.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Indexify

Tests

Indexify is a reactive structured extraction engine for un-structured data.

Applications leveraging LLMs for autonomous planning or queries necessitate timely index updates aligned with data changes or new extraction methods. Indexify enables both, by applying feature extractors on data in real-time and updating one or many indexes.

Why use Indexify

  • Knowledge Base for LLMs: Real-time structured extraction and embedding generation from any un-structured data to feed relevant context to AI Applications.
  • Distributed Extraction Engine For Scale: Distributed extraction to scale indexing large amounts of data without sacrificing retrieval performance.
  • Custom Extractors: You can extend Indexify by writing a custom extractor for your use cases to extract specific information from data.
  • Pluggable Storage: Use one of the many storage backends to persist indexes.

Getting Started

To get started follow our documentation.

Documentation

Our comprehensive documentation is available - https://getindexify.ai

Contributions

Please open an issue to discuss new features, or join our Discord group. Contributions are welcome, there are a bunch of open tasks we could use help with!

If you want to contribute on the Rust codebase, please read the developer readme.

Contact

Join the Discord Server - https://discord.gg/mrXrq3DmV8

About

A scalable realtime and continuous indexing engine for Unstructured Data to build Generative AI Applications

https://getindexify.io

License:Apache License 2.0


Languages

Language:Rust 90.6%Language:TypeScript 7.3%Language:Makefile 0.7%Language:HTML 0.3%Language:Dockerfile 0.3%Language:Ruby 0.2%Language:CSS 0.2%Language:Jinja 0.2%Language:Shell 0.1%