migumar2 / libCSD

C++ Library implementing Compressed String Dictionaries

Reconcile with hdt-cpp implementation.

donpellegrino opened this issue

The HDT-C++ Repository includes an implementation of Compressed String Dictionaries embedded within its codebase at https://github.com/rdfhdt/hdt-cpp/tree/develop/libhdt/src/libdcs. The HDT-C++ CSD.h file references:

"Compressed String Dictionaries"
Nieves R. Brisaboa, Rodrigo Canovas, Francisco Claude, Miguel A. Martinez-Prieto and Gonzalo Navarro.
10th Symposium on Experimental Algorithms (SEA'2011), p.136-147, 2011.

The somewhat messy state of library use in HDT-C++ is documented at rdfhdt/hdt-cpp#19.

The libCSD README in this repository cites the same journal article as the basis for the implementation.

The article itself references http://libcds.recoded.cl (Wayback Machine) which is now https://github.com/fclaude/libcds.

Is this Martinez-Prieto implementation distinct from the Navarro / Claude implementation? If so, how should I choose between them? Or, is this codebase an old artifact along the lifecycle of one canonical Compressed String Dictionaries implementation?

Hi!

I'm not sure what the current state of HDT-C++ is (I can't remember when I wrote the last line of code for it), but I think that it uses an old version of our string dictionaries (https://doi.org/10.1007/978-3-642-20662-7_12).

libCSD improves on this original work, but I think that PFC, which is the default HDT dictionary, is essentially the same. Anyway, libcds must be the same because I never migrated to libcds2, and I think that this project is dead :(.

I'm sorry I can't be of more help.

Thanks.