drewnoakes / metadata-extractor-dotnet

Extracts Exif, IPTC, XMP, ICC and other metadata from image, video and audio files

PDF support

oliver021 opened this issue

Hello friends, I was wondering whether it would be a good idea to include metadata extractors for other types of files, such as PDFs, Excel spreadsheets and Word documents. These files contain a lot of metadata as well, and I have not seen any metadata extraction library that covers them. It would be a good fit, since the name of this library does not limit it to multimedia files.

The library is open to the addition of support for other kinds of data, with the following guidelines:

  • No dependencies on external libraries (we have only one exception to this for XMP processing)
  • Metadata must be representable using the directory/tag structure we use throughout

Support for PDF is being tracked in the sibling Java library in drewnoakes/metadata-extractor#327. I have no issue with supporting other document types as you suggest.
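
To make the second guideline concrete, here is a minimal sketch of how a PDF document information dictionary could map onto a directory/tag structure. It is illustrative only and is not this library's API: the `Directory` class, the `pdf_info_directory` helper and the tag IDs are all hypothetical, and the input is assumed to be an already-parsed dictionary rather than a real PDF file. The keys `Title`, `Author` and `Producer` are standard entries of the PDF Info dictionary. In line with the first guideline, the sketch uses no external dependencies.

```python
from typing import Any, Dict, Optional

# Hypothetical tag IDs, chosen for illustration only.
TAG_TITLE = 1
TAG_AUTHOR = 2
TAG_PRODUCER = 3


class Directory:
    """A named group of tags, each identified by an integer ID."""

    def __init__(self, name: str, tag_names: Dict[int, str]) -> None:
        self.name = name
        self.tag_names = tag_names          # tag ID -> human-readable name
        self._values: Dict[int, Any] = {}   # tag ID -> stored value

    def set(self, tag_id: int, value: Any) -> None:
        # Reject IDs this directory type does not define.
        if tag_id not in self.tag_names:
            raise KeyError(f"Unknown tag ID {tag_id} in {self.name!r}")
        self._values[tag_id] = value

    def get(self, tag_id: int) -> Optional[Any]:
        # Returns None when the tag is absent.
        return self._values.get(tag_id)

    def describe(self) -> Dict[str, str]:
        # Human-readable view: tag name -> value rendered as text.
        return {self.tag_names[t]: str(v) for t, v in self._values.items()}


def pdf_info_directory(info: Dict[str, str]) -> Directory:
    """Build a directory from an already-parsed PDF Info dictionary.

    Keys without a known tag are ignored in this sketch.
    """
    tag_names = {TAG_TITLE: "Title", TAG_AUTHOR: "Author", TAG_PRODUCER: "Producer"}
    ids_by_key = {name: tag_id for tag_id, name in tag_names.items()}
    directory = Directory("PDF Info", tag_names)
    for key, value in info.items():
        if key in ids_by_key:
            directory.set(ids_by_key[key], value)
    return directory
```

A caller would pass in whatever key/value pairs a PDF parser produced, for example `pdf_info_directory({"Title": "Report", "Author": "Jane"})`, and then read values back by tag ID with `get` or as text with `describe`. Flat key/value metadata like this fits the structure easily; nested or repeated data would need more thought.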

Okay, I will open a pull request now. Thanks for responding!

@oliver021 fantastic, thanks.

Hello,
Is there any update on this?
I can't find the mentioned pull request.

Hello, I have not been able to do anything about it. My schedule changed drastically, and I now have very little time.