crealytics / spark-excel

A Spark plugin for reading and writing Excel files

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Support for skipping hidden rows

spokelsedis opened this issue · comments

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

As far as I can tell, this library doesn't have any support for reading if a row is hidden or not.

Expected Behavior

I have some sheets that I'd like to skip reading any hidden rows. It would be useful to either:

  • Add an option to skip reading hidden rows
  • Expose an is_row_hidden field, then I could filter out any rows with is_row_hidden = True

Steps To Reproduce

No response

Environment

- Spark version:
- Spark-Excel version:
- OS:
- Cluster environment

Anything else?

It looks like the Apache POI library supports reading hidden attributes, at least for cells/sheets (see https://github.com/apache/poi/blob/6943cc9a9c8528ca47bba2a499e408e41f0c893e/poi/src/main/java/org/apache/poi/hssf/record/BoundSheetRecord.java#L158).

That would be a nice contribution, we're open to PRs 👍

@nightscape I have a draft PR here: #817

Please let me know what you think.