deepchecks / deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

Home Page:https://docs.deepchecks.com/stable

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

[FEAT] Adding text properties (3)

hjain5164 opened this issue · comments

Is your feature request related to a problem? Please describe.
As discussed with @Nadav-Barak, the following text properties can be added but will not be the under the default text properties.

  • Count the number of unique URLs in the text.
  • Count the number of unique emails in the text.
  • Count the number of unique syllables in the text.
  • Average number of syllables in the text (not necessarily unique). Since the average number of syllables per sentence makes more sense.
  • Calculate the reading time of the text in seconds.
  • Count the number of unique date-times in the text. Let us do it later since this is not much useful for now.
  • Calculate the number of sentences in the text.
  • Calculate the average number of syllables per sentence in each text.

Describe the solution you'd like
Will be adding the above properties in the text_properties.py file. If the user wants to calculate these properties, then he can. These properties will not be calculated by default.

Describe alternatives you've considered
Discussed multiple properties and decided to go with this.

@Nadav-Barak or @noamzbr Can anyone assign this to me?