tattle-made / feluda

A configurable engine for analysing multi-lingual and multi-modal content.

Home Page:https://tattle.co.in/products/feluda/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Operator to detect language in an Audio file

aatmanvaidya opened this issue · comments

Operators are core components of Feluda, they are modules that help us analyse media items - text, image, video, audio etc. All the code for operators can be found at src/core/operators

The task is to create a Operator that can detect language in a Audio media item. This is an open ended issue, you will have to investigate on what methods to do this are best and effective.
For now the operator should be able to identify/detect the English, Hindi, Telugu and Tamil languages.

Expected Outcome

  • Write an operator, profile it for RAM and CPU usage. A key outcome should be that the operator should have stabilised RAM usage. Some methods on profiling have been documented on the Wiki here.
  • Write a test for the operator
  • Document the operator on the Wiki

@aatmanvaidya can I work on this? I think this can be a pre-requisite for detecting languages in videos as well?

Hi @Snehil-Shah , there is a similar issue for video's as well - #319
are you interested in picking that up? or else this audio one is also fine!

@aatmanvaidya This is fine. Can I be assigned?

Operator profile

Audio Length CPU Time (s) RAM Usage
10s 1.61 214.0MiB
30s 2.33 215.8MiB
1m 3.46 218.6MiB
5m 12.14 275.8MiB
10m 23.20 550.6MiB
30m 66 1.6GiB

This issue has been resolved here - #324