immersive-web / proposals

Initial proposals for future Immersive Web work (see README)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Optical Character Recognition

AdamSobieski opened this issue · comments

Introduction

I would like to propose that XR-device-based optical character recognition be considered as an important XR scenario.

Optical Character Recognition

With XR devices and their sensors, users could scan text and mathematics content, from papers, chalkboards, dry-erase boards, and other surfaces.

Multimodality

While using XR devices and their sensors to scan text and mathematics content, users could read the content aloud to enhance the accuracy of scans. This could also be interoperable with eye tracking.

Interactivity

Multimodal dialogue systems could interact with users to ensure that contents are thoroughly, properly, and accurately scanned.

Check out https://mathpix.com/ @Mathpix . I can imagine their approach would work in XR scenarios.

Do you know of any devices that are planning on having support for this?
If not, it's too early to take this up in the group.

@physikerwelt , thank you for the link to Mathpix. That is an impressive technology. Some interesting XR computer algebra system user interfaces can be envisioned.

@cabanier , while any XR device with external cameras and/or microphones is relevant, I do not know of a specific XR hardware vendor with interest at this time. Software vendors in the XR business collaboration space may have an interest in STEM collaboration scenarios but I haven't contacted any.

Please let me know if this specific proposal issue would be better to present again at another time.