StoryToolkitAI

Description

StoryToolkitAI is a film editing tool that can help editors work more efficiently by automatically transcribing audio and allowing them to search transcripts semantically with the help of AI.

The tool works locally, independent of any editing software, but it also functions as a DaVinci Resolve Studio 18 integration via API. It uses OpenAI, OpenAI Whisper for speech-to-text, sentence transformers for semantic search, and a few other AI technologies to get stuff done.

Recently, we've added a direct interface with ChatGPT, which allows the use of state-of-the-art AI to analyze transcripts and have conversations about them.

https://vimeo.com/759962195/dee07a067a

Key Features

Resolve Studio Integrations

Mark and Navigate Resolve Timelines via Transcript, plus other handy Resolve-only features
Import subtitles after transcription from the tool directly into Resolve
Easily turn Resolve Markers into Transcript Groups and vice versa
Advanced Search of Resolve timeline markers
Render markers to stills or clips
Other timecode-based features, like copying transcript text to clipboard with timecodes etc.
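
To give an idea of what the timecode-based features involve, here is a minimal sketch of turning a transcript position in seconds into an HH:MM:SS:FF timecode. This is not the tool's actual code: the function names are made up for illustration, and the sketch assumes an integer frame rate, non-drop-frame timecode and a timeline that starts at 00:00:00:00.

```python
def seconds_to_timecode(seconds: float, fps: int = 24) -> str:
    """Format a position in seconds as non-drop-frame HH:MM:SS:FF."""
    total_frames = int(round(seconds * fps))
    frames = total_frames % fps            # frames within the current second
    total_seconds = total_frames // fps    # whole seconds elapsed
    hours, remainder = divmod(total_seconds, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}:{frames:02d}"


def format_segment(start_seconds: float, text: str, fps: int = 24) -> str:
    """Prefix a transcript line with its start timecode (hypothetical helper)."""
    return f"{seconds_to_timecode(start_seconds, fps)}  {text.strip()}"


print(format_segment(3661.5, " And that's a wrap. "))
```

A real timeline usually has a start timecode offset (often 01:00:00:00), which would have to be added on top of this.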

Planned Features

Our plan is to incorporate more AI technologies that make editors' work easier and more efficient, something similar to having an AI Assistant Editor which knows what is where in your footage and can even classify footage by meaning, emotions, visual content etc. Automated transcriptions are simply a means to an end.

At this stage, the app is raw and not polished at all, but we use it daily in our editing room. It's free not only out of sheer generosity, but also because we'd like to change how people approach editing by using AI.

Some of the above features are only available in the non-standalone version of the tool, but they will be available in the standalone version in the next release.

For detailed features info, go here.

Is it really completely free?

Yes, the tool runs locally and there's no need for any additional account to transcribe, translate to English, or use any of its features. We may develop features that depend on external services, but the current features will always be free and will never be capped.

Some features are released earlier only to our Patreon Patrons. If you want to support the development, check out our Patreon page and get some cool perks.

About data privacy

By the way, if you feel that your content is sensitive or subject to privacy laws, no worries: the tool does not send anything to the Internet that you don't want it to. It only uses your local machine to transcribe and translate your audio.

Currently, the only features that send data from your machine to the Internet are:

the tool itself checks whether your StoryToolkitAI API Token is valid (only when a token is entered)
the Assistant sends data to the Internet (directly to OpenAI).

Contributions

This tool is coded by Octavian Mot, your unfriendly filmmaker who hates to code and tries to keep it together as half of mots. Our team uses it daily in our editing room, which allows us to update it with features that we need and think will be useful to others.

Feel free to get in touch with compliments, criticism, and even weird ideas for new features.

The tool would be useless without the following open source projects:

OpenAI Whisper
Sentence Transformers
OpenAI ChatGPT
and many other packages that are listed in the requirements.txt file

Setup & Installation

For detailed installation instructions go here.

Known issues

Hallucinations during audio silence

On chunks of audio that are silent, Whisper sometimes writes phrases that aren't there. This is a known issue. To prevent that from happening, try using the pre-detect speech option in the Transcription Settings Window.
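
For anyone post-processing raw Whisper output themselves, a different and much cruder mitigation is to drop segments that Whisper itself flags as probably silent. The sketch below is not how the pre-detect speech option works. It only assumes that segments look like the dictionaries returned by Whisper's transcribe call, which include a `no_speech_prob` value. The helper name, the sample segments and the 0.6 cutoff are picked purely for illustration.

```python
def drop_probable_silence(segments, max_no_speech_prob=0.6):
    """Keep only the segments that Whisper did not flag as likely silence."""
    return [
        segment for segment in segments
        # Segments without the field are kept rather than guessed at
        if segment.get("no_speech_prob", 0.0) <= max_no_speech_prob
    ]


# Made-up segments in the shape of Whisper's output
segments = [
    {"start": 0.0, "end": 2.5, "text": " Roll camera.", "no_speech_prob": 0.02},
    {"start": 2.5, "end": 9.0, "text": " Thanks for watching!", "no_speech_prob": 0.91},
]

kept = drop_probable_silence(segments)
print([segment["text"] for segment in kept])
```

A filter like this can also throw away quiet but real speech, so treat the cutoff as something to tune per project.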

Tool doesn't connect with Resolve

Make sure that, in DaVinci Resolve Preferences -> General, "External Scripting using" is set to Local. Again, the tool only works with Resolve Studio and not with the free version of Resolve (as far as we know).
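
To check the connection outside of the tool, a short script like the one below can help narrow things down. It is a sketch, not the tool's own connection code: it assumes the `DaVinciResolveScript` module that ships with Resolve is importable in your Python environment (this usually requires the environment variables described in Resolve's scripting README), and the `connect_to_resolve` helper is made up for this example.

```python
def connect_to_resolve():
    """Return the Resolve scripting object, or None if it cannot be reached."""
    try:
        # Module shipped with DaVinci Resolve; it is not installable via pip
        import DaVinciResolveScript as dvr_script
    except ImportError:
        return None
    # scriptapp() gives back None when Resolve is not running
    # or when external scripting is not enabled
    return dvr_script.scriptapp("Resolve")


resolve = connect_to_resolve()
if resolve is None:
    print("Could not reach Resolve. Is Resolve Studio running and external scripting set to Local?")
else:
    print("Connected to", resolve.GetProductName(), resolve.GetVersionString())
```

If this script cannot connect either, the problem is most likely in the Resolve or Python setup rather than in the tool.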

Windows Standalone version doesn't start or doesn't connect to Resolve

If the tool just hangs when you start it up, or if it doesn't connect to Resolve, most likely there is a conflict with another Python installation on your machine. The best approach is to uninstall all other Python versions and try to run the tool again.

Tool freezing during Resolve playback

Currently, the tool gets stuck while it waits for a reply from the Resolve API during Resolve playback, but it gets un-stuck as soon as the playhead stops moving. This will be fixed in a future update.

Timecode issues with 23.976 timelines

A bug in the Resolve API, which sometimes reports 23.976 fps as 23 fps, creates a bunch of issues, mainly for operations that use timecode (transcript to playhead navigation, adding markers at the precise frame, etc.). Unfortunately, this can only be fixed by Blackmagic within Resolve itself (fingers crossed for an update?).
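
To see why the wrong frame rate matters, here is a small sketch of the arithmetic. The `normalize_fps` helper is hypothetical and is not something the tool or the Resolve API provides. It assumes that a reported value of exactly 23 always stands for 23.976 (24000/1001), which may not hold in every project.

```python
# Assumption: a reported 23 really means 23.976 fps (24000/1001)
TRUNCATED_RATES = {23: 24000 / 1001}


def normalize_fps(reported_fps):
    """Map a truncated frame rate back to its fractional value (hypothetical helper)."""
    fps = float(reported_fps)
    if fps.is_integer() and int(fps) in TRUNCATED_RATES:
        return TRUNCATED_RATES[int(fps)]
    return fps


def seconds_to_frame(seconds, fps):
    """Convert a position in seconds to the nearest frame number."""
    return round(seconds * fps)


one_hour = 3600
wrong = seconds_to_frame(one_hour, 23)                 # frame rate as reported
right = seconds_to_frame(one_hour, normalize_fps(23))  # frame rate as intended
print(wrong, right, right - wrong)
```

One hour into the timeline, the two results are already 3514 frames apart (82800 versus 86314), so markers and playhead positions calculated from the reported rate land far from the intended frame.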

Black Interface / Flickering on Intel Macs

Some users are experiencing weirdness with the interface on Intel Macs. This is due to a bug in Tcl/Tk, a package required to create the interface, which needs to be re-installed together with Python and everything else on the machine. Details here and a possible fix here.

RuntimeError: CUDA out of memory

If you get this message while transcribing on the GPU, it means that your GPU doesn't have enough memory to run the model you have selected. The solution is to either use a smaller model, or to transcribe on the CPU.
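
In a script, the same advice can be written as a fallback loop. The sketch below is an illustration under assumptions, not the tool's behavior: `run_with_fallback` and its `transcribe` argument are hypothetical, and `fake_transcribe` merely stands in for a real transcription call so that the example runs without a GPU. It relies on PyTorch reporting this condition as a RuntimeError whose message contains "out of memory".

```python
def run_with_fallback(transcribe, attempts):
    """Try each (model_name, device) pair until one does not run out of memory."""
    last_error = None
    for model_name, device in attempts:
        try:
            return transcribe(model_name, device)
        except RuntimeError as error:
            # Anything other than an out-of-memory error is a real failure
            if "out of memory" not in str(error).lower():
                raise
            last_error = error
    raise RuntimeError("all attempts ran out of memory") from last_error


def fake_transcribe(model_name, device):
    """Stand-in for a real transcription call: pretends the GPU is always too small."""
    if device == "cuda":
        raise RuntimeError("CUDA out of memory.")
    return f"transcribed with {model_name} on {device}"


# Largest option first, then a smaller model, then the CPU
attempts = [("medium", "cuda"), ("small", "cuda"), ("medium", "cpu")]
print(run_with_fallback(fake_transcribe, attempts))
```

The order of the attempts is a design choice: a smaller model on the GPU is usually faster, while the same model on the CPU keeps the quality but takes longer.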

Tool freezes when chatting with Assistant

The Assistant feature requires an active connection with OpenAI servers, which sometimes can be slow or unresponsive. We'll try to improve this behavior in the future.
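
One general way to keep a slow network call from blocking forever is to run it in a worker thread and stop waiting after a timeout. The sketch below shows that pattern with the standard library only. It is not how the Assistant is implemented: `call_with_timeout` is a made-up helper and `slow_reply` stands in for a request to OpenAI.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from concurrent.futures import TimeoutError as FutureTimeout


def call_with_timeout(func, timeout_seconds, *args, **kwargs):
    """Run func in a worker thread; return None if it takes too long."""
    executor = ThreadPoolExecutor(max_workers=1)
    future = executor.submit(func, *args, **kwargs)
    try:
        return future.result(timeout=timeout_seconds)
    except FutureTimeout:
        return None
    finally:
        # Do not block on the worker; it finishes (or fails) in the background
        executor.shutdown(wait=False)


def slow_reply(delay_seconds):
    """Stand-in for a slow server response."""
    time.sleep(delay_seconds)
    return "reply"


print(call_with_timeout(slow_reply, 1.0, 0.01))  # finishes in time
print(call_with_timeout(slow_reply, 0.05, 0.3))  # gives up and returns None
```

Note that the abandoned call keeps running in the background until it finishes, so this only bounds how long the caller waits.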

Please report any other issues

As mentioned, the tool is in a super raw state of development. We use it every day in our editing workflow, but some issues might escape us. Please report anything weird that you notice, and we'll look into it.

To report any issues, please use the Issues tab here on GitHub: https://github.com/octimot/StoryToolkitAI/issues

About

An editing tool that uses AI to transcribe and semantically search transcripts, integrated with ChatGPT and DaVinci Resolve Studio.

GNU General Public License v3.0
