zulfiqar-ali01 / audio-visual-Transcription

Real-Time Audio-visual Speech Recongition

Home Page:https://github.com/zulfiqarAlibalti/audio-visual-Transcription

Repository from Github https://github.comzulfiqar-ali01/audio-visual-TranscriptionRepository from Github https://github.comzulfiqar-ali01/audio-visual-Transcription


Project Banner
python flask bootstrap html5 css3 pytorch

Video and Audio Transcription Web App

This web application provides video and audio transcription services using advanced AI models. Users can upload files or provide URLs to obtain transcriptions, making it an essential tool for content creators, researchers, and more.
  1. πŸ“„ Introduction
  2. βš™οΈ Tech Stack
  3. πŸ”‹ Features
  4. πŸš€ Usage
  5. πŸ› οΈ Installation
  6. πŸ“ž Contact

🚨 About

This project is a web application designed to transcribe video and audio files. Users can upload files directly or provide URLs for online content. Birds Image Classification.

This project is a web application designed to transcribe video and audio files. Users can upload files directly or provide URLs for online content. The application leverages powerful AI models to accurately transcribe the content, supporting various languages and models.

  • Python
  • Flask
  • Bootstrap
  • HTML
  • CSS
  • PyTorch

πŸ‘‰ File Upload and URL Input: Users can upload video or audio files or provide URLs for transcription.

πŸ‘‰ Language and Model Selection: Supports multiple languages and models for transcription, offering flexibility to the users.

πŸ‘‰ Real-Time Transcription: Provides transcription results in real-time, ensuring efficient processing.

πŸ‘‰ Transcription Status: Displays the status of the transcription process, keeping users informed.

πŸ‘‰ Flash Messages: Uses flash messages to provide feedback on the upload process and transcription status.

  1. Home Page:

    • Upload a file or provide a URL.
    • Select the desired language and model.
    • Click on the "Transcribe" button.
  2. Transcription:

    • View the transcription status.
    • Once completed, the transcribed text will be displayed.
  3. Audio Transcription:

    • Navigate to the audio transcription page.
    • Upload an audio file, select language and model, and click "Transcribe".
  1. Clone the repository: bash git clone https://github.com/yourusername/transcription-webapp.git

  2. Install dependencies: bash pip install -r requirements.txt

  3. Run the application: bash python app.py

  4. Navigate to the application: Open your web browser and go to http://127.0.0.1:5000/.

For any inquiries or issues and requires remaining code of frontend, please contact:

Enjoy a seamless transcription experience with our web app, where advanced AI meets user-friendly design.

About

Real-Time Audio-visual Speech Recongition

https://github.com/zulfiqarAlibalti/audio-visual-Transcription


Languages

Language:Python 33.3%Language:HTML 29.7%Language:JavaScript 29.5%Language:CSS 7.5%