There are 2 repositories under assemblyai topic.
QuickDigest AI facilitates seamless interaction with various data formats, real-time web search, and creative image generation for advertising
Build an Audio AI App with Python and AssemblyAI Course
The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.
DiscordNPC lets you interact with ChatGPT through a Discord voice channel, enabling a natural conversation.
Retrieval Augmented Generation (RAG) on audio data with LangChain
Record voice, transcribe a prompt, picturize the prompt, create variations, get description of a celebrity and upload, other use cases on KB
The AssemblyAI Java SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.
TagGPT: A simple ChatGPT based multimodal dialog generation engine that can "see/draw" and "hear/speak"
Transcribe audio using AssemblyAI with Semantic Kernel plugins.
The AssemblyAI Ruby SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.
The AtlasVoice project aims to assist psychotherapist doctors by introducing a bot assistant and transcription generation. This initiative is designed to minimize the time spent on recorded sessions, allowing professionals to gain valuable insights into their patient interactions more efficiently.
Transcribe audio on Cloudflare Workers with AssemblyAI, Node.js, and TypeScript
The OpenAPI spec, AsyncAPI spec, and Postman collection for AssemblyAI's APIs
Transform podcast listening with our Podcast Summarizer Project! This innovative tool transcribes audio, extracts key content, and provides user-friendly summaries. The project utilizes AssemblyAI and Listen Notes APIs for transcription and episode details. Simply input an episode ID, click "Download Episode Summary," and experience podcast content
A basic web-app for image classification using Streamlit and TensorFlow.
this is voice-to-text script using assemblyai free api
this is voice chat bot application
Python-based system designed to transcribe audio files, split the transcripts into manageable chunks, create text embeddings using HuggingFace models, and employ advanced question-answering models for retrieval-based QA.
Speech recognition bot that is powered by Assembly AI.
In this project user will first login after that user can paste the YouTube link and generate blog for that YouTube video with YouTube title and link. Also user can see all the blogs generated by them. I tried to make user-interface simple and smooth for great user experience.
Transcripted Audio From Video With Chat Using LLM
Takes audio of poetry, visually imagines it in a video.
Transcribes audio WAV files or audio input from your microphone using AssemblyAI's transcription API. It prints the transcription in the terminal and saves it in a text file.
This repository contains a speech-to-text audio converter in English, using the AssemblyAI API.
Imagine an application that autonomously take down notes for you during meetings, lectures, and conversations. Check this out...
Lumina 🧠🎓 | Turn educational video lectures into engaging MCQ based quizzes and automatic assessment. Learning has never been this fun!
Server-side repo for a transcription app that converts user inputted audio to text using a machine learning API.
Python Speech-To-Text projects using AssemblyAI API
RealTimeTranscriber is an application that leverages the AssemblyAI platform to perform real-time transcription of audio input.
A webapp project for lablab.ai hackathon
VirtuAI Helper is a Python AI program that executes scripts based on user input, converses using OpenAI’s GPT-3, controls multimedia, navigates websites, and accepts text/voice inputs. It integrates VoiceVox Engine, a Japanese text-to-speech software with over 60 text-to-speech models you can choose.