Chen188 / bedrock-multimodal-agent

Multimodal Agent based on Bedrock, AST(Transcribe), TTS(Polly)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Multimodal Agent with LLM(Bedrock), AST(Transcribe), TTS(Polly)

Forked from Amazon Bedrock Voice Conversation and refine bugs, update according to Bedrock/Transcribe/Polly's new features.

How to run APP:

    python install -r ./requirements.txt
    export AWS_ACCESS_KEY_ID=<...>
    export AWS_SECRET_ACCESS_KEY=<...>
    export AWS_DEFAULT_REGION=<...> # Optional, defaults to us-east-1 
    export MODEL_ID=<...># Optional, defaults to Claude3 Haiku
    python ./app.py

About

Multimodal Agent based on Bedrock, AST(Transcribe), TTS(Polly)

License:MIT No Attribution


Languages

Language:Python 100.0%