ahmedbesbes / media-agent

Scrape data from social media and chat with it using Langchain

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Media Agent

Media Agent scrapes Twitter and Reddit submissions, summarizes them, and chats with them in an interactive terminal.


Demo

https://www.loom.com/share/f4954e7d34ef4b7b8491e2bf910e8521

Functionalities

  • Scrapes tweets/submissions on your behalf either from a list of user accounts or a list of keywords.
  • Embeds the tweets/submissions using OpenAI
  • Indexes the embeddings (i.e. vectors) in ChromaDB
  • Enriches the index with additional metadata
  • Creates a summary of the tweets/submissions and provides potential questions to answer
  • Opens a chat session on top of the tweets
  • Saves the conversation with its metadata
  • A rich terminal UI and logging features

Tools and libraries used

  • Langchain 🦜 to build and compose LLMs
  • ChromaDB to store vectors (a.k.a embeddings) and query them to build conversational bots
  • Tweepy to connect to your the Twitter API and extract Tweets and metadata
  • Praw to connect to Reddit API
  • Rich to build a cool terminal UX/UI
  • Poetry to manage dependencies

Third party services

  • OpenAI (πŸ”‘ needed)
  • Twitter (πŸ”‘ needed)
  • Reddit (πŸ”‘ needed)

Run the app locally

  • Install dependencies with poetry
poetry install --with dev
  • Add API credentials

Create .env file at the root of the project with the following keys:

OPENAI_API_KEY=<OPENAI KEY>
TWITTER_BEARER_TOKEN=<TWITTER BEARER TOKEN>
REDDIT_API_CLIENT_ID=<REDDIT_API_CLIENT_ID>
REDDIT_API_SECRET=<REDDIT_API_SECRET>
REDDIT_USER_AGENT=<REDDIT_USER_AGENT>

More info on these credentials here, here and here

  • Run the app
make run-media-agent

Star History

Star History Chart

Future features:

This is an on-going project, so feel free to contribute:

Here's what I plan in the near future:

  • Add more data sources: substack, press, LinkedIN
  • support open-source LLMs (Falcon?)
  • support Pinecone in addition to ChromaDB
  • add instructions to deployment on a cloud infrastructure
  • improve the prompts to have a more engaging/enriching conversations
  • add actions to open URLs and fetch content from it

About

Scrape data from social media and chat with it using Langchain


Languages

Language:Python 99.8%Language:Makefile 0.2%