Papyrus-Podcasts

Ad-free interactive video podcasting platform. (Open source)

MVP Features

Ad-free articles and podcasts to satisfy universal daily curiosity.
Micro-blogging of books: voice book notes, abstract thoughts, and mindful quotes in a shorter audio format called audio flips (140-second podcasts).
Social collaboration among editors, writers, and narrators.
Real-time content sequence editing, with AI-based feed recommendations that push quality content to our listeners.
Interactive video podcast-based social network for sharing insights from books, documentaries, research papers, video lectures, and articles.

Primary Tech Stack

React (JavaScript library) for our web solutions
React Native for Android
Redux for state management
Firebase Serverless Architecture
Firebase Cloud Functions for Serverless Compute.
Algolia Indexing for user search functionalities.
Hosted NoSQL solution Cloud Firestore Database.
Firebase Authentication (Security Rules)
PyTorch and Keras for training a generative audio model
Agora SDK integration for group live calls.

TIMELINE GENERATION ARCHITECTURE

What if we improve performance with an in-memory cache?

Keep a follow feed list for every active user in the cache. When userX wants their follow feed, fetch it from the cache rather than the db, e.g. {userX: [py0, py1, py2 …]}, where py0 and py2 were posted by userY1 and py1 was posted by userY2.
When userY publishes a post, find all active followers of userY and append the post to the head of each follower’s follow feed in the cache.
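The two steps above amount to fan-out on write. The following is a minimal sketch of that idea, assuming plain in-memory dictionaries in place of a real cache; the names followers, active_users, follow_feed_cache, and the helper functions are hypothetical and not taken from the project.

```python
from collections import defaultdict, deque

# Hypothetical in-memory stand-ins; a real deployment would use a cache service.
followers = defaultdict(set)   # author_id -> set of follower ids
active_users = set()           # users whose follow feed is kept in cache
follow_feed_cache = {}         # user_id -> deque of post ids, newest first

def follow(follower_id, author_id):
    followers[author_id].add(follower_id)

def activate(user_id):
    """Mark a user as active and make sure they have a cached feed."""
    active_users.add(user_id)
    follow_feed_cache.setdefault(user_id, deque())

def fan_out_post(author_id, post_id):
    """Push a new post to the head of every active follower's cached feed."""
    for follower_id in followers[author_id]:
        if follower_id in active_users:
            follow_feed_cache[follower_id].appendleft(post_id)

def get_follow_feed(user_id):
    """Serve the feed from cache; None signals a miss (fall back to the db)."""
    feed = follow_feed_cache.get(user_id)
    return list(feed) if feed is not None else None

# Reproduce the example above: py0 and py2 by userY1, py1 by userY2.
activate("userX")
follow("userX", "userY1")
follow("userX", "userY2")
fan_out_post("userY1", "py2")   # oldest post
fan_out_post("userY2", "py1")
fan_out_post("userY1", "py0")   # newest post ends up at the head
print(get_follow_feed("userX"))
```

Reads stay cheap because the work is done once per post at write time; the cost is one cache update per active follower.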

Cache eviction: the most inactive user’s follow feed is removed when the cache is full.
Cache writing: happens after db writes, so that the db is always the source of truth.
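One way to read these two rules is as a least-recently-used cache that is only updated once the db write has completed. The sketch below assumes "most inactive" means "least recently read", and uses a plain list as a stand-in for the db; FollowFeedCache and save_post are hypothetical names.

```python
from collections import OrderedDict

class FollowFeedCache:
    """Bounded cache that evicts the least recently active user's feed."""

    def __init__(self, capacity):
        self.capacity = capacity
        self._feeds = OrderedDict()   # user_id -> list of post ids, newest first

    def get(self, user_id):
        feed = self._feeds.get(user_id)
        if feed is not None:
            self._feeds.move_to_end(user_id)   # a read counts as activity
        return feed

    def put(self, user_id, feed):
        self._feeds[user_id] = feed
        self._feeds.move_to_end(user_id)
        while len(self._feeds) > self.capacity:
            self._feeds.popitem(last=False)    # drop the most inactive user

    def prepend(self, user_id, post_id):
        """Add a post to a cached feed without counting it as user activity."""
        feed = self._feeds.get(user_id)
        if feed is not None:
            feed.insert(0, post_id)

    def __contains__(self, user_id):
        return user_id in self._feeds


def save_post(db, cache, author_id, post_id, follower_ids):
    """Write to the db first, then update the cache, so the db stays authoritative."""
    db.append((author_id, post_id))        # 1. durable write (list as db stand-in)
    for follower_id in follower_ids:       # 2. cache update only after step 1
        cache.prepend(follower_id, post_id)
```

If the cache update fails or a feed has been evicted, nothing is lost: the feed can be rebuilt from the db on the next read.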

What if userZ is a celebrity with 10 million active followers? Would a single post by userZ have to be copied 10 million times in the cache? No, let’s keep those superstars in a separate db table and cache.

Celebrities are stored in a separate table from the user table.
Celebrity profile feeds are stored in a cache table separate from the follow feed cache table, e.g. {userZ: [pz0, pz1, pz2, …]}, where pz0, pz1, pz2 are all posts by userZ.
Celebrity posts are first written to the db and then appended to the head of the celebrity feed in the cache, the same as posts by normal users.
When userX requests their follow feed, find all celebrities that userX follows, fetch those celebrity feeds from the cache, and merge them into userX’s existing follow feed.
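The read-time merge could look like the sketch below. It assumes every cached post carries a timestamp and that each feed is kept newest first, neither of which is specified above; the cache contents, celebrities_followed, and get_merged_feed are hypothetical illustrations.

```python
import heapq
from itertools import islice

# Posts are (timestamp, post_id) tuples; every feed is kept newest first.
follow_feed_cache = {"userX": [(50, "py0"), (30, "py1"), (10, "py2")]}
celebrity_feed_cache = {"userZ": [(60, "pz0"), (40, "pz1"), (20, "pz2")]}
celebrities_followed = {"userX": ["userZ"]}

def get_merged_feed(user_id, limit=20):
    """Merge the user's follow feed with the feeds of celebrities they follow."""
    feeds = [follow_feed_cache.get(user_id, [])]
    for celeb_id in celebrities_followed.get(user_id, []):
        feeds.append(celebrity_feed_cache.get(celeb_id, []))
    # heapq.merge streams already-sorted inputs; reverse=True expects
    # each input to be sorted from newest to oldest.
    merged = heapq.merge(*feeds, key=lambda post: post[0], reverse=True)
    return [post_id for _, post_id in islice(merged, limit)]

print(get_merged_feed("userX"))
```

A celebrity post is stored once and merged at read time, so its cost scales with the number of celebrities a reader follows rather than with the celebrity's follower count.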

1. create_post(user_id, image, text, timestamp) -> success/failure
Assume every post has an image and optional text.
2. comment_post(user_id, post_id, comment, timestamp) -> success/failure
Assume you can comment on a post, but not on another comment.
3. like_post(user_id, post_id, timestamp) -> success/failure
Assume you can like a post, but not a comment.
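The three calls and their stated assumptions can be sketched with in-memory dictionaries as follows. The signatures come from the list above; the storage layout, the id format, and the choice to reject a duplicate like are assumptions made only for this illustration.

```python
import itertools

posts = {}       # post_id -> post record
comments = {}    # comment_id -> comment record
likes = {}       # post_id -> {user_id: timestamp}
_post_ids = itertools.count(1)
_comment_ids = itertools.count(1)

def create_post(user_id, image, text, timestamp):
    """Every post needs an image; text may be None."""
    if not image:
        return False
    post_id = f"post-{next(_post_ids)}"
    posts[post_id] = {"user_id": user_id, "image": image,
                      "text": text, "timestamp": timestamp}
    likes[post_id] = {}
    return True

def comment_post(user_id, post_id, comment, timestamp):
    """Comments attach only to posts, never to other comments."""
    if post_id not in posts or not comment:
        return False
    comment_id = f"comment-{next(_comment_ids)}"
    comments[comment_id] = {"user_id": user_id, "post_id": post_id,
                            "comment": comment, "timestamp": timestamp}
    return True

def like_post(user_id, post_id, timestamp):
    """Only posts can be liked; a repeated like is rejected (assumption)."""
    if post_id not in posts or user_id in likes[post_id]:
        return False
    likes[post_id][user_id] = timestamp
    return True
```

Because comment ids never appear in posts, passing a comment id to comment_post or like_post returns failure, which enforces the "posts only" rule without extra checks.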

FURTHER SCALING

Message broker service
Data partitioning
Round-robin load balancer
Clickstream analysis
Google Analytics dashboard
Demo link
Data model
System API specifications
Client-side and server-side security measures
Capacity estimation
Payment integration (Stripe API)

punyaslokdutta / Papyrus-Podcasts
