Satyakama Paul's repositories
Plant_Disease_Detection
Plant Disease Detector Web Application
ai-audio-datasets-list
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
amazon-scraper
A simple web scraper to extract Product Data and Pricing from Amazon
awesome-flutter
An awesome list that curates the best Flutter libraries, tools, tutorials, articles and more.
awesome-public-datasets
A topic-centric list of high-quality open datasets.
BrainEnTech---dataset-1---funny-vs-boring-videos
This EEG dataset was created to classify funny versus boring videos over approximately 30 minutes. A subject is shown 5 minutes of a boring video followed by 5 minutes of a funny video, and this process is repeated for 3 cycles. Brain EEG is aggregated over each second, and the corresponding video class is noted.
detection-of-jewellery-accessories-with-Image-Captioning
Final degree project that provides useful tools to train and test different architectures based on Image Captioning for the classification of jewellery accessories.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
EVP
Code for paper 'Audio-Driven Emotional Video Portraits'.
ImageSimilarity
Image similarity using Autoencoder
indian-food-app-detect-biriyani-pulao-friedrice
Starter app for fastai v3 model deployment on Render
Intelligent-Workloads-at-the-Edge
Intelligent Workloads at the Edge, published by Packt
knowledge_graph_from_unstructured_text
Building a knowledge graph from input data
labelImg
LabelImg is a graphical image annotation tool for labeling object bounding boxes in images
NAB
The Numenta Anomaly Benchmark
one_class_sound_anomaly_detection_dcase2020_task2
DCASE 2020 Task 2 - Unsupervised Detection of Anomalous Sounds for Machine Condition Monitoring
sound_clustering_visualization_verOne
This repo is a Streamlit app that records audio in the browser and helps find the similarity between 2 or 3 classes of sound.
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
stable-diffusion-notebooks
AI projects in Python, mostly Jupyter notebooks.
streamlit-audio-recorder
A Streamlit app for recording audio