Taaccoo's repositories
Depression-Detection-Through-Multi-Modal-Data
Conventionally depression detection was done through extensive clinical interviews, wherein the subject’s re- sponses are studied by the psychologist to determine his/her mental state. In our model, we try to imbibe this approach by fusing the 3 modalities i.e. word context, audio, and video and predict an output regarding the mental health of the patient. The output is divided into a binary yes/no denoting whether the patient has symptoms of depression. We’ve built a deep learning model that fuses these 3 modalities, assigning them appropriate weights, and thus gives an output.
a-PyTorch-Tutorial-to-Object-Detection
SSD: Single Shot MultiBox Detector | a PyTorch Tutorial to Object Detection
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
awesome
😎 Awesome lists about all kinds of interesting topics
Awesome-Incremental-Learning
Awesome Incremental Learning
awesome-nlp
:book: A curated list of resources dedicated to Natural Language Processing (NLP)
Awesome-Visual-Captioning
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
basic_vqa
Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Cnblogs-Theme-SimpleMemory
🍭 Cnblogs theme _ Basic theme : SimpleMemory
course-nlp
A Code-First Introduction to NLP course
darknet
Convolutional Neural Networks
detectron2
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
easy-VQA
The Easy Visual Question Answering dataset.
EfficientDet.Pytorch
Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch
fairseq-image-captioning
Transformer-based image captioning extension of pytorch/fairseq
git-tips
:trollface:Git的奇技淫巧
grid-feats-vqa
Grid features pre-training code for visual question answering
keras-ncp
Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence
octotree
GitHub on steroids
openvqa
A lightweight, scalable, and general framework for visual question answering (VQA) research
Oscar
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
pythia
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
UNITER
Research code for "UNITER: Learning UNiversal Image-TExt Representations"