Iacopo Ghinassi's repositories
DeepTiling
A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive summarization and semantic search applications built on top of it.
MultiModalSA
MultiModal Sentiment Analysis architectures for CMU-MOSEI.
VQ-VAE_Topic
An implementation of the paper [Vector-Quantization-Based Topic Modeling](https://dl.acm.org/doi/10.1145/3450946), providing a series of VQ-VAE models for topic modelling. The model reaches state-of-the-art performance on Ng20 and enables the extraction of dense topic vectors for downstream tasks.
Language-Modelling-with-RNNs
A simple series of programs to train gated recurrent neural networks with PyTorch and generate text based on them.
ARP_Score
Average Relative Proximity metrics and experiments used in the paper "When Cohesion Lies in the Embedding Space: New Framework and Methodologies for Embedding-Based Reference-Free Metrics for Topic Segmentation".
Audio-Topic-Segmentation
Repository for the paper "Exploring pre-trained Audio Neural Representations for Audio Topic Segmentation"
bad-boids
A deliberately badly programmed implementation of Boids for teaching
Coursera_Capstone
Repository for the data science specialisation by IBM on Coursera
demorepo
It's a demo
DigitRecogniser
A very, very basic digit recogniser and gaussian calculators functions with basic Python
FrequencyApp
Shiny app to discover and visualise the occurrences of words and/or word-sets (i.e. dictionaries) in given txt files (up to 5)
git-is-great
RSE git Module
git-is-great-1
RSE Git Module
latin-bert-ise-wsd
Using Latin BERT for large scale word sense disambiguation on ISE corpus
Latin-ISE-WSD
A large scale automatic analysis of selected lemmas sense change across centuries based on the Latin-ISE corpus and the original BERT-based word sense disambiguation system by Bamman et al. (2020)
MultimodalTopicSegmentation
Repository implementing multimodal topic segmentation in the embedding space as described in the paper Multimodal Topic Segmentation with pre-trained Neural Encoders.
NSE-TopicSegmentation
A repository including a variety of neural architectures for supervised topic segmentation
SemanticEgoNetwork
codes to perform exploratory semantic network analysis on one concept of interest
SemanticNetworkVizR
codes to perform semantic network analysis on multiple concepts (defined as multiple words-set, i.e. dictionaries) across multiple texts with R
TextGeneration
Qui c'è la ricetta di base per addestrare un nuovo modello neurale di generazione di testo partendo da testi arbitrari.