Sonu Gupta's repositories

tosdr-terms-of-service-corpus

This repository contains python code to create a corpus of 12,215 terms of service documents scraped from TOSDR, intended for legal, privacy, and natural language processing research.

Language:HTMLStargazers:4Issues:2Issues:0

Doxing-on-Twitter

This repository contains my work on the prevention and anonymization of dox content on Twitter. It contains python code and demo of the proposed solution.

Language:Jupyter NotebookLicense:MITStargazers:2Issues:1Issues:0

British-Airway-Virtual-Internship

This repository contains solutions to the 2 different tasks that must be performed during the data science virtual internship provided by British Airways via Forage.

Language:Jupyter NotebookStargazers:1Issues:2Issues:0

cracking-the-data-science-interview

A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

Synthetic-financial-data

This repository contains python code used to create synthetic data samples of minority class for a financial dataset. It also contains a sample of generated synthetic data.

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

Time-Series-Analysis-and-Anomaly-Detection

This repository contains code to perform EDA, outlier detection and forcasting on a multivariate time series.

Language:Jupyter NotebookStargazers:1Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

GPI--corpus

A corpus of privacy laws, regulations, and guidelines used in our paper "Creation and Analysis of an International Corpus of Privacy Laws".

Stargazers:0Issues:1Issues:0

langchain-chat-with-txt-files

Learning and building LLM application using Langchain 🦜🔗 and Open AI

Language:PythonStargazers:0Issues:0Issues:0

MalReG

The repository contains scripts and the annotated dataset used in our paper "MalReG: Detecting and Analyzing Malicious Retweeter Groups" (accepted at CoDS-COMAD 2019).

Language:PythonStargazers:0Issues:1Issues:0

neosemantics

Graph+Semantics: Import/Export RDF from Neo4j. Model mapping, inferencing and more.... If you like it, please ★ ⇧

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PrivacyQA

Unofficial model implementations for the PrivacyQA benchmark (https://github.com/AbhilashaRavichander/PrivacyQA_EMNLP)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

sonu-gupta.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

ThinkStats2

Text and supporting code for Think Stats, 2nd Edition

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:0Issues:1Issues:0

word_cloud

A little word cloud generator in Python

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Zomatopy

A Python wrapper for the Zomato API.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0