Mainak Ghosh (ghoshmainak)

Company: Max Planck Institute for Innovation and Competition

Home Page: https://ghoshmainak.github.io/

Twitter: @mainak7194

Mainak Ghosh's starred repositories

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++, License: Apache-2.0, Stargazers: 10158, Issues: 127, Issues: 748

covid-19-data

Data on COVID-19 (coronavirus) cases, deaths, hospitalizations, tests • All countries • Updated daily by Our World in Data

applied-methods-phd

Repo for Yale Applied Empirical Methods PhD Course

lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language: Python, License: Apache-2.0, Stargazers: 1122, Issues: 11, Issues: 82

getting-started-with-the-twitter-api-v2-for-academic-research

A course on getting started with the Twitter API v2 for academic research

Language: Python, License: Apache-2.0, Stargazers: 572, Issues: 33, Issues: 5

pysonDB

A Simple, ☁️ Lightweight, 💪 Efficient JSON-based database for 🐍 Python. PysonDB-V2 has been released ⬇️

Language: Python, License: MIT, Stargazers: 397, Issues: 9, Issues: 47

gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

Language: Python, License: Apache-2.0, Stargazers: 321, Issues: 6, Issues: 32

StataTraining

Stata & R code for course on the fundamentals of data analysis and visualization

Language: HTML, License: NOASSERTION, Stargazers: 192, Issues: 21, Issues: 2

The-Stata-Guide

Files for the Stata Guide on Medium https://medium.com/the-stata-guide

applied-microeconometrics

Course website for Ph.D. Applied Microeconometrics at the KDI School.

Language: HTML, Stargazers: 117, Issues: 4, Issues: 0

patentcity

Innovation across ages

Language: Python, License: MIT, Stargazers: 66, Issues: 4, Issues: 11

Language: Jupyter Notebook, License: GPL-3.0, Stargazers: 56, Issues: 6, Issues: 4

openeditors

Webscraping data about editors of scientific journals.

Language: R, License: CC0-1.0, Stargazers: 54, Issues: 4, Issues: 8

Stata4Econ

Reusable Stata Code from Various Projects

Language: Stata, License: MIT, Stargazers: 39, Issues: 3, Issues: 1

Language: Jupyter Notebook, License: MIT, Stargazers: 32, Issues: 3, Issues: 14

Language: Jupyter Notebook, License: MIT, Stargazers: 31, Issues: 2, Issues: 0

Language: Python, License: GPL-3.0, Stargazers: 25, Issues: 0, Issues: 0

patent_similarity_data

US utility patent similarity data creation and analysis tools

Language: Jupyter Notebook, License: MIT, Stargazers: 25, Issues: 0, Issues: 0

academic-publishers

A list of academic publishers and their scholarly journals.

Language: R, License: CC0-1.0, Stargazers: 12, Issues: 2, Issues: 1

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language: Python, License: Apache-2.0, Stargazers: 2, Issues: 0, Issues: 0

datasci-singularity

A data science environment in a singularity container

License: MIT, Stargazers: 1, Issues: 1, Issues: 0