Kanishka (kanishkamisra)

kanishkamisra

Geek Repo

Location:UT Austin

Home Page:https://kanishka.website

Twitter:@kanishkamisra

Github PK Tool:Github PK Tool

Kanishka's starred repositories

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:7912Issues:79Issues:25
Language:PythonLicense:Apache-2.0Stargazers:2483Issues:39Issues:131

awesome-agi-cocosci

An awesome & curated list for Artificial General Intelligence, an emerging inter-discipline field that combines artificial intelligence and computational cognitive sciences.

Language:TeXLicense:CC0-1.0Stargazers:247Issues:13Issues:0

archive-CCM-site

NYU PSYCH-GA 3405.002 / DS-GS 3001.006 : Computational cognitive modeling

Language:Jupyter NotebookStargazers:202Issues:26Issues:1

lm-debugger

The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.

Language:PythonLicense:Apache-2.0Stargazers:151Issues:9Issues:6

REMEDI

Inspecting and Editing Knowledge Representations in Language Models

Language:PythonLicense:MITStargazers:97Issues:2Issues:2

MLC-ML

Applying Behaviorally-Informed Meta-Learning (BIML) to machine learning benchmarks

Language:PythonLicense:MITStargazers:46Issues:5Issues:0

GSM-IC

Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant sentences in problem descriptions. GSM-IC is constructed to evaluate the distractibility of language models.

verbphysics

Maxwell Forbes & Yejin Choi — ACL 2017

Language:PythonLicense:MITStargazers:26Issues:3Issues:0
Language:Jupyter NotebookStargazers:20Issues:2Issues:0

NLPResearchScaffolding

Scaffold for NLP researcher to quickly set up the codebase

Language:PythonLicense:MITStargazers:14Issues:0Issues:0

semantic_features_gpt_3

Code and data from semantic feature generation with GPT-3

Language:Jupyter NotebookStargazers:13Issues:1Issues:0

ryanize-bib

Ryanize .bib file

AOCHILDES

Python API for loading language data from American-English CHILDES database

Language:ShellLicense:MITStargazers:10Issues:0Issues:0

better-mlm-scoring

[Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring

Language:Jupyter NotebookStargazers:9Issues:1Issues:0

comps

Conceptual Minimal Pairs

Language:RLicense:NOASSERTIONStargazers:8Issues:3Issues:0

QAQA

Repository for the paper (QA)^2: Question Answering with Questionable Assumptions

License:Apache-2.0Stargazers:8Issues:2Issues:0

raven

RAting VErbal Novelty

Language:ShellLicense:MITStargazers:6Issues:2Issues:0

semantic-projection

Python implementation of semantic projection from Grand et al. (2022)

Language:Jupyter NotebookStargazers:5Issues:0Issues:0

features_in_context

Predict psycholoinguistic feature norms for words in context.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4Issues:5Issues:1

multilingual-lm-intervention

Multilingual causal mediation analysis

Language:Jupyter NotebookLicense:MITStargazers:2Issues:0Issues:0

PyTorchNN

Walkthrough of building simple neural networks with PyTorch

Language:Jupyter NotebookStargazers:2Issues:1Issues:0

HW2-ngrams

ngram language modeling and naive bayes classification

Language:PythonStargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0

PublicModelsAPI

Evaluating Neural Language Models for Linguistic Knowledge

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

raven-data

Data from the RAVEN project

Language:PythonLicense:MITStargazers:1Issues:0Issues:0