Ansh Radhakrishnan's repositories
trl_custom
Applying Reinforcement Learning from Human Feedback (RLHF) to language models to teach them to write short-story responses to writing prompts.
Dalle-Mini-RL
Fine-tuning DALL·E Mini with RL so that it does not produce NSFW images
dalle-mini
DALL·E Mini - Generate images from a text prompt
deep_learning_curriculum
A deep learning curriculum focused on language model alignment
elk
Keeping language models honest by directly eliciting knowledge encoded in their activations. Building on "Discovering latent knowledge in language models without supervision" (Burns et al. 2022)
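The core of the CCS method from Burns et al. 2022 is an unsupervised objective over probe outputs for paired positive/negative phrasings of the same statement. A minimal numpy sketch of that objective (illustrative only; `ccs_loss` is not the elk codebase's API):

```python
import numpy as np

def ccs_loss(p_pos: np.ndarray, p_neg: np.ndarray) -> float:
    """Consistency objective from Burns et al. 2022 (CCS).

    p_pos: probe outputs in [0, 1] for positively phrased statements.
    p_neg: probe outputs for the same statements phrased negatively.
    """
    # A truthful probe should satisfy p_pos ≈ 1 - p_neg ...
    consistency = (p_pos - (1.0 - p_neg)) ** 2
    # ... without collapsing to the degenerate p_pos = p_neg = 0.5.
    confidence = np.minimum(p_pos, p_neg) ** 2
    return float(np.mean(consistency + confidence))
```

A perfectly consistent, confident probe (`p_pos = 1`, `p_neg = 0`) achieves zero loss; the confidence term rules out the trivial always-0.5 solution.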
equinox
Callable PyTrees and filtered transforms => neural networks in JAX. https://docs.kidger.site/equinox/
fancy_einsum
Einsum with einops style variable names
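The idea is to replace einsum's terse single-letter axis labels with readable einops-style names. A minimal sketch using `numpy.einsum` for comparison (the commented `fancy_einsum` call shows the equivalent named-axis spelling):

```python
import numpy as np

batch, seq, hidden, out = 4, 8, 16, 10
x = np.random.rand(batch, seq, hidden)
w = np.random.rand(hidden, out)

# Standard einsum: "bsh,ho->bso" is terse but cryptic.
y = np.einsum("bsh,ho->bso", x, w)

# fancy_einsum spells out the same contraction with axis names:
#   from fancy_einsum import einsum
#   y = einsum("batch seq hidden, hidden out -> batch seq out", x, w)

assert y.shape == (batch, seq, out)
```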
flax_minimal_gpt
A minimal implementation of a GPT-style transformer in Flax, written mostly for learning purposes.
littlebookofsemaphores
Python solutions to puzzles in The Little Book of Semaphores
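As a flavor of the puzzles: the book's opening "rendezvous" problem asks that neither of two threads proceed past a point until the other has arrived. A self-contained solution with `threading.Semaphore`:

```python
import threading

# Rendezvous: each thread signals its own arrival, then waits
# for the other's signal before continuing.
a_arrived = threading.Semaphore(0)
b_arrived = threading.Semaphore(0)
log = []

def thread_a():
    log.append("a1")
    a_arrived.release()   # signal: A has reached the rendezvous
    b_arrived.acquire()   # wait for B
    log.append("a2")

def thread_b():
    log.append("b1")
    b_arrived.release()   # signal: B has reached the rendezvous
    a_arrived.acquire()   # wait for A
    log.append("b2")

ta = threading.Thread(target=thread_a)
tb = threading.Thread(target=thread_b)
ta.start(); tb.start()
ta.join(); tb.join()

# In every interleaving, both "1" lines precede both "2" lines.
assert set(log[:2]) == {"a1", "b1"} and set(log[2:]) == {"a2", "b2"}
```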
ml-interviews-book-answers
Answers to https://huyenchip.com/ml-interviews-book/
mlab2_pre_exercises
Pre-course exercises for the August 2022 MLAB cohort.
Module-0
Module 0 - Fundamentals
Module-1
Module 1 - Autodifferentiation
Module-2
Module 2 - Tensors
Module-3
Module 3 - Efficiency
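Module 1's topic, autodifferentiation, can be illustrated with a minimal scalar reverse-mode sketch in plain Python (the `Value` class here is illustrative, not the module's actual API):

```python
class Value:
    """A scalar that tracks its gradient via reverse-mode autodiff."""
    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._children = _children
        self._backward = lambda: None

    def __add__(self, other):
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def __mul__(self, other):
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def backward(self):
        # Topologically order the graph, then apply the chain rule in reverse.
        order, seen = [], set()
        def visit(v):
            if v not in seen:
                seen.add(v)
                for c in v._children:
                    visit(c)
                order.append(v)
        visit(self)
        self.grad = 1.0
        for v in reversed(order):
            v._backward()

x = Value(3.0)
y = Value(4.0)
z = x * y + x      # dz/dx = y + 1 = 5, dz/dy = x = 3
z.backward()
```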
numpy-100
100 numpy exercises (with solutions)
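A few exercises from the collection, to give a sense of its scope:

```python
import numpy as np

# Exercise: reverse a vector (first element becomes last).
z = np.arange(10)
z_rev = z[::-1]

# Exercise: create a 3x3 matrix with values ranging from 0 to 8.
m = np.arange(9).reshape(3, 3)

# Exercise: find indices of non-zero elements of [1, 2, 0, 0, 4, 0].
nz = np.nonzero([1, 2, 0, 0, 4, 0])[0]
```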
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Sorting-Transformer-Interp
A mechanistic interpretability project analyzing how a simple transformer learns to sort a sequence of 10 digits.