XanderJC

Cambridge

Organizations

vanderschaarlab

Alex J. Chan's repositories

scalable-birl

Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.

Language: Python, License: MIT, 41 2 9

medkit-learn

The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, and Mihaela van der Schaar.

Language: Python, License: NOASSERTION, 27 3 1

attention-based-credit

Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar.

Language: Python, License: MIT, 18 4 3

transductive-dropout

Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift (ICML 2020) by Alex J. Chan, Ahmed M. Alaa, Zhaozhi Qian, and Mihaela van der Schaar.

Language: Jupyter Notebook, License: MIT, 8 30

synthetic-model-combination

Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning (NeurIPS 2022) by Alex J. Chan and Mihaela van der Schaar.

Language: Jupyter Notebook, License: MIT, 3 10

inverse-online

Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies (ICLR 2022) by Alex J. Chan, Alicia Curth, and Mihaela van der Schaar.

Language: Python, License: MIT, 2 30

XanderJC.github.io

Personal website

Language: HTML, 1 10

AML_bayes_opt

Supporting code for the Advanced Machine Learning module, MPhil Machine Learning and Machine Intelligence

Language: Jupyter Notebook, 010

data_collection

Language: Python, License: MIT, 010

llm-articulation

Language: Jupyter Notebook, License: MIT, 010

MCMC-Project

Code for my project comparing theoretical bounds with practical convergence diagnostics in MCMC.

Language: Jupyter Notebook, License: MIT, 000

my-cookiecutter

My cookiecutter template for ML projects

Language: Python, 010

deepspeed_llama

Finetuning LLaMA with DeepSpeed

Language: Python, 000

mphil-thesis

Supplementary code for my MPhil thesis.

Language: Jupyter Notebook, License: MIT, 010

rnn-handwriting-generation

Handwriting generation by RNN with TensorFlow, based on "Generating Sequences With Recurrent Neural Networks" by Alex Graves

000

RowingManager

Language: Python, 000

trl

Train transformer language models with reinforcement learning.

Language: Python, License: Apache-2.0, 000

TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Language: Jupyter Notebook, License: Apache-2.0, 000

XanderJC

Config files for my GitHub profile.

010