mbalesni

followers

following

stars

London, UK

https://www.mikitabalesni.com

Mikita Balesni's repositories

openpilot-pipeline

Training pipeline for end-to-end self-driving with Comma AI's Openpilot. WIP

Language:Jupyter Notebook98 8 17

deepspeed_llama

Finetuning LLaMA with DeepSpeed

Language:Python1000

self-attention-rl

Re-implementation of an RL + Transformer paper: https://arxiv.org/abs/1907.08027

Language:Jupyter Notebook2 20

gpt-honest-articulation

Exploring GPT-3 ability to articulate its knowledge

Language:Jupyter Notebook1 20

react-election-registration

A simple election check-in app for use by students in university elections.

Language:JavaScript1 10

mbalesni.github.io

Language:HTMLMIT000

tgnews

My submission to Telegram Data Clustering contest (ranked 5th/122, team of 2)

Language:Python010

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonApache-2.0000

ai-safety-paper-notes

Summaries, notes and questions on AI safety research papers.

000

anthropic-hack-23

Language:Python000

ARENA_2.0

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

Language:HTML000

BIG-Bench-Hard

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

MIT000

censored-cognition

Language:Python000

DeepTraffic

Deep Learning models for network traffic classification

MPL-2.0000

diia-integration

000

ebm-driving

Language:Jupyter NotebookApache-2.0000

g-in-llms

Language:Python000

grok

Language:PythonMIT000

ibc

(Fork of) Official implementation of Implicit Behavioral Cloning, as described in our CoRL 2021 paper, see more at https://implicitbc.github.io/

Language:PythonApache-2.0000

iphone-checker

Language:Python000

llm-security-challenge

Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments

Apache-2.0000

mats-3-aligning-lms

A common repo of the MATS 3.0 stream on Aligning Language Models

Language:Jupyter Notebook000

onnx2pytorch

Transform ONNX model to PyTorch representation

Language:PythonApache-2.0000

posters

000

presentations

000

setup-python

Set up your GitHub Actions workflow with a specific version of Python [ALWAYS CACHE]

Language:TypeScriptMIT000

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Apache-2.0000

TCP

[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.

Apache-2.0000

vote-verification

Web app with a custom anonymous & secure voting verification protocol.

Language:JavaScript000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT000