Mikita Balesni's repositories
openpilot-pipeline
Training pipeline for end-to-end self-driving with Comma AI's Openpilot. WIP
deepspeed_llama
Finetuning LLaMA with DeepSpeed
self-attention-rl
Re-implementation of an RL + Transformer paper: https://arxiv.org/abs/1907.08027
gpt-honest-articulation
Exploring GPT-3 ability to articulate its knowledge
react-election-registration
A simple election check-in app for use by students in university elections.
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
ai-safety-paper-notes
Summaries, notes and questions on AI safety research papers.
ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
DeepTraffic
Deep Learning models for network traffic classification
ibc
(Fork of) Official implementation of Implicit Behavioral Cloning, as described in our CoRL 2021 paper, see more at https://implicitbc.github.io/
llm-security-challenge
Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments
mats-3-aligning-lms
A common repo of the MATS 3.0 stream on Aligning Language Models
onnx2pytorch
Transform ONNX model to PyTorch representation
setup-python
Set up your GitHub Actions workflow with a specific version of Python [ALWAYS CACHE]
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
vote-verification
Web app with a custom anonymous & secure voting verification protocol.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision