daniel-furman

The blockCV package creates spatially or environmentally separated training and testing folds for cross-validation, providing robust error estimation in spatially structured environments. See

cellpose-training-kaggle-data21

Data for the Sartorius Cell Instance Segmentation competition with cellpose transforms.

Language: Python

computational-mathematics

A collection of MATLAB scripts from undergraduate math classes. Focus on numerical methods and computational linear algebra.

Language: MATLAB | License: MIT

dash-rq-demo

Long-running tasks in Dash using RQ.

License: MIT

DS-case-prep

Data science case interview questions with framework-based answers. Questions are sourced from various resources, primarily FAANG companies.

Language: Python

lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

Language: Python | License: MIT

online-dating-field-experiment

Final project for INFO 241 at UC Berkeley, Spring 2022.

Language: Jupyter Notebook

Random-recipes

A variety of ML utils across different clouds and frameworks.

Language: Jupyter Notebook

RandomDS

A quasi-random collection of DS code.

Language: Jupyter Notebook

scattertext

Beautiful visualizations of how language differs among document types.

License: Apache-2.0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0

Stanford_Penn_MIDRC_Deidentifier

A deidentification pipeline developed by Stanford and Penn as part of the MIDRC organization.

License: Apache-2.0

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language: Python | License: MIT

transformers_llama

Code and models for BERT on STILTs

Language: Python | License: Apache-2.0

typio
