Yohan's repositories
sharechat-scraper
This repository contains code for scraping publicly available data from targeted content tags on the Indian social network https://sharechat.com/
awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
help_project
Central repository for H.E.L.P. project. Project details at https://quant-quest.com/landingPage/helpproject
testpatrika
testpatrika
text-generation-inference
Large Language Model Text Generation Inference
cookbook
Open-source AI cookbook
deep_learning_curriculum
Language model alignment-focused deep learning curriculum
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
IndicLLMSuite
A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
LaVague
Text2Action AI to automate browser interaction
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
metaseq
Repo for external large-scale work
NeMo
NeMo: a framework for generative AI
NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
plurality
Root repository for ⿻數位 Plurality: The Future of Collaborative Technology and Democracy by E. Glen Weyl, Audrey Tang and the Plurality Community
PyRIT
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.
task-standard
METR Task Standard
tesseract
Tesseract Open Source OCR Engine (main repository)
Text-Steganography-Benchmark
Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.