Mario Cho's repositories
deeplearning-note
Deep learning notes: tutorials, documentation, code links, etc.
DatabaseConnector
An R package for connecting to databases using JDBC.
datasets-hu
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
FinanceDataReader
Financial data reader
google-research
Google AI Research
metavoice-src
Foundational model for human-like, expressive TTS
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
official-images
Primary source of truth for the Docker "Official Images" program
pgvector
Open-source vector similarity search for Postgres
pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
Qwen1.5
Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
ROCm-docker
Dockerfiles for the various software layers defined in the Radeon Open Compute Platform
slurm-k8s-cluster
A Slurm cluster for Kubernetes
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
torch-xla-SPMD
PyTorch/XLA SPMD test code for Google TPU