chengyineng38's starred repositories
llms_for_good
Aligning Large Language Models to business preferences on Databricks
WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
aisys-building-blocks
Building blocks for foundation models.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
llm-numbers
Numbers every LLM developer should know
many-model-forecasting
Bootstrap your large scale forecasting solution on Databricks with Many Models Forecasting (MMF) Project.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
Deep-Learning-in-Production
In this repository, I will share some useful notes and references about deploying deep learning-based models in production.
mlops-stacks
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
Content-AWS-Certified-Data-Analytics---Speciality
DAS-C01 ACG/LA by Brock Tubre and John Hanna
ide-best-practices
Best practices for working with Databricks from an IDE
deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
python-is-cool
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
spark-nlp-workshop
Public runnable examples of using John Snow Labs' NLP for Apache Spark.