Xiangrui Meng's starred repositories
pyspark-ai
English SDK for Apache Spark
llm-numbers
Numbers every LLM developer should know
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
terraform-provider-databricks
Databricks Terraform Provider
ML-Engineering
Reference code base for ML Engineering, Manning Publications
joblib-spark
Joblib Apache Spark Backend
IncrementalMoments.jl
Julia package to computes statistics on streams of data
StratifiedRandomSampling
Spark Exercise
intellij-jsonnet
Intellij Jsonnet Plugin
spark-salesforce
Spark data source for Salesforce
cloud-custodian
Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resources
databricks-cli
(Legacy) Command Line Interface for Databricks
drunken-data-quality
Spark package for checking data quality
ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, Axolotl, etc.
tensorframes
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
lazy-linalg
A package full of linear algebra operators for Apache Spark MLlib's linalg package