Xiangrui Meng (mengxr)

mengxr

Geek Repo

Company:Databricks

Github PK Tool:Github PK Tool

Xiangrui Meng's starred repositories

pyspark-ai

English SDK for Apache Spark

Language:PythonLicense:Apache-2.0Stargazers:818Issues:0Issues:0

llm-numbers

Numbers every LLM developer should know

Stargazers:3937Issues:0Issues:0

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

Stargazers:8871Issues:0Issues:0
Language:TypeScriptLicense:Apache-2.0Stargazers:45Issues:0Issues:0

sparkext

Spark DL Inferencing using external frameworks

Language:ShellLicense:Apache-2.0Stargazers:6Issues:0Issues:0

terraform-provider-databricks

Databricks Terraform Provider

Language:GoLicense:NOASSERTIONStargazers:415Issues:0Issues:0

ML-Engineering

Reference code base for ML Engineering, Manning Publications

Language:Jupyter NotebookStargazers:113Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:24Issues:0Issues:0

byteps

A high performance and generic framework for distributed DNN training

Language:PythonLicense:NOASSERTIONStargazers:3578Issues:0Issues:0

joblib-spark

Joblib Apache Spark Backend

Language:PythonLicense:Apache-2.0Stargazers:239Issues:0Issues:0

koalas

Koalas: pandas API on Apache Spark

Language:PythonLicense:Apache-2.0Stargazers:3319Issues:0Issues:0

IncrementalMoments.jl

Julia package to computes statistics on streams of data

Language:JuliaLicense:NOASSERTIONStargazers:3Issues:0Issues:0
Language:ScalaStargazers:6Issues:0Issues:0
Language:ScalaLicense:MITStargazers:2598Issues:0Issues:0

intellij-jsonnet

Intellij Jsonnet Plugin

Language:JavaLicense:Apache-2.0Stargazers:88Issues:0Issues:0

spark-salesforce

Spark data source for Salesforce

Language:ScalaLicense:Apache-2.0Stargazers:79Issues:0Issues:0

cloud-custodian

Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resources

Language:PythonLicense:Apache-2.0Stargazers:5261Issues:0Issues:0

databricks-cli

(Legacy) Command Line Interface for Databricks

Language:PythonLicense:NOASSERTIONStargazers:380Issues:0Issues:0

drunken-data-quality

Spark package for checking data quality

Language:ScalaLicense:Apache-2.0Stargazers:222Issues:0Issues:0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language:PythonLicense:Apache-2.0Stargazers:35005Issues:0Issues:0

Rserve

Fast, flexible and powerful server providing access to R from many languages and systems

Language:CLicense:NOASSERTIONStargazers:278Issues:0Issues:0

quartz

Code for Quartz Scheduler

Language:JavaLicense:Apache-2.0Stargazers:6127Issues:0Issues:0

ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, Axolotl, etc.

Language:PythonLicense:Apache-2.0Stargazers:6144Issues:0Issues:0

SparkADMM

Generic Implementation of Consensus ADMM over Spark

Language:PythonLicense:Apache-2.0Stargazers:83Issues:0Issues:0

sparklyr

R interface for Apache Spark

Language:RLicense:Apache-2.0Stargazers:929Issues:0Issues:0

photon-ml

A scalable machine learning library on Apache Spark

Language:TerraLicense:NOASSERTIONStargazers:790Issues:0Issues:0

superset

Apache Superset is a Data Visualization and Data Exploration Platform

Language:TypeScriptLicense:Apache-2.0Stargazers:59947Issues:0Issues:0

tensorframes

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:750Issues:0Issues:0
Language:ScalaLicense:Apache-2.0Stargazers:976Issues:0Issues:0

lazy-linalg

A package full of linear algebra operators for Apache Spark MLlib's linalg package

Language:ScalaLicense:Apache-2.0Stargazers:10Issues:0Issues:0