Ruifeng Zheng (zhengruifeng)

Company: Databricks

Location: Beijing, China

Ruifeng Zheng's repositories

spark-libFM

An implementation of Factorization Machines (LibFM)

Language: Scala, License: Apache-2.0, Stargazers: 248, Issues: 34, Issues: 16

SparkGBM

Spark-based GBM

Language: Terra, License: Apache-2.0, Stargazers: 56, Issues: 7, Issues: 2

spark

Mirror of Apache Spark

Language: Scala, License: Apache-2.0, Stargazers: 2, Issues: 2, Issues: 0

aexpy

AexPy /eɪkspaɪ/ is Api EXplorer in PYthon for detecting API breaking changes in Python packages. (ISSRE'22)

Language: Python, License: MPL-2.0, Stargazers: 0, Issues: 1, Issues: 0

arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

arrow-datafusion

Apache Arrow DataFusion and Ballista query engines

Language: Rust, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

breeze

Breeze is a numerical processing library for Scala.

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

dbt-databricks

A dbt adapter for Databricks.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 1, Issues: 0

flink

Apache Flink

Language: Java, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

hive

Apache Hive

Language: Java, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

langchain

⚡ Building applications with LLMs through composability ⚡

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

LightGBM

A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK (http://github.com/microsoft/dmtk) project of Microsoft.

Language: C++, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

modin

Modin: Scale your Pandas workflows by changing a single line of code

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

numpy

The fundamental package for scientific computing with Python.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

py4j

Py4J enables Python programs to dynamically access arbitrary Java objects

Language: Java, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

scikit-learn

scikit-learn: machine learning in Python

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 1, Issues: 0

scipy

SciPy library main repository

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 1, Issues: 0

spark-connect-go

Apache Spark Connect Client for Golang

Language: Go, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

spark-website

Apache Spark Website

License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0