Richard (Rick) Zamora's repositories

pynvml

Provide Python access to the NVML library for GPU diagnostics

Language: Python, 3 40

cudf

cuDF - GPU DataFrame Library

Language: Cuda, License: Apache-2.0, 1 20

dask

Parallel computing with task scheduling

Language: Python, License: BSD-3-Clause, 1 20

distributed

A distributed task scheduler for Dask

Language: Python, License: BSD-3-Clause, 1 20

NVTabular

A library that sits on top of the RAPIDS cuDF library, providing a range of benefits for processing extremely large tabular datasets, particularly those that do not fit in GPU or CPU memory. NVTabular has many capabilities, including fast terabyte-scale data preparation and accelerated tabular data loading, all on GPU, which streamline the first step of both training and inference in any deep recommender system pipeline.

Language: Python, License: Apache-2.0, 1 10

arrow

Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.

Language: C++, License: Apache-2.0, 020

coiled-benchmarks

Language: Python, License: BSD-3-Clause, 000

core

Core Utilities for NVIDIA Merlin

Language: Python, License: Apache-2.0, 010

cugraph

cuGraph - RAPIDS Graph Analytics Library

License: Apache-2.0, 000

cuml

cuML - RAPIDS Machine Learning Library

Language: Cuda, License: Apache-2.0, 010

cuxfilter

GPU accelerated cross filtering with cuDF.

License: Apache-2.0, 000

dask-blog

Dask development blog

Language: HTML, 010

dask-cuda

Utilities for Dask and CUDA interactions

Language: Python, License: Apache-2.0, 010

dask-expr-rapids

Language: Python, License: BSD-3-Clause, 010

dask-match

Language: Python, 000

dask-sql

Distributed SQL Engine in Python using Dask

Language: Python, License: MIT, 010

design-docs

Experimental repo for proposals of future work

010

fastparquet

Python implementation of the Parquet columnar file format.

Language: Python, License: Apache-2.0, 010

filesystem_spec

A specification that Python filesystems should adhere to.

Language: Python, License: BSD-3-Clause, 010

Morpheus

Morpheus SDK

Language: Python, License: Apache-2.0, 000

NeMo-Curator

Scalable toolkit for data curation

License: Apache-2.0, 000

pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Language: Python, License: BSD-3-Clause, 000

pynvml-feedstock

A conda-smithy repository for pynvml.

Language: Shell, License: BSD-3-Clause, 010

rapids-dask-dependency

License: Apache-2.0, 000

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python, License: Apache-2.0, 000

rjzamora.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: HTML, License: MIT, 020

s3fs

S3 Filesystem

Language: Python, License: BSD-3-Clause, 010

systems

Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature stores, nearest neighbor search, and exploration strategies) into end-to-end recommendation pipelines that can be served with Triton Inference Server.

License: Apache-2.0, 000

ucx-py

Python bindings for UCX

Language: Python, 020

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow.

License: Apache-2.0, 000