linjiuning

linjiuning's starred repositories

generative-recommenders

Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152, ICML'24).

Language: Python | License: Apache-2.0 | Stargazers: 450 | Issues: 0

AutoPhrase

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

Language: C++ | License: Apache-2.0 | Stargazers: 1167 | Issues: 0

plato

Plato, Tencent's high-performance distributed graph computing framework

Language: C++ | License: NOASSERTION | Stargazers: 1893 | Issues: 0

euler

A distributed graph deep learning framework.

Language: C++ | License: Apache-2.0 | Stargazers: 2885 | Issues: 0

allennlp

An open-source NLP research library, built on PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 11705 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 127991 | Issues: 0

bert4keras

Keras implementation of transformers for humans

Language: Python | License: Apache-2.0 | Stargazers: 5324 | Issues: 0

gensim

Topic Modelling for Humans

Language: Python | License: LGPL-2.1 | Stargazers: 15393 | Issues: 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 31818 | Issues: 0

Language: C++ | License: Apache-2.0 | Stargazers: 313 | Issues: 0

graph-learn

An Industrial Graph Neural Network Framework

Language: C++ | License: Apache-2.0 | Stargazers: 1266 | Issues: 0

dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Language: Python | License: Apache-2.0 | Stargazers: 13158 | Issues: 0

DeepIE

DeepIE: Deep Learning for Information Extraction

Language: Python | Stargazers: 1927 | Issues: 0

pyjava

This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache Arrow as the exchanging data format.

Language: Python | Stargazers: 46 | Issues: 0

sqlflow

Brings SQL and AI together.

Language: Go | License: Apache-2.0 | Stargazers: 5042 | Issues: 0

ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, Axolotl, etc.

Language: Python | License: Apache-2.0 | Stargazers: 6175 | Issues: 0

mlflow

Open source platform for the machine learning lifecycle

Language: Python | License: Apache-2.0 | Stargazers: 17786 | Issues: 0

tablesaw

Java dataframe and visualization library

Language: Java | License: Apache-2.0 | Stargazers: 3478 | Issues: 0

joinery

Data frames for Java

Language: Java | License: GPL-3.0 | Stargazers: 692 | Issues: 0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 38767 | Issues: 0

smile

Statistical Machine Intelligence & Learning Engine

Language: Java | License: NOASSERTION | Stargazers: 5963 | Issues: 0

aerosolve

A machine learning package built for humans.

Language: Scala | License: Apache-2.0 | Stargazers: 4794 | Issues: 0

CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Language: Java | License: GPL-3.0 | Stargazers: 9547 | Issues: 0

Mallet

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.

Language: Java | License: NOASSERTION | Stargazers: 969 | Issues: 0

mahout

Mirror of Apache Mahout

Language: HTML | License: Apache-2.0 | Stargazers: 2128 | Issues: 0

h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6776 | Issues: 0

tidb

TiDB is an open-source, cloud-native, distributed, MySQL-compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at: https://www.pingcap.com/tidb-serverless/

Language: Go | License: Apache-2.0 | Stargazers: 36434 | Issues: 0

tikv

Distributed transactional key-value database, originally created to complement TiDB

Language: Rust | License: Apache-2.0 | Stargazers: 14694 | Issues: 0

byzer-lang

Byzer (formerly MLSQL): A low-code open-source programming language for data pipelines, analytics, and AI.

Language: Scala | License: Apache-2.0 | Stargazers: 1820 | Issues: 0

spark-notes

Deep Dive into Apache Spark: an in-depth study of the Spark source code

License: Apache-2.0 | Stargazers: 260 | Issues: 0