tanggen's repositories

intro_ds

Code to accompany Mastering Data Science from PT Press

Language: Python | License: Apache-2.0 | Stargazers: 326 | Issues: 25 | Issues: 4

regression2chatgpt

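Companion code for the book 《解构大语言模型:从线性回归到通用人工智能》 (Deconstructing Large Language Models: From Linear Regression to Artificial General Intelligence)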

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 139 | Issues: 1 | Issues: 3

spark_hbase

A Scala example of reading data stored in HBase with Spark, plus an example converter for Python

Language: Shell | License: Apache-2.0 | Stargazers: 25 | Issues: 7 | Issues: 4

scikit-learn

scikit-learn: machine learning in Python

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 2 | Issues: 0

spark

Mirror of Apache Spark

Language: Scala | License: Apache-2.0 | Stargazers: 1 | Issues: 2 | Issues: 0

spark-knowledgebase

Spark Knowledge Base

License: NOASSERTION | Stargazers: 1 | Issues: 2 | Issues: 0

aas

Code to accompany Advanced Analytics with Spark from O'Reilly Media

Language: Scala | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

camus

LinkedIn's Kafka to HDFS pipeline.

Language: Java | Stargazers: 0 | Issues: 2 | Issues: 0

crfsuite

CRFsuite: a fast implementation of Conditional Random Fields (CRFs)

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

dotvim

The configuration for vim

Language: Vim Script | Stargazers: 0 | Issues: 2 | Issues: 0

elasticsearch

Open Source, Distributed, RESTful Search Engine

Language: Java | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

flink

Mirror of Apache Flink

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

flink_vs_spark

Run local test of Apache Flink streaming and Apache Spark streaming

Language: Shell | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 2 | Issues: 0

Ligase

Ligase is a Golang-based implementation of Matrix homeserver, powered by finogeeks https://www.finogeeks.com/Finchat

Language: Go | License: AGPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Ligase-tests

Tests for Ligase

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

opensource

A list of open source projects collected by 编程随想 (Program Think)

License: CC0-1.0 | Stargazers: 0 | Issues: 2 | Issues: 0

oryx

Oryx 2: Lambda architecture on Spark, Kafka for real-time large scale machine learning

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

p0f-mtu

p0f with patches to save MTU value and export it via API

Language: C | Stargazers: 0 | Issues: 2 | Issues: 0

pipeline

Real-time, End-to-End, Advanced Analytics and Machine Learning Recommendation Pipeline

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

scala-kafka

Quick up and running using Scala for Apache Kafka

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0
Stargazers: 0 | Issues: 2 | Issues: 0

shc

The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

synapse

Synapse: Matrix reference homeserver

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

tidb

TiDB is an open source distributed HTAP database compatible with the MySQL protocol

Language: Go | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0