Artur Kuziakhmetov (ikuzart)

Company: AstraZeneca

Location: Cambridge, United Kingdom

Artur Kuziakhmetov's starred repositories

awesome-selfhosted

A list of Free Software network services and web applications which can be hosted on your own servers

computer-science

🎓 Path to a free self-taught education in Computer Science!

Projects

📃 A list of practical projects that anyone can solve in any programming language.

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 39201 | Issues: 2025 | Issues: 0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python | License: Apache-2.0 | Stargazers: 36032 | Issues: 758 | Issues: 9507

data-science

📊 Path to a free self-taught education in Data Science!

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python | License: NOASSERTION | Stargazers: 13787 | Issues: 201 | Issues: 2313

ThinkStats2

Text and supporting code for Think Stats, 2nd Edition

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 4018 | Issues: 254 | Issues: 95

returns

Make your functions return something meaningful, typed, and safe!

Language: Python | License: BSD-2-Clause | Stargazers: 3454 | Issues: 45 | Issues: 415

trurl

trurl is a command line tool for URL parsing and manipulation.

Language: C | License: NOASSERTION | Stargazers: 3115 | Issues: 24 | Issues: 86

ytsaurus

YTsaurus is a scalable and fault-tolerant open-source big data platform.

Language: C++ | License: Apache-2.0 | Stargazers: 1814 | Issues: 38 | Issues: 357

heartrate

Simple real time visualisation of the execution of a Python program.

Language: Python | License: MIT | Stargazers: 1751 | Issues: 31 | Issues: 11

quilt

Quilt is a data mesh for connecting people with actionable data

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1322 | Issues: 19 | Issues: 119

cc-pyspark

Process Common Crawl data with Python and Spark

Language: Python | License: MIT | Stargazers: 401 | Issues: 23 | Issues: 25