Grzegorz Sajko's starred repositories

resume.github.com

Resumes generated using GitHub information

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 35855 | Issues: 374 | Issues: 65

data-engineering-zoomcamp

Free Data Engineering course!

Language: Jupyter Notebook | Stargazers: 24160 | Issues: 441 | Issues: 124

math-as-code

a cheat-sheet for mathematical notation in code form

mlops-zoomcamp

Free MLOps course from DataTalks.Club

Language: Jupyter Notebook | Stargazers: 10865 | Issues: 181 | Issues: 93

instructor

structured outputs for llms

Language: Python | License: MIT | Stargazers: 7139 | Issues: 48 | Issues: 261

Kats

Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.

Language: Python | License: MIT | Stargazers: 4863 | Issues: 80 | Issues: 185

CodeGen

CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language: Python | License: Apache-2.0 | Stargazers: 4859 | Issues: 79 | Issues: 74

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and PyArrow, with more integrations coming.

Language: Rust | License: Apache-2.0 | Stargazers: 3700 | Issues: 40 | Issues: 892

KeyBERT

Minimal keyword extraction with BERT

Language: Python | License: MIT | Stargazers: 3385 | Issues: 32 | Issues: 200

diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Language: Python | License: NOASSERTION | Stargazers: 1268 | Issues: 24 | Issues: 20

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python | License: Apache-2.0 | Stargazers: 1033 | Issues: 41 | Issues: 69

weywot

My notes on using Linux

Language: Shell | Stargazers: 857 | Issues: 10 | Issues: 0

you-dont-need-a-bigger-boat

An end-to-end implementation of intent prediction with Metaflow and other cool tools

Language: Python | License: MIT | Stargazers: 835 | Issues: 20 | Issues: 4

Language: Jupyter Notebook | License: MIT | Stargazers: 343 | Issues: 3 | Issues: 3

rusty

AI-powered CLI tool to help you remember bash commands.

Language: Rust | License: MIT | Stargazers: 327 | Issues: 4 | Issues: 3

Furland

Building a real-time Twitter graph of your friends

Sensei

Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI

Language: Python | Stargazers: 218 | Issues: 6 | Issues: 0

data-engineering

Construct a modern data stack and orchestrate the workflows to create high-quality data for analytics and ML applications.

Language: Jupyter Notebook | Stargazers: 193 | Issues: 8 | Issues: 2

obsidian-tweet-to-markdown

An Obsidian.md plugin to save tweets as Markdown files.

Language: TypeScript | License: MIT | Stargazers: 190 | Issues: 7 | Issues: 36

LUISE

LUI: Autonomous Collective Decision Making via Large Language Models

Language: Python | License: MIT | Stargazers: 104 | Issues: 11 | Issues: 0

whisper-nextjs

Next.js app for serverless deployments of OpenAI Whisper on Banana.dev

autofinetune

Automatic fine-tuning of models with synthetic data

Language: Python | License: MIT | Stargazers: 70 | Issues: 3 | Issues: 0

qgqa-flashcards

Question Generation - Question Answering for Automatic Flashcards

Language: JavaScript | Stargazers: 64 | Issues: 3 | Issues: 0

de4ml

Supporting materials/code examples for my course in data engineering for machine learning.

Language: Python | Stargazers: 38 | Issues: 7 | Issues: 0

queensland-ai-fastai-course-resources

Resources (including recap slides and notebooks) to support the Queensland AI & Queensland AI Hub community fast.ai course.

Language: Jupyter Notebook | License: MIT | Stargazers: 33 | Issues: 2 | Issues: 0

spchengine

Scripts to create a basic search on podcast data in general

Language: Python | License: GPL-3.0 | Stargazers: 10 | Issues: 1 | Issues: 1

LoRaSystemForSoils

An underground, wireless, open-source, low-cost system for monitoring oxygen, temperature, and soil moisture

Language: C++ | Stargazers: 6 | Issues: 2 | Issues: 0

EcosystemCreatorRepo

Repo for Ecosystem Creator project based on Synthetic Silviculture Paper

Language: C++ | Stargazers: 4 | Issues: 1 | Issues: 0