Grzegorz Sajko's starred repositories

open_lm

A repository for research on medium sized language models.

Language:PythonLicense:MITStargazers:453Issues:0Issues:0

InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Language:PythonLicense:Apache-2.0Stargazers:6079Issues:0Issues:0

tweety

Twitter Scraper

Language:PythonStargazers:450Issues:0Issues:0

Sensei

Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI

Language:PythonStargazers:218Issues:0Issues:0
Language:PythonLicense:MITStargazers:169Issues:0Issues:0

autofinetune

auto fine tune of models with synthetic data

Language:PythonLicense:MITStargazers:70Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:35858Issues:0Issues:0

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:1033Issues:0Issues:0

instructor

structured outputs for llms

Language:PythonLicense:MITStargazers:7142Issues:0Issues:0

LUISE

LUI: Autonomous Collective Decision Making via Large Language Models

Language:PythonLicense:MITStargazers:104Issues:0Issues:0

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..

Language:RustLicense:Apache-2.0Stargazers:3700Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:342Issues:0Issues:0

data-engineering-zoomcamp

Free Data Engineering course!

Language:Jupyter NotebookStargazers:24161Issues:0Issues:0

spchengine

Scripts to create a basic search on podcast data in general

Language:PythonLicense:GPL-3.0Stargazers:10Issues:0Issues:0

you-dont-need-a-bigger-boat

An end-to-end implementation of intent prediction with Metaflow and other cool tools

Language:PythonLicense:MITStargazers:835Issues:0Issues:0

diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Language:PythonLicense:NOASSERTIONStargazers:1268Issues:0Issues:0

whisper-nextjs

Next.js app for serverless deployments of OpenAI Whisper on Banana.dev

Language:JavaScriptStargazers:92Issues:0Issues:0

rusty

AI-powered CLI tool to help you remember bash commands.

Language:RustLicense:MITStargazers:327Issues:0Issues:0

data-engineering

Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.

Language:Jupyter NotebookStargazers:193Issues:0Issues:0

obsidian-tweet-to-markdown

An Obsidian.md plugin to save tweets as Markdown files.

Language:TypeScriptLicense:MITStargazers:190Issues:0Issues:0

de4ml

Supporting materials/code examples for my course in data engineering for machine learning.

Language:PythonStargazers:38Issues:0Issues:0

resume.github.com

Resumes generated using the GitHub informations

Language:JavaScriptStargazers:61771Issues:0Issues:0

LoRaSystemForSoils

An underground, wireless, open-source, low-cost system for monitoring oxygen, temperature, and soil moisture

Language:C++Stargazers:6Issues:0Issues:0

mlops-zoomcamp

Free MLOps course from DataTalks.Club

Language:Jupyter NotebookStargazers:10865Issues:0Issues:0

Furland

Building a real-time twitter graph of your friends

Language:C#Stargazers:268Issues:0Issues:0

CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonLicense:Apache-2.0Stargazers:4860Issues:0Issues:0

EcosystemCreatorRepo

Repo for Ecosystem Creator project based on Synthetic Silviculture Paper

Language:C++Stargazers:4Issues:0Issues:0

math-as-code

a cheat-sheet for mathematical notation in code form

License:MITStargazers:15002Issues:0Issues:0

qgqa-flashcards

Question Generation - Question Answering for Automatic Flashcards

Language:JavaScriptStargazers:64Issues:0Issues:0

weywot

My notes on using Linux

Language:ShellStargazers:859Issues:0Issues:0