Spyros Mouselinos's repositories
MiniAnnotator
A quick GUI transcription annotator tool with categories and subcategories.
AgentSims
AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.
beginners-typescript-tutorial
An interactive TypeScript tutorial for beginners
DataMiningUW
Notes for the course
drawdata
Draw datasets from within Jupyter.
GeometryAgents
This repository contains the datasets and evaluation code for the work "Beyond Lines and Circles: Unveiling the Geometric Reasoning Gap in Large Language Models"
IEEE-BigData-2024-Cup
Code for IEEE BigData 2024 Cup: Predicting Chess Puzzle Difficulty
IsarStep
Code and dataset for the paper "IsarStep: a Benchmark for High-level Mathematical Reasoning"
mathematics_dataset
This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.
mindiffusion
Repository of lessons exploring image diffusion models, focused on understanding and education.
Model-Selection-Reasoning
Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)
monobox
An ad-free, lightweight music player that lets you play music from your local library on your mobile device.
obsidian2cosma
Convert Obsidian vaults to collection of notes readable by Cosma and Zettlr
picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project
๐๐ฟ๐ Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations ๐๐ฟ๐ Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. ๐โจ Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning๐๐
transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We significantly improve the systematic generalization of transformer models on a variety of datasets using simple tricks and careful considerations.
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.