Márton Miháltz's starred repositories
pandarallel
A simple and efficient tool to parallelize Pandas operations on all available CPUs
CTranslate2
Fast inference engine for Transformer models
gpt-engineer
Specify what you want it to build; the AI asks for clarification and then builds it.
llm-numbers
Numbers every LLM developer should know
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
greptimedb
An open-source, cloud-native, unified time series database for metrics, logs, and events, with SQL and PromQL support. Available on GreptimeCloud.
pedalboard
🎛 🔊 A Python library for audio.
compress-fasttext
Tools for shrinking fastText models (in gensim format)
Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
google-research
Google Research
model-analysis
Model analysis tools for TensorFlow
awesome-hungarian-nlp
A curated list of NLP resources for Hungarian
faster-than-requests
Faster requests on Python 3
sagemaker-training-toolkit
Train machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.
NYTK-NerKor
The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.
dl-translate
Library for translating between 200 languages. Built on 🤗 transformers.
label-studio
Label Studio is a multi-type data labeling and annotation tool with a standardized output format.