Márton Miháltz (mmihaltz)

Company: Meltwater AB, Stockholm

Location: Stockholm, Sweden

Home Page: https://sites.google.com/site/mmihaltz/

Márton Miháltz's starred repositories

pandarallel

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Language: Python | License: BSD-3-Clause | Stargazers: 3599 | Issues: 0

CTranslate2

Fast inference engine for Transformer models

Language: C++ | License: MIT | Stargazers: 3083 | Issues: 0

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language: Python | License: MIT | Stargazers: 51471 | Issues: 0

llm-numbers

Numbers every LLM developer should know

Stargazers: 4002 | Issues: 0

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

Language: Python | License: MIT | Stargazers: 1030 | Issues: 0

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language: Python | License: MIT | Stargazers: 6948 | Issues: 0

gpt4all

GPT4All: Chat with Local LLMs on Any Device

Language: C++ | License: MIT | Stargazers: 67725 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29190 | Issues: 0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License: NOASSERTION | Stargazers: 25936 | Issues: 0

seqio

Task-based datasets, preprocessing, and evaluation for sequence models.

Language: Python | License: Apache-2.0 | Stargazers: 546 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 319 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 2593 | Issues: 0

greptimedb

An Open-Source, Cloud-Native, Unified Time Series Database for Metrics, Logs and Events with SQL/PromQL supported. Available on GreptimeCloud.

Language: Rust | License: Apache-2.0 | Stargazers: 4024 | Issues: 0

pedalboard

🎛 🔊 A Python library for audio.

Language: C++ | License: GPL-3.0 | Stargazers: 5007 | Issues: 0

compress-fasttext

Tools for shrinking fastText models (in gensim format)

Language: Jupyter Notebook | License: MIT | Stargazers: 165 | Issues: 0

Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Language: Python | Stargazers: 2995 | Issues: 0

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 33492 | Issues: 0

model-analysis

Model analysis tools for TensorFlow

Language: Python | License: Apache-2.0 | Stargazers: 1249 | Issues: 0

awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

Stargazers: 217 | Issues: 0

faster-than-requests

Faster requests on Python 3

Language: Nim | License: MIT | Stargazers: 1093 | Issues: 0

sagemaker-training-toolkit

Train machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.

Language: Python | License: Apache-2.0 | Stargazers: 478 | Issues: 0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language: Python | License: Apache-2.0 | Stargazers: 29276 | Issues: 0

BentoML

The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

Language: Python | License: Apache-2.0 | Stargazers: 6828 | Issues: 0

eli5

A library for debugging/inspecting machine learning classifiers and explaining their predictions

Language: Jupyter Notebook | License: MIT | Stargazers: 2745 | Issues: 0

lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

Language: TypeScript | License: Apache-2.0 | Stargazers: 3436 | Issues: 0

NYTK-NerKor

The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.

Language: Shell | License: CC-BY-SA-4.0 | Stargazers: 14 | Issues: 0

dl-translate

Library for translating between 200 languages. Built on 🤗 transformers.

Language: Python | License: MIT | Stargazers: 423 | Issues: 0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language: JavaScript | License: Apache-2.0 | Stargazers: 17605 | Issues: 0

PreSumm

Code for the EMNLP 2019 paper "Text Summarization with Pretrained Encoders"

Language: Python | License: MIT | Stargazers: 1277 | Issues: 0

electra

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Language: Python | License: Apache-2.0 | Stargazers: 2315 | Issues: 0