Stefan Schweter (stefan-it)

stefan-it

Geek Repo

Location:Near Munich, Germany

Home Page:https://schweter.ml

Github PK Tool:Github PK Tool


Organizations
flairNLP
GermanT5
Hugging-Face-Helping-Hand
Hugging-Face-Supporter
LEL-A

Stefan Schweter's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:48992Issues:553Issues:197

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:8538Issues:73Issues:85

KeyBERT

Minimal keyword extraction with BERT

Language:PythonLicense:MITStargazers:3277Issues:32Issues:191

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1197Issues:29Issues:87

uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Language:PythonLicense:Apache-2.0Stargazers:933Issues:13Issues:24

llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Language:PythonLicense:MITStargazers:747Issues:17Issues:50

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language:PythonLicense:Apache-2.0Stargazers:544Issues:16Issues:5

community-content

Hetzner Online Community Project

Language:MarkdownLicense:MITStargazers:266Issues:13Issues:207

LEAR

The implementation our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".

zett

Code for Zero-Shot Tokenizer Transfer

scaling

Language models scale reliably with over-training and on downstream tasks

Language:Jupyter NotebookLicense:MITStargazers:83Issues:8Issues:3

improved-t5

Experiments for efforts to train a new and improved t5

ScandEval

Evaluation of language models on mono- or multilingual tasks.

Language:PythonLicense:MITStargazers:66Issues:5Issues:300

spacebyte

A byte-level decoder architecture that matches the performance of tokenized Transformers.

Language:Jupyter NotebookStargazers:38Issues:1Issues:0
Stargazers:20Issues:0Issues:0

transformer-smaller-training-vocab

Temporary remove unused tokens during training to save ram and speed.

Language:PythonLicense:MITStargazers:20Issues:3Issues:2
Language:PythonLicense:Apache-2.0Stargazers:7Issues:0Issues:0

fundus-evaluation

Evaluation of the Fundus News Scraper https://github.com/flairNLP/fundus

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

BEAR

BEAR dataset

Stargazers:6Issues:0Issues:0

eacl24-german-legal-questions

Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24

Language:PythonLicense:Apache-2.0Stargazers:6Issues:4Issues:0

tech-report

Raw data, scripts, etc. to produce the tables and figures of our technical report

License:Apache-2.0Stargazers:5Issues:0Issues:0

ChroniclingAmericaQA

ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages

License:MITStargazers:4Issues:0Issues:0

Multi-Level-Training-Framework

Official implementation of "A Multi-level Framework for Accelerating Training Transformer Models""

Language:PythonStargazers:4Issues:0Issues:0

XAMPLER

XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples

Language:PythonStargazers:3Issues:0Issues:0

umLabeller

Inspection tool for characterizing the semantic compositionality of subword tokenization in English

Language:PythonStargazers:3Issues:8Issues:0
License:NOASSERTIONStargazers:3Issues:0Issues:0

newsagency-classification

Recognition of news agency mentions in historical news articles (BERT-based token classification).

Language:Jupyter NotebookLicense:MITStargazers:1Issues:6Issues:2

maibaam-code

Code for preprocessing data for UD annotations and for tagging/parsing experiments of MaiBaam

Language:PythonStargazers:1Issues:0Issues:0

turkish-lm-bias

Investigating Gender Bias in Turkish Language Models

Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0