Joel Niklaus (JoelNiklaus)

JoelNiklaus

Geek Repo

Company:University of Bern, Stanford University

Location:Bern

Home Page:niklaus.ai

Twitter:@joelniklaus

Github PK Tool:Github PK Tool


Organizations
googlers

Joel Niklaus's starred repositories

public-apis

A collective list of free APIs

Language:PythonLicense:MITStargazers:299978Issues:4134Issues:604

jina

☁️ Build multimodal AI applications with cloud-native stack

Language:PythonLicense:Apache-2.0Stargazers:20555Issues:208Issues:1938

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:18164Issues:118Issues:502

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:14363Issues:134Issues:2039

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language:PythonLicense:NOASSERTIONStargazers:12284Issues:221Issues:606

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:11920Issues:103Issues:865

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

Language:JavaScriptLicense:Apache-2.0Stargazers:5716Issues:82Issues:163

composer

Supercharge Your Model Training

Language:PythonLicense:Apache-2.0Stargazers:5065Issues:51Issues:532

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language:PythonLicense:Apache-2.0Stargazers:5015Issues:31Issues:50

nlpaug

Data augmentation for NLP

Language:Jupyter NotebookLicense:MITStargazers:4351Issues:41Issues:221

awesome-machine-learning-interpretability

A curated list of awesome responsible machine learning resources.

smart_open

Utils for streaming large files (S3, HDFS, gzip, bz2...)

Language:PythonLicense:MITStargazers:3117Issues:47Issues:391

adapter-transformers

Huggingface Transformers + Adapters = ❤️

Language:PythonLicense:Apache-2.0Stargazers:1976Issues:24Issues:340

spacy-transformers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

Language:PythonLicense:MITStargazers:1328Issues:32Issues:0

contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Language:PythonLicense:MITStargazers:1185Issues:17Issues:108

EasyNMT

Easy to use, state-of-the-art Neural Machine Translation for 100+ languages

Language:PythonLicense:Apache-2.0Stargazers:1103Issues:19Issues:90

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

Language:PythonLicense:Apache-2.0Stargazers:701Issues:24Issues:53

Legal-Text-Analytics

A list of selected resources, methods, and tools dedicated to Legal Text Analytics.

License:CC-BY-SA-4.0Stargazers:577Issues:48Issues:0

mistral

Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.

Language:PythonLicense:Apache-2.0Stargazers:548Issues:16Issues:96

electra_pytorch

Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)

deep-significance

Enabling easy statistical significance testing for deep neural networks.

Language:PythonLicense:GPL-3.0Stargazers:320Issues:8Issues:8

legalbench

An open science effort to benchmark legal reasoning in foundation models

lex-glue

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

ASTRA

Self-training with Weak Supervision (NAACL 2021)

Language:PythonLicense:MITStargazers:155Issues:11Issues:2

Legal-Entity-Recognition

A Dataset of German Legal Documents for Named Entity Recognition

eyecite

Find legal citations in any block of text

Language:PythonLicense:BSD-2-ClauseStargazers:113Issues:17Issues:70

oldp

Open Legal Data Platform

Language:PythonLicense:MITStargazers:94Issues:7Issues:39

awesome-legal-data

Collection of Datasets for Legal Text Processing

Diard

From document (PDF) or document images to analysis ready semi-structured data.

Language:PythonLicense:Apache-2.0Stargazers:20Issues:4Issues:4