Odun (ToluClassics)

Company: University of Waterloo, Waterloo

Location: AOE


Organizations
castorini
project-miracl

Odun's repositories

candle-tutorial

Tutorial for Porting PyTorch Transformer Models to Candle (Rust)

mlx-transformers

MLX Transformers is a library that provides model implementations in MLX. It uses a model interface similar to HuggingFace Transformers and provides a way to load and run models on Apple Silicon devices.

Language: Python | License: Apache-2.0 | Stargazers: 39 | Issues: 6 | Issues: 4

search-related-articles

This repository is dedicated to collecting blog posts and articles on the implementation of state-of-the-art techniques in semantic search, dense retrieval, and retrieval augmented generation (RAG).

LowResourceOCR

This work adapts a CNN+Transformer architecture to train text recognition models for the Yorùbá and Igbo languages

rustserini

A port of Pyserini and Anserini in Rust

datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 2 | Issues: 0

Low-Resource-NLP-Tutorials

Getting started in NLP for low-resource languages

Language: Jupyter Notebook | Stargazers: 1 | Issues: 3 | Issues: 2

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 2 | Issues: 0

afriberta

AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

anserini

Anserini is a Lucene toolkit for reproducible information retrieval research

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

candle

Minimalist ML framework for Rust

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

candle-model-archive

A model archiving package for loading and running inference on candle models, similar to torch-serve

License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Rust | Stargazers: 0 | Issues: 2 | Issues: 0

checklist

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

covidex

A multi-stage neural search engine for the COVID-19 Open Research Dataset

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

DPR

Dense Passage Retriever (DPR) is a set of tools and models for open-domain Q&A tasks.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 2 | Issues: 0

j4rs

Java for Rust

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

mlx-examples

Examples in the MLX framework

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

pygaggle

A gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

roots-search-tool

Scripts supporting the development and serving of the Roots Search Tool: https://hf.co/spaces/bigscience-data/roots-search

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

rust-tokenizers

Rust-tokenizer offers high-performance tokenizers for modern language models, including WordPiece, Byte-Pair Encoding (BPE) and Unigram (SentencePiece) models

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

scikit-llm

Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

text-embeddings-inference

A blazing-fast inference solution for text embedding models

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0
Language: CSS | Stargazers: 0 | Issues: 2 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 4 | Issues: 0