Tomohiro Manabe's starred repositories

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:12911Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8985Issues:0Issues:0

MS-MARCO-Web-Search

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

License:MITStargazers:303Issues:0Issues:0

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language:PythonLicense:MITStargazers:1516Issues:0Issues:0

H2_ALSH

Accurate and Fast ALSH for Maximum Inner Product Search (KDD 2018)

Language:C++License:GPL-3.0Stargazers:24Issues:0Issues:0

sitq

Learning to Hash for Maximum Inner Product Search

Language:PythonLicense:MITStargazers:13Issues:0Issues:0

bpr

Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering

Language:PythonLicense:NOASSERTIONStargazers:166Issues:0Issues:0

machine-learning-round-table

Gather around the table, and have a discussion to catch up the latest trend of machine learning ๐Ÿค–

Stargazers:304Issues:0Issues:0

esci-data

Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search

Language:PythonLicense:Apache-2.0Stargazers:236Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:138838Issues:0Issues:0

pecos

PECOS - Prediction for Enormous and Correlated Spaces

Language:PythonLicense:Apache-2.0Stargazers:506Issues:0Issues:0

building-search-app-w-ml

ใ€ŽๆฉŸๆขฐๅญฆ็ฟ’ใซใ‚ˆใ‚‹ๆคœ็ดขใƒฉใƒณใ‚ญใƒณใ‚ฐๆ”นๅ–„ใ‚ฌใ‚คใƒ‰ใ€ใฎใ‚ตใƒณใƒ—ใƒซใ‚ณใƒผใƒ‰ใฎใƒชใƒใ‚ธใƒˆใƒช

Language:PythonStargazers:16Issues:0Issues:0

sphinx

The Sphinx documentation generator

Language:PythonLicense:NOASSERTIONStargazers:6378Issues:0Issues:0
Language:JavaLicense:Apache-2.0Stargazers:15Issues:0Issues:0
Language:JavaLicense:MITStargazers:9Issues:0Issues:0

ir100

ๆƒ…ๅ ฑๆคœ็ดข100ๆœฌใƒŽใƒƒใ‚ฏ

License:MITStargazers:89Issues:0Issues:0

solr

Apache Solr open-source search software

Language:JavaLicense:Apache-2.0Stargazers:1145Issues:0Issues:0

mac-precision-touchpad

Windows Precision Touchpad Driver Implementation for Apple MacBook / Magic Trackpad

Language:CLicense:NOASSERTIONStargazers:8877Issues:0Issues:0

digital_video_introduction

A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: ๐Ÿ‡บ๐Ÿ‡ธ ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿ‡ฏ๐Ÿ‡ต ๐Ÿ‡ฎ๐Ÿ‡น ๐Ÿ‡ฐ๐Ÿ‡ท ๐Ÿ‡ท๐Ÿ‡บ ๐Ÿ‡ง๐Ÿ‡ท ๐Ÿ‡ช๐Ÿ‡ธ

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:15367Issues:0Issues:0

rally

Macrobenchmarking framework for Elasticsearch

Language:PythonLicense:Apache-2.0Stargazers:1933Issues:0Issues:0

elasticsearch

Free and Open, Distributed, RESTful Search Engine

Language:JavaLicense:NOASSERTIONStargazers:69234Issues:0Issues:0

elasticsearch-learning-to-rank

Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch

Language:JavaLicense:Apache-2.0Stargazers:1474Issues:0Issues:0
Language:PythonLicense:MITStargazers:33Issues:0Issues:0

awesome-neural-models-for-semantic-match

A curated list of papers dedicated to neural text (semantic) matching.

Language:HTMLLicense:MITStargazers:774Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:4Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:4Issues:0Issues:0

computer-science-flash-cards

Mini website for testing both general CS knowledge and enforce coding practice and common algorithm/data structure memorization.

Language:HTMLLicense:CC-BY-SA-4.0Stargazers:8385Issues:0Issues:0

coding-interview-university

A complete computer science study plan to become a software engineer.

License:CC-BY-SA-4.0Stargazers:303385Issues:0Issues:0

PRML

PRML algorithms implemented in Python

Language:Jupyter NotebookLicense:MITStargazers:11371Issues:0Issues:0