Hayate Iso (isomap)

Company: @megagonlabs

Location: Mountain View

Home Page: https://isomap.github.io/

Twitter: @iso_map


Organizations
aistairc

Hayate Iso's starred repositories

tech-interview-handbook

💯 Curated coding interview preparation materials for busy software engineers

Language: TypeScript | License: MIT | Stargazers: 117896 | Issues: 2111 | Issues: 102

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 36662 | Issues: 372 | Issues: 316

OpenHands

🙌 OpenHands: Code Less, Make More

Language: Python | License: MIT | Stargazers: 32902 | Issues: 291 | Issues: 1428

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 29125 | Issues: 308 | Issues: 94

faker

Faker is a Python package that generates fake data for you.

Language: Python | License: MIT | Stargazers: 17661 | Issues: 222 | Issues: 738

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 10238 | Issues: 162 | Issues: 739

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | Stargazers: 8556 | Issues: 46 | Issues: 587

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7773 | Issues: 89 | Issues: 100

mlx-examples

Examples in the MLX framework

Language: Python | License: MIT | Stargazers: 5993 | Issues: 71 | Issues: 478

pdfminer.six

Community maintained fork of pdfminer - we fathom PDF

Language: Python | License: MIT | Stargazers: 5876 | Issues: 117 | Issues: 705

awesome-leetcode-resources

Awesome LeetCode resources to learn Data Structures and Algorithms and prepare for Coding Interviews.

llama-stack-apps

Agentic components of the Llama Stack APIs

Language: Python | License: MIT | Stargazers: 3713 | Issues: 117 | Issues: 50

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 2575 | Issues: 24 | Issues: 27

dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Language: Python | License: NOASSERTION | Stargazers: 2498 | Issues: 40 | Issues: 23

mozc

Mozc - a Japanese Input Method Editor designed for multi-platform

Language: C++ | License: NOASSERTION | Stargazers: 2403 | Issues: 92 | Issues: 896

LLM-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

WebLaTex

A complete alternative for Overleaf with VSCode + Web + Git Integration + Copilot + Grammar & Spell Checker + Live Collaboration Support. Based on GitHub Codespace and Dev container.

Language: TeX | License: MIT | Stargazers: 994 | Issues: 8 | Issues: 13

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language: Python | License: Apache-2.0 | Stargazers: 548 | Issues: 16 | Issues: 71

Overleaf-Workshop

Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.

Language: TypeScript | License: AGPL-3.0 | Stargazers: 482 | Issues: 3 | Issues: 99

notebooks

Code examples and jupyter notebooks for the Cohere Platform

Language: Jupyter Notebook | License: MIT | Stargazers: 472 | Issues: 17 | Issues: 13

Long-Context-Data-Engineering

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

linktree

Simple site to group all my profiles on social networks in one place. A free Linktree alternative.

Language: CSS | License: MIT | Stargazers: 423 | Issues: 4 | Issues: 5

lost-in-the-middle

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Language: Python | License: MIT | Stargazers: 304 | Issues: 5 | Issues: 14

multi-downloader-nx

Downloader for Crunchyroll, Hidive, AnimeOnegai, and AnimationDigitalNetwork with CLI and GUI

Language: TypeScript | License: MIT | Stargazers: 270 | Issues: 11 | Issues: 430

InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Language: Python | License: MIT | Stargazers: 263 | Issues: 9 | Issues: 20

jsonresume-fake

Fully generated fake resumes using machine learning models trained off ~6000 JSON resumes.

Language: Python | License: Unlicense | Stargazers: 215 | Issues: 14 | Issues: 6

xatu

🕊️ Code and Data for XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates (Zhang et al; LREC-COLING 2024)

Language: Python | License: BSD-3-Clause | Stargazers: 5 | Issues: 4 | Issues: 0
License: NOASSERTION | Stargazers: 4 | Issues: 4 | Issues: 0

ambignlg

🐶 Data for AmbigNLG: Addressing Task Ambiguity in Instruction for NLG (Ayana Niwa and Hayate Iso; EMNLP 2024)

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0