theblackcat102's starred repositories
CTranslate2
Fast inference engine for Transformer models
resource-stream
CUDA related news and material links
awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
adept-inference
Inference code for Persimmon-8B
self-correction-llm-papers
A collection of research papers on self-correcting large language models with automated feedback.
neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1 GPU + 1 Day
ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
ToolkenGPT
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
ml-calibration
relplot: Utilities for measuring calibration and plotting reliability diagrams
GPU-Puzzles
Solve puzzles. Learn CUDA.
Beyond-Neural-Scaling
Implementation of "Beyond Neural Scaling: beating power laws" for deep models and prototype-based models
arc-agents
Experiments with LLMs on the Abstraction and Reasoning Corpus (ARC)
json-schema-corpus
Corpus of over 80 thousand JSON Schema documents, collected from open-source GitHub repositories.