Yihong Chen (yihong-chen)

yihong-chen

Geek Repo

Company:UCL NLP

Location:London,UK

Home Page:https://scholar.google.com/citations?user=TipbNkkAAAAJ

Twitter:@yihong_thu

Github PK Tool:Github PK Tool


Organizations
uclnlp

Yihong Chen's starred repositories

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:55274Issues:517Issues:953

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49386Issues:562Issues:208

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:35954Issues:368Issues:312

python-fire

Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

Language:PythonLicense:NOASSERTIONStargazers:26814Issues:370Issues:317

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25811Issues:212Issues:230

pandas-ai

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Language:PythonLicense:NOASSERTIONStargazers:12455Issues:103Issues:672

triton

Development repository for the Triton language and compiler

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11622Issues:169Issues:229

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10700Issues:140Issues:343

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Language:PythonLicense:Apache-2.0Stargazers:6441Issues:71Issues:1685

pyinstrument

🚴 Call stack profiler for Python. Shows you why your code is slow!

Language:PythonLicense:BSD-3-ClauseStargazers:6410Issues:53Issues:156

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonLicense:Apache-2.0Stargazers:5939Issues:68Issues:269

camel

🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org

Language:PythonLicense:Apache-2.0Stargazers:5263Issues:63Issues:365

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5233Issues:39Issues:37

loro

Reimagine state management with CRDTs. Make your app collaborative effortlessly.

Language:RustLicense:MITStargazers:3653Issues:31Issues:96

markwhen

Make a cascading timeline from markdown-like text. Supports simple American/European date styles, ISO8601, images, links, locations, and more.

Language:HTMLLicense:MITStargazers:3479Issues:31Issues:143

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Language:PythonLicense:Apache-2.0Stargazers:3375Issues:30Issues:356

pyarmor

A tool used to obfuscate python scripts, bind obfuscated scripts to fixed machine or expire obfuscated scripts.

Language:PythonLicense:NOASSERTIONStargazers:3223Issues:44Issues:1547

semantra

Multi-tool for semantic search

Language:PythonLicense:MITStargazers:2467Issues:34Issues:60

awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language:C++License:NOASSERTIONStargazers:1030Issues:26Issues:192

llama2.rs

A fast llama2 decoder in pure Rust.

Language:RustLicense:MITStargazers:1004Issues:11Issues:21

crdt-richtext

Rich text CRDT that implements Peritext and Fugue

Language:RustLicense:MITStargazers:272Issues:6Issues:0
Language:PythonLicense:MITStargazers:244Issues:8Issues:13

AStarNet

Official implementation of A* Networks

Language:PythonLicense:MITStargazers:130Issues:5Issues:5

nways_accelerated_programming

N-Ways to GPU Programming Bootcamp

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:57Issues:5Issues:13

gekcs

How to Turn Your Knowledge Graph Embeddings into Generative Models

Language:PythonLicense:GPL-3.0Stargazers:39Issues:2Issues:0

Neuron2Graph

Tools for exploring Transformer neuron behaviour, including input pruning and diversification.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:17Issues:2Issues:1
Language:PythonStargazers:6Issues:0Issues:0

AsEP-dataset

NeurIPS 2024 Dataset and Benchmark Submission "AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction"

Language:Jupyter NotebookLicense:MITStargazers:4Issues:0Issues:0