Ken Tsui (kenhktsui)

kenhktsui

Geek Repo

Location:Hong Kong

Github PK Tool:Github PK Tool

Ken Tsui's starred repositories

sgt

Sequence Graph Transform

Language:Jupyter NotebookStargazers:104Issues:0Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:8987Issues:0Issues:0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonLicense:MITStargazers:26974Issues:0Issues:0

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.

Language:Jupyter NotebookLicense:MITStargazers:809Issues:0Issues:0
Language:PythonLicense:MITStargazers:3922Issues:0Issues:0

turndown

🛏 An HTML to Markdown converter written in JavaScript

Language:HTMLLicense:MITStargazers:8084Issues:0Issues:0

firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

Language:TypeScriptLicense:AGPL-3.0Stargazers:5813Issues:0Issues:0

SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Language:PythonLicense:NOASSERTIONStargazers:511Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:10693Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:276Issues:0Issues:0

KoPA

[Paper][Preprint 2023] Making Large Language Models Perform Better in Knowledge Graph Completion

Language:PythonLicense:MITStargazers:103Issues:0Issues:0

reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

Language:TypeScriptLicense:Apache-2.0Stargazers:5037Issues:0Issues:0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1169Issues:0Issues:0

corenet

CoreNet: A library for training deep neural networks

Language:PythonLicense:NOASSERTIONStargazers:6635Issues:0Issues:0

aws-neuron-sdk

Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and integrated with your favorite AWS services

Language:PythonLicense:NOASSERTIONStargazers:418Issues:0Issues:0

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language:PythonLicense:MITStargazers:430Issues:0Issues:0

public-apis

A collective list of free APIs

Language:PythonLicense:MITStargazers:295472Issues:0Issues:0

rho

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

License:MITStargazers:266Issues:0Issues:0

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:890Issues:0Issues:0

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Language:PythonLicense:Apache-2.0Stargazers:3071Issues:0Issues:0

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1616Issues:0Issues:0

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonLicense:Apache-2.0Stargazers:30050Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:59712Issues:0Issues:0

Olive

Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.

Language:PythonLicense:MITStargazers:1306Issues:0Issues:0

WeightWatcher

The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

Language:PythonLicense:Apache-2.0Stargazers:1406Issues:0Issues:0

ELR

Official Implementation of Early-Learning Regularization Prevents Memorization of Noisy Labels

Language:PythonLicense:MITStargazers:286Issues:0Issues:0

pycma

Python implementation of CMA-ES

Language:PythonLicense:NOASSERTIONStargazers:1037Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:794Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:17329Issues:0Issues:0

semantic-router

Superfast AI decision making and intelligent processing of multi-modal data.

Language:PythonLicense:MITStargazers:1523Issues:0Issues:0