Tony's repositories

agent_tutorials

various agent tutorials

Stargazers:0Issues:0Issues:0

ai-workshop-code

Code I wrote for my AI & LLM workshops

Stargazers:0Issues:0Issues:0

astro

The web framework for content-driven websites. ⭐️ Star to support our work!

License:NOASSERTIONStargazers:0Issues:0Issues:0

AutoCoder

We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.

License:Apache-2.0Stargazers:0Issues:0Issues:0

cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

License:Apache-2.0Stargazers:0Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

corenet

CoreNet: A library for training deep neural networks

License:NOASSERTIONStargazers:0Issues:0Issues:0

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

License:Apache-2.0Stargazers:0Issues:0Issues:0

docs

documentation for content creation

Stargazers:0Issues:0Issues:0

elia

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

Stargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License:MITStargazers:0Issues:0Issues:0

GrokkedTransformer

Code for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'

License:MITStargazers:0Issues:0Issues:0

iceberg

Apache Iceberg

License:Apache-2.0Stargazers:0Issues:0Issues:0

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

License:MITStargazers:0Issues:0Issues:0

llama-fs

A self-organizing file system with llama 3

License:MITStargazers:0Issues:0Issues:0

marker

Convert PDF to markdown quickly with high accuracy

License:GPL-3.0Stargazers:0Issues:0Issues:0

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:AstroStargazers:0Issues:0Issues:0

nanotron

Minimalistic large language model 3D-parallelism training

License:Apache-2.0Stargazers:0Issues:0Issues:0

PATE

Official repository for “PATE: Proximity-Aware Time series anomaly Evaluation”.

License:MITStargazers:0Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

License:MITStargazers:0Issues:0Issues:0

pyreft

ReFT: Representation Finetuning for Language Models

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

License:Apache-2.0Stargazers:0Issues:0Issues:0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Stargazers:0Issues:0Issues:0

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

License:Apache-2.0Stargazers:0Issues:0Issues:0

Scrapegraph-ai

Python scraper based on AI

License:MITStargazers:0Issues:0Issues:0

Time-LLM

[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

License:Apache-2.0Stargazers:0Issues:0Issues:0

Time-Series-Library

A Library for Advanced Deep Time Series Models.

License:MITStargazers:0Issues:0Issues:0

timesfm

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

License:Apache-2.0Stargazers:0Issues:0Issues:0