bet0x

@rackspace

Mexico

http://www.barrahome.org

Alberto Ferrer's repositories

AI_Research

My Gen AI research

License: CC0-1.0 | 0 0 0

dataset

darija <-> english dataset

License: NOASSERTION | 0 0 0

every-chatgpt-gui

Every front-end GUI client for ChatGPT

License: MIT | 0 0 0

fastapi_tritonserver

Language: Python | 0 0 0

functionary

Chat language model that can interpret and execute functions/plugins

Language: Python | License: MIT | 0 1 0

GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

0 0 0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python | License: BSD-3-Clause | 0 1 0

Hermes-Function-Calling

Language: Python | License: MIT | 0 1 0

HippoRAG

HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents.

Language: Python | License: MIT | 0 0 0

infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

Language: Python | License: MIT | 0 1 0

k2-data-prep

License: Apache-2.0 | 0 0 0

k2-train

License: Apache-2.0 | 0 0 0

Liger-Kernel

Efficient Triton Kernels for LLM Training

License: BSD-2-Clause | 0 0 0

llm-text-completion-finetune

Guide to fine-tuning large language models for text completion, including example scripts and training data acquisition.

Language: Python | License: Apache-2.0 | 0 0 0

lloco

The official repo for "LLoCo: Learning Long Contexts Offline"

Language: Python | License: MIT | 0 0 0

marker

Convert PDF to markdown quickly with high accuracy

Language: Python | License: GPL-3.0 | 0 0 0

milvus_cli

Milvus Command Line

License: Apache-2.0 | 0 0 0

omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

License: GPL-3.0 | 0 0 0

openagi

Paving the way for open agents and AGI for all.

Language: Python | 0 0 0

openai_trtllm

OpenAI compatible API for TensorRT LLM triton backend

Language: Rust | License: MIT | 0 0 0

ray_vllm_inference

A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.

Language: Python | License: Apache-2.0 | 0 1 0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language: Python | License: MIT | 0 0 0

tensorrt-test

TensorRT LLM Benchmark Configuration

0 0 0

tidb.ai

A [WIP] out-of-the-box RAG (Retrieval-Augmented Generation) app based on the [WIP] vector storage in TiDB Serverless.

Language: TypeScript | License: Apache-2.0 | 0 1 0

TinyAgent

License: MIT | 0 0 0

tiptap

The headless rich text editor framework for web artisans.

Language: TypeScript | License: MIT | 0 0 0

TPI-LLM

TPI-LLM: A High-Performance Tensor Parallelism Inference System for Edge LLM Services.

License: Apache-2.0 | 0 0 0

unsloth

5X faster 60% less memory QLoRA finetuning

Language: Python | License: Apache-2.0 | 0 1 0

Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Language: Python | License: BSD-3-Clause | 0 0 0

workbench-example-mistral-finetune

An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model

License: Apache-2.0 | 0 0 0