Haris Jabbar (MaveriQ)

MaveriQ

Geek Repo

Company:Ludwig Maximillian University

Location:Munich

Github PK Tool:Github PK Tool

Haris Jabbar's repositories

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

agency-jekyll-theme

Agency Theme for Jekyll

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

amber-data-prep

Data preparation code for Amber 7B LLM

Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

MicroLlama

This is a 300M MicroLlama version of TinyLlama

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

dolma

Data and tools for generating and inspecting OLMo pre-training data.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

jekyll-theme-neumorphism

Neumorphism designed Jekyll theme for personal websites, portfolios and resumes.

License:MITStargazers:0Issues:0Issues:0

langchain-chatbot-demo

Examples of chatbot implementations with Langchain and Streamlit

Stargazers:0Issues:0Issues:0

linggpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LLaMA-Efficient-Tuning

Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)

License:Apache-2.0Stargazers:0Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

minbpe_spark_gcp

Implementing MinBPE training on GCP DataProc (serverless spark on GCP)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MobiLlama

MobiLlama : Small Language Model tailored for edge devices

License:Apache-2.0Stargazers:0Issues:0Issues:0

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pandas-ai

PandasAI is the Python library that integrates Gen AI into pandas, making data analysis conversational

License:MITStargazers:0Issues:0Issues:0

paralegal

Streamit app with langchain and huggingface

Language:PythonStargazers:0Issues:0Issues:0

promptbench

A unified evaluation framework for large language models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

promptsource

Toolkit for creating, sharing and using natural language prompts.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

python-package-template

A template repo for Python packages from AllenAI

License:Apache-2.0Stargazers:0Issues:0Issues:0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

spacyface

Align the token outputs from Spacy and Huggingface to help understand what language structures transformers see

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sql-eval

Evaluate the accuracy of LLM generated outputs

License:Apache-2.0Stargazers:0Issues:0Issues:0

Streamlit-Authenticator

A secure authentication module to validate user credentials in a Streamlit application.

License:Apache-2.0Stargazers:0Issues:0Issues:0

tiktokenizer

Online playground for OpenAPI tokenizers

License:MITStargazers:0Issues:0Issues:0

useb

Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

License:NOASSERTIONStargazers:0Issues:0Issues:0