Asif Shaikat's repositories
GPTTokenCounter
A plain script to compare and count token for OpenAI
asifshaikat
My github profile
automate_your_network
The book in PDF format for all to enjoy!
codes3
The source code of CodeS (SIGMOD 2024).
hello-world
First Repository
IndicLLMSuite
A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
audio_clip_processing_pipeline
Audio Clips Processing Pipeline
AutoRAG
RAG AutoML Tool - Find optimal RAG pipeline for your own data.
crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
diago
Short of Dialog + GO. Library/Framework for building VOIP solutions in GO
Learning-LLMs
Understanding the Implementing a ChatGPT-like LLM from scratch, step by step
ML-ground
Machine Learning for myself with all
piper
A fast, local neural text to speech system
RaspberryPi_WebRTC
Native WebRTC uses v4l2 hardware h264 and software openh264 encoder for live streaming on Raspberry Pi.
RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
screenpipe
24/7 local AI screen & mic recording. Build AI apps that have the full context. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.
Some-cheatsheets-notes-and-resources-for-AWS-SAA-C03-exam
Some cheat-sheets, notes and resources for AWS-SAA-C03 exam.
swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
ultravox
A fast multimodal LLM for real-time voice
WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Web-LLM-Assistant-Llama-cpp
A Python-based web-assisted large language model (LLM) search assistant using Llama.cpp
whisper-asr-webservice
OpenAI Whisper ASR Webservice API
WordLlama
Things you can do with the token embeddings of an LLM