James Melvin Ebenezer's repositories
Liah-Lie_in_a_haystack
needle in a haystack for LLMs
ao
The torchao repository contains api's and workflows for quantization and pruning gpu models.
awesome-language-agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
axolotl
Go ahead and axolotl questions
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
browser-agent
A browser AI agent, using GPT-4
chatbot-ui
An open source ChatGPT UI.
chatgpt-google-extension
A browser extension that enhance search engines with ChatGPT
crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
dicebear
DiceBear is an avatar library for designers and developers. 🌍
flash-attention
Fast and memory-efficient exact attention
ImageBind
ImageBind One Embedding Space to Bind Them All
lago
Open Source Metering and Usage Based Billing API ⭐️ Consumption tracking, Subscription management, Pricing iterations, Payment orchestration & Revenue analytics
lectures
Material for cuda-mode lectures
lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
next-s3-upload
Upload files from your Next.js app to S3
open-information-retrieval
Implementation of Production Ready Information Retrieval System
ring-attention
ring-attention experiments
ring-attention-pytorch
Explorations into Ring Attention, from Liu et al. at Berkeley AI
ring-flash-attention
Ring attention implementation with flash attention
RingAttention
Transformers with Arbitrarily Large Context
unsloth
2-5X faster 70% less memory QLoRA & LoRA finetuning
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.