Sanjib Narzary's repositories
awesome-llm
Curated list of open source and openly accessible large language models
bodo-tokenizers
Pre tokenized models for Bodo. This repositoryincludes all the tokenized models to be used in the Neural Machine Translation. The models include pre tokenized models trained using ByteLevelBPETokenizer, BPETokenizer, SentencePieceBPETokenizer, BertWordPieceTokenizer
bodo_news_crawler
Bodo News Crawler
a-PyTorch-Tutorial-to-Machine-Translation
Attention Is All You Need | a PyTorch Tutorial to Machine Translation
alpaca-lora
Instruct-tune LLaMA on consumer hardware
bodo-tokenizer
Tokenizer for Bodo language
furo
A clean customizable documentation theme for Sphinx
Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper - Automatically Detect , Segment and Generate Anything with Image, Text, and Speech Inputs
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
InstructMT
A collection of instruction data and scripts for machine translation.
llama2.c
Inference Llama 2 in one file of pure C
lobe-chat
🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.
maybe
Personal finance and wealth management app
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
PanoHead
Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.
quivr
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Shoonya
Shoonya - Platform to Annotate and label data at scale.
Stirling-PDF
locally hosted web application that allows you to perform various operations on PDF files
StructGPT
The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"
ucse514-2023
Repository for UCSE514 Web and Internet Technology companion repository for assignment submission and learning collaboration in open repository.