transformers

There are 104 repositories under transformers topic.

generative-ai-for-beginners
microsoft / generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
ai azure chatgpt dall-e generative-ai generativeai gpt language-model llms openai prompt-engineering semantic-search transformers
Language:Jupyter Notebook 61917
annotated_deep_learning_paper_implementations
labmlai / annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
attention deep-learning deep-learning-tutorial gan literate-programming lora machine-learning neural-networks optimizers pytorch reinforcement-learning transformer transformers
Language:Python 54005
LLaMA-Factory
hiyouga / LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers
Language:Python 31043
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
artificial-intelligence attention-mechanism transformers computer-vision image-classification
Language:Python 19684
amusi / CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
cvpr cvpr2020 computer-vision deep-learning machine-learning object-detection image-segmentation paper image-processing visual-tracking python cvpr2021 semantic-segmentation cvpr2022 transformer transformers cvpr2023 cvpr2024
17743
haystack
deepset-ai / haystack
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
ai bert chatgpt generative-ai gpt-3 information-retrieval language-model large-language-models llm machine-learning nlp python pytorch question-answering rag retrieval-augmented-generation semantic-search squad summarization transformers
Language:Python 16806
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
adapter diffusion llm lora parameter-efficient-learning python pytorch transformers
Language:Python 15881
arc53 / DocsGPT
GPT-powered chat for documentation, chat with your documents
ai chatgpt docsgpt information-retrieval language-model llm machine-learning natural-language-processing python pytorch rag react semantic-search transformers web-app
Language:Python 14610
BlinkDL / RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
attention-mechanism chatgpt deep-learning gpt gpt-2 gpt-3 language-model linear-attention lstm pytorch rnn rwkv transformer transformers
Language:Python 12409
PaddleNLP
PaddlePaddle / PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
nlp embedding bert ernie paddlenlp pretrained-models transformers information-extraction question-answering search-engine semantic-analysis sentiment-analysis neural-search uie document-intelligence compression llm distributed-training llama
Language:Python 11975
stas00 / ml-engineering
Machine Learning Engineering Open Book
ai large-language-models llm machine-learning machine-learning-engineering mlops pytorch scalability slurm transformers
Language:Python 11010
xenova / transformers.js
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
browser javascript transformers webml
Language:JavaScript 10993
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
large-language-models model-para transformers
Language:Python 10019
qubvel-org / segmentation_models.pytorch
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
computer-vision deeplab-v3-plus deeplabv3 fpn image-processing image-segmentation imagenet models pretrained-models pretrained-weights pspnet pytorch segformer segmentation segmentation-models semantic-segmentation transformers unet unet-pytorch unetplusplus
Language:Python 9414
NielsRogge / Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
bert gpt-2 layoutlm pytorch transformers vision-transformer
Language:Jupyter Notebook 9085
tokenizers
huggingface / tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
bert gpt language-model natural-language-processing natural-language-understanding nlp transformers
Language:Rust 8895
txtai
neuml / txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
embeddings information-retrieval language-model large-language-models llm machine-learning neural-search nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search vector-search-engine
Language:Python 8715
speechbrain / speechbrain
A PyTorch-based Speech Toolkit
asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition
Language:Python 8586
EleutherAI / gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
gpt gpt-2 gpt-3 language-model transformers
Language:Python 8206
lucidrains / PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
artificial-intelligence attention-mechanisms deep-learning human-feedback reinforcement-learning transformers
Language:Python 7671
openvinotoolkit / openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
inference deep-learning openvino ai computer-vision diffusion-models generative-ai llm-inference natural-language-processing nlp performance-boost speech-recognition stable-diffusion deploy-ai optimize-ai transformers yolo recommendation-system good-first-issue
Language:C++ 6859
EleutherAI / gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
deepspeed-library gpt-3 transformers language-model
Language:Python 6830
bertviz
jessevig / bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
bert gpt2 machine-learning natural-language-processing neural-network nlp pytorch roberta transformer transformers visualization
Language:Python 6796
intel-analytics / ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.
gpu llm pytorch transformers
Language:Python 6499
niedev / RTranslator
Open source real-time translation app for Android that runs locally
android android-app bluetooth-le mobile-app nllb offline onnx onnxruntime realtime-translator sentencepiece transformers translation translator whisper
Language:C++ 6410
BERTopic
MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers
Language:Python 6003
lucidrains / DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
artificial-intelligence attention-mechanism deep-learning multi-modal text-to-image transformers
Language:Python 5552
SkalskiP / courses
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
computer-vision deep-learning deep-neural-networks generative-model machine-learning mlops multimodal natural-language-processing nlp stable-diffusion transformers tutorial
Language:Python 5312
imoneoi / openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
large-language-models open-source transformers
Language:Python 5221
superduper
superduper-io / superduper
Superduper: Integrate AI models and machine learning workflows with your database to implement custom AI applications, without moving your data. Including streaming inference, scalable model hosting, training and vector search.
ai chatbot data database distributed-ml inference llm-inference llm-serving llmops ml mlops mongodb pretrained-models python pytorch rag semantic-search torch transformers vector-search
Language:Jupyter Notebook 4656
lucidrains / x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
artificial-intelligence attention-mechanism deep-learning transformers
Language:Python 4584
cmhungsteve / Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
attention-mechanism attention-mechanisms awesome-list computer-vision deep-learning detr papers self-attention transformer transformer-architecture transformer-awesome transformer-cv transformer-models transformer-with-cv transformers vision-transformer visual-transformer vit
4546
alignment-handbook
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
llm rlhf transformers
Language:Python 4493
marqo
marqo-ai / marqo
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
chatgpt clip deep-learning gpt hacktoberfest hnsw information-retrieval knn large-language-models machine-learning machinelearning multi-modal natural-language-processing search-engine semantic-search tensor-search transformers vector-search vision-language visual-search
Language:Python 4482
lucidrains / deep-daze
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
artificial-intelligence deep-learning implicit-neural-representation multi-modality siren text-to-image transformers
Language:Python 4373
bentrevett / pytorch-sentiment-analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
bert cnn cnn-text-classification fasttext lstm lstm-sentiment-analysis natural-language-processing nlp pytorch pytorch-nlp pytorch-tutorial pytorch-tutorials recurrent-neural-networks rnn sentiment-analysis sentiment-classification torchtext transformers tutorial word-embeddings
Language:Jupyter Notebook 4340