transformer-architecture

There are 4 repositories under transformer-architecture topic.

Plachtaa / VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
emotional-speech gpt text-to-speech transformer-architecture tts vall-e voice-clone
Language:Python 7965
cmhungsteve / Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
transformer attention-mechanism vision-transformer deep-learning awesome-list transformer-cv transformer-architecture transformer-awesome transformer-with-cv transformer-models visual-transformer computer-vision papers attention-mechanisms self-attention vit detr transformers
4959
tairov / llama2.mojo
Inference Llama 2 in one file of pure 🔥
inference llama llama2 modular mojo parallelize performance simd tensor transformer-architecture vectorization
Language:Mojo 2118
spago
nlpodyssey / spago
Self-contained Machine Learning and Natural Language Processing library in Go
deep-learning machine-learning natural-language-processing neural-network computation-graph automatic-differentiation artificial-intelligence deeplearning transformer-architecture recurrent-networks lstm language-model question-answering bert bert-as-service nlp automatic-translation bart named-entities-recognition
Language:Go 1832
Ma-Lab-Berkeley / CRATE
Code for CRATE (Coding RAte reduction TransformEr).
compression sparsification transformer-architecture white-box-architecture
Language:Python 1252
awslabs / sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
attention-is-all-you-need attention-mechanism attention-model deep-learning deep-neural-networks encoder-decoder machine-learning machine-translation neural-machine-translation pytorch seq2seq sequence-to-sequence sequence-to-sequence-models sockeye transformer transformer-architecture transformer-network translation
Language:Python 1217
berniwal / swin-transformer-pytorch
Implementation of the Swin Transformer in PyTorch.
deep-learning machine-learning pytorch artificial-intelligence transformer-pytorch transformer-architecture attention-model
Language:Python 848
joeynmt
joeynmt / joeynmt
Minimalist NMT for educational purposes
education joey-nmt machine-translation neural-machine-translation nmt nmt-frameworks nmt-tutorial pytorch-transformers rnn-pytorch seq2seq seq2seq-pytorch transformer transformer-architecture
Language:Python 706
linwhitehat / ET-BERT
The repository of ET-BERT, a network traffic classification model on encrypted traffic. The work has been accepted as The Web Conference (WWW) 2022 accepted paper.
pre-training transformer-architecture burst-analysis mask-burst-modeling same-origin-burst-prediction pytorch encrypted-traffic-analysis
Language:Python 553
cuiziteng / Illumination-Adaptive-Transformer
🌕 [BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low light enhancement, 0.004 seconds try this for pre-processing.
bmvc exposure-correction image-enhancement image-reconstruction image-restoration low-level-vision low-light-enhance low-light-image-enhancement pytorch transformer-architecture transformer-models
Language:Python 547
wgcban / ChangeFormer
[IGARSS'22]: A Transformer-Based Siamese Network for Change Detection
change-detection remote-sensing siamese-network transformer-encoder transformer-architecture attention-mechanism pytorch deep-learning satellite-imagery multi-temporal climate-change
Language:Python 535
fastnlp / CPT
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
ptms pretrained-models chinese transformer-architecture text-generation language-understanding
Language:Python 492
google-research / maxvit
[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
architecture classification cnn computer-vision image image-processing mlp object-detection transformer transformer-architecture vision-transformer segmentation resnet
Language:Jupyter Notebook 487
kyegomez / MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
ai artificial-intelligence attention-mechanism machine-learning mamba ml pytorch ssm torch transformer-architecture transformers zeta
Language:Python 459
abdur75648 / Deep-Learning-Specialization-Coursera
This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.
andrew-ng andrew-ng-machine-learning computer-vision convolutional-neural-networks coursera deep-learning deep-learning-andrew-ng deep-learning-coursera deep-learning-specialization face-recognition lstm machine-learning neural-networks nlp object-detection recurrent-neural-networks tensorflow transformer-architecture unet-segmentation updated
Language:Jupyter Notebook 455
wjf5203 / SeqFormer
SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)
transformer-architecture video-instance-segmentation
Language:Python 346
sgrvinod / a-PyTorch-Tutorial-to-Transformers
Attention Is All You Need | a PyTorch Tutorial to Transformers
pytorch pytorch-tutorial attention-is-all-you-need transformer transformer-architecture transformer-tutorial
Language:Python 344
hkproj / transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
attention-is-all-you-need attention-model notes study-notes transformer transformer-architecture
320
ZixuanKe / PyContinual
PyContinual (An Easy and Extendible Framework for Continual Learning)
continual-learning natural-language-processing catastrophic-forgetting knowledge-transfer capsule-network attention-mechanism text-classification transfer-learning transformer-architecture
Language:Python 319
PengBoXiangShang / multigraph_transformer
IEEE TNNLS 2021, transformer, multi-graph transformer, graph, graph classification, sketch recognition, sketch classification, free-hand sketch, official code of the paper "Multi-Graph Transformer for Free-Hand Sketch Recognition"
multi-graph-transformer graph sketch-recognition transformer transformer-architecture pytorch pytorch-implementation sparse-graphs sketch
Language:Python 300
UIC-Liu-Lab / ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs)
catastrophic-forgetting continual-learning knowledge-transfer language-model domain-adaptive-pretraining transfer-learning transformer-architecture natural-language-processing language-modeling
Language:Python 289
VSainteuf / pytorch-psetae
PyTorch implementation of the model presented in "Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention"
agriculture computer-vision cvpr cvpr2020 deep-learning earth-observation machine-learning pytorch remote-sensing satellite-image self-attention spatio-temporal time-series-classification transformer-architecture
Language:Python 210
zhongkaifu / Seq2SeqSharp
Seq2SeqSharp is a tensor based fast & flexible deep neural network framework written by .NET (C#). It has many highlighted features, such as automatic differentiation, different network types (Transformer, LSTM, BiLSTM and so on), multi-GPUs supported, cross-platforms (Windows, Linux, x86, x64, ARM), multimodal model for text and images and so on.
seq2seq encoder-decoder neural-network deep-learning lstm attention-model cuda gpu tensor machine-translation transformer-encoder transformer sequence-to-sequence transformer-architecture translation image text vision-transformer
Language:C# 209
ernie
labteral / ernie
Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.
bert roberta albert distilbert tensorflow keras tensorflow2 nlp sentence-classification transformers huggingface bert-model bert-models bert-as-service bert-embeddings huggingface-transformer transformer-architecture transformer-tensorflow2 natural-language-processing
Language:Python 201
prakhar21 / TextAugmentation-GPT2
Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
gpt-2 nlp-machine-learning transformer-architecture text-augmentation natural-language-processing natural-language-generation textclassification
Language:Python 193
onlybooks / llm
LLM을 활용한 실전 AI 애플리케이션 개발
ai chatgpt llamaindex llm rag transformer-architecture
Language:Jupyter Notebook 185
huangjia2019 / llm-gpt
异步图书：《 GPT图解大模型是怎样构建的》- 这套代码是AI Coder出现之前，自己用纯手工搭建的一套简单有效的NLP经典算法集合。在大语言模型推动的AI Coder兴起之后，很少有机会再创作这么有“手工风”的代码了，不知道这是值得开心还是值得遗憾的事情。
gpt llm nlp transformer-architecture
Language:Jupyter Notebook 182
miccaiif / TransMEF
Official PyTorch implementation of our AAAI22 paper: TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework via Self-Supervised Multi-Task Learning.
multi-exposure-fusion transformer-architecture self-supervised-learning aaai2022 pytorch-implementation
Language:Python 168
LongRoPE
jshuadvd / LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
ai artificial-intelligence deep-learning llm machine-learning gpt language-model natural natural-language-processing natural-language-understanding nlp tokenization tokenizers transformers attention-is-all-you-need attention-mechanisms transformer-architecture natural-language-inference natural-language-procressing
Language:Python 152
vilari-mickopf / mmwave-gesture-recognition
Basic Gesture Recognition Using mmWave Sensor - TI AWR1642
mmwave mmwave-sensor awr1642 texas-instruments tensorflow gesture-recognition transformer-architecture keras python conv1d conv2d lstm neural-network resnet ai machine-learning
Language:Python 140
jcwang123 / BA-Transformer
[MICCAI 2021] Boundary-aware Transformers for Skin Lesion Segmentation
pytorch transformer-architecture miccai skin-lesion-segmentation
Language:Python 130
quanghuy0497 / Transformers4Vision
A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-shot Learning. Keep updated frequently.
attention-mechanism paper-reviews transformer-architecture vision-transformer
114
ra1ph2 / Vision-Transformer
Implementation of Vision Transformer from scratch and performance compared to standard CNNs (ResNets) and pre-trained ViT on CIFAR10 and CIFAR100.
computer-vision transformer-architecture convolutional-neural-networks attention-mechanism pytorch jupyter-notebook
Language:Jupyter Notebook 113
jet-universe / particle_transformer
Official implementation of "Particle Transformer for Jet Tagging".
hep jet-tagging transformer-architecture
Language:Python 109
snailpt / MSCFormer
Multi-Scale Convolutional Transformer Network for Motor Imagery Brain-Computer Interface
eeg-classification motor-imagery-classification multi-scale-convolutional-neural-network transformer-architecture
Language:Python 105
kyegomez / Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
artificial-intelligence gpt4 gpt4-api gpt4all prompt-engineering swarms transformer-architecture transformer-models ai-reasoning
Language:Python 98

transformer-architecture

Plachtaa / VALL-E-X

cmhungsteve / Awesome-Transformer-Attention

tairov / llama2.mojo

nlpodyssey / spago

Ma-Lab-Berkeley / CRATE

awslabs / sockeye

berniwal / swin-transformer-pytorch

joeynmt / joeynmt

linwhitehat / ET-BERT

cuiziteng / Illumination-Adaptive-Transformer

wgcban / ChangeFormer

fastnlp / CPT

google-research / maxvit

kyegomez / MultiModalMamba

abdur75648 / Deep-Learning-Specialization-Coursera

wjf5203 / SeqFormer

sgrvinod / a-PyTorch-Tutorial-to-Transformers

hkproj / transformer-from-scratch-notes

ZixuanKe / PyContinual

PengBoXiangShang / multigraph_transformer

UIC-Liu-Lab / ContinualLM

VSainteuf / pytorch-psetae

zhongkaifu / Seq2SeqSharp

labteral / ernie

prakhar21 / TextAugmentation-GPT2

onlybooks / llm

huangjia2019 / llm-gpt

miccaiif / TransMEF

jshuadvd / LongRoPE

vilari-mickopf / mmwave-gesture-recognition

jcwang123 / BA-Transformer

quanghuy0497 / Transformers4Vision

ra1ph2 / Vision-Transformer

jet-universe / particle_transformer

snailpt / MSCFormer

kyegomez / Algorithm-Of-Thoughts