Repositories under the decoder-model topic:
An LLM cookbook for building your own model from scratch, all the way from gathering data to training
This repository features a custom-built decoder-only language model (LLM) with a total of 37 million parameters 🔥. The model is trained to ask questions about a given context
Experimental project for AI and NLP based on Transformer Architecture
Implementation of the GPT-3 paper: Language Models are Few-Shot Learners
Generate captions for images using a CNN encoder-LSTM decoder structure
Transformers Intuition
Implement a basic Transformer decoder-only model from scratch, then upgrade it to build an LLM of your own
Generative AI fine-tuning and inference for sequence classification tasks
This project aims to simplify texts from research papers using advanced natural language processing (NLP) techniques, making them more accessible to a broader audience
An explainable and simplified version of the OLMo model
DNA sequence generation and classification using transformers
Code and dataset used to train dialect adapters for decoder models.
An LLM-based tool for generating cheese advertisements
A miniGPT inspired by the original nanoGPT released by Andrej Karpathy. This notebook walks through the decoder part of the Transformer architecture in detail.
Build a text summarizer for the Arabic language
Custom decoder Transformer that treats a patient's medical journey like a story told through diagnosis codes instead of words.
A multimodal vision model that takes in an image and a prompt query, and outputs the answer
Decoder model for language modelling
🖼️ Generate descriptive captions for images using a CNN-LSTM model, combining computer vision and NLP for effective storytelling.
On the Design and Performance of Machine Learning Based Error Correcting Decoders
Text Generation using RNN, LSTM, and Transformer
Coding A Decoder Only Transformer Like ChatGPT From Scratch
Intent Detection API using BERT and Flask
Using LLMs from Hugging Face for sentiment analysis, translation, summarization, and extractive question answering
Offline CNN demo, Z-score, auto offset and sgolay control
Offline Kalman filter demo, Z-score and Ridge regression to adapt KF parameters