model-inference

Repositories under the model-inference topic:

bentoml / OpenLLM
Run any open-source LLM, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.
bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna
Language:Python 11777
wangxb96 / Awesome-EdgeAI
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
data-preprocessing edge-ai edge-computing efficient-algorithm machine-learning model-acceleration model-compression model-inference tiny-ml awesome-list deep-learning model-deployment model-design
93
bentoml / CLIP-API-service
CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search
ai-applications clip cloud-native mlops model-inference model-inference-service model-serving openai-clip
Language:Jupyter Notebook 60
hegongshan / Storage-for-AI-Paper
Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)
data-storage deep-learning mlsys storage-system storage-for-ai pytorch tensorflow checkpoint data-preprocessing data-loading data-preparation dataloader model-inference model-storage model-training
41
array2d / deepx
Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & Heterogeneous Hardware Support
deep-learning-framework distributed-training heterogeneous-computing high-performance-computing model-inference model-serving cuda-acceleration simd-optimization
Language:C++ 38
EmbeddedLLM / embeddedllm
EmbeddedLLM: API server for Embedded Device Deployment. Currently supports CUDA/OpenVINO/IpexLLM/DirectML/CPU.
llm llm-inference directml directx-12 gemma llama mistral phi-3 llm-serving model-inference open-source-llm aipc npu windows ipexllm cpu openvino openvino-inference-engine
Language:Python 35
DAVIDNYARKO123 / edge-tpu-silva
Streamlines the use of PyCoral to run TensorFlow Lite models on an Edge TPU USB.
coral-tpu edge-computing fps machine-learning model-inference object-detection pycoral raspberry-pi raspberry-pi-4
Language:Python 29
kdeps / kdeps
Build self-hosted RAG AI Agents powered by open-source LLMs, use LLM models from Ollama and Huggingface, add external API calls, python and shell scripts for context-aware LLM interactions, add validation checks, and build Bring Your Own Infrastructure (BYOI) Dockerized AI Agent images.
api artificial-intelligence dockerized huggingface llm multimodal opensource nvidia cuda docker llama mistral fine-tuning mlops model-inference llmops agents ai-agents llm-agent saas
Language:Go 19
Koldim2001 / Image_captioning
Generating image captions using various neural network architectures
attention-mechanism captioning-images computer-vision image-captioning lstm model-deployment model-inference nlp soft-attention streamlit website word-embeddings
Language:Jupyter Notebook 17
brian-kipkoech-tanui / sagemaker-ML-workflow
Image classifiers are used in computer vision to identify the content of an image. They are applied across a broad variety of industries, from advanced technologies like autonomous vehicles and augmented reality to eCommerce platforms and even diagnostic medicine.
aws aws-ec2 aws-lambda aws-s3 aws-statemachine aws-step-functions endpoint image-classification json model-evaluation model-inference model-testing python sagemaker sagemaker-deployment sagemaker-studio
Language:HTML 4
ChaitanyaC22 / Udacity-AWS-MLE-ND-Project2-Build-a-ML-Workflow-For-Scones-Unlimited-On-Amazon-SageMaker
The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistics company, using AWS SageMaker.
aws aws-ec2 aws-iam aws-lambda aws-s3 aws-sagemaker aws-statemachine aws-step-functions deployment endpoint image-classification json lambda-functions model-evaluation model-inference model-testing python python3 sagemaker-deployment sagemaker-studio
Language:HTML 4
AlvinHon / distributed-model-inference
Example distributed system for ML model inference using Kafka, including a Spring Boot REST+JPA server and a Java consumer program
jpa kafka model-inference spring-boot
Language:Java 1
itancio / churn
machine-learning model-inference python3 streamlit
Language:Python 1
SayamAlt / Cyberbullying-Classification-using-fine-tuned-DistilBERT
Successfully fine-tuned a pretrained DistilBERT transformer model that classifies social media text into one of 4 cyberbullying labels (ethnicity/race, gender/sexual, religion, and not cyberbullying) with an accuracy of 99%.
cyberbullying-detection data-exploration distilbert-model exploratory-data-analysis fine-tune-bert-tensorflow llm model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization
Language:Jupyter Notebook 1
SayamAlt / Financial-News-Sentiment-Analysis
Successfully developed a fine-tuned DistilBERT transformer model that predicts the overall sentiment of a piece of financial news with an accuracy of nearly 81.5%.
data-exploration-and-preprocessing distilbert-model fine-tune-bert-tensorflow hugging-face-transformers model-architecture-and-implementation model-inference model-training-and-evaluation multiclass-classification natural-language-processing sentiment-analysis text-preprocessing text-tokenization
Language:Jupyter Notebook 1
thehrsr / CAR-DAMAGE-DETECTION
This project is a web-based application that uses a pre-trained Mask R-CNN model to detect and classify car damage types (scratch, dent, shatter, dislocation) from images. Users can upload an image of a car, and the application will highlight damaged areas with bounding boxes and masks, providing a clear visual representation of the detected damage.
ai computer-vision custom-dataset deep-learning flask image-processing image-segmentation machine-learning mask-rcnn model-inference mrcnn object-detection python tensorflow web-application
Language:Jupyter Notebook 1
C-bianc / NER-task
Token classification for named entities
lstm model-inference ner ray-tune token-classification
Language:Jupyter Notebook 0
GauravG-20 / Udacity-AWS-MLE-ND-Project2-Build-a-ML-Workflow-For-Scones-Unlimited-On-Amazon-SageMaker
The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistics company, using AWS SageMaker.
aws aws-lambda aws-s3 aws-sagemaker aws-state-machine json jupyter-notebook model-inference python sagemaker-deployment sagemaker-studio aws-endpoint udacity-nanodegree udacity-scholarship-course udacity-machine-learning-fundamentals
Language:HTML 0
KrajShuffle / Classifying_SpeechAudio_CNN
CNN Based Approach for Audio File Classification. Contains Notebooks Illustrating Data Preprocessing, Feature Extraction, Model Training, & Model Inference Workflows & Overall Pipeline
convolutional-neural-networks data-preprocessing feature-engineering feature-extraction model-inference model-training-and-evaluation speech-classification metrics-visualization
Language:Jupyter Notebook 0
vinit714 / --Deep-Learning-for-Fashion-MNIST--Accessory-Classification-Project
This repository contains Python code to classify fashion items using a Convolutional Neural Network (CNN) implemented with TensorFlow and Keras. It includes data preprocessing, model building, training, evaluation, and visualization of results.
cnn data-augmentation data-preprocessing evaluation-metrics fashion-mnist model-inference normalization visualization
Language:Jupyter Notebook 0
kwame-mintah / gcp-cloud-run-function-model-inference
A cloud run function to invoke a prediction against a machine learning model that has been trained outside of a cloud provider.
cloud-functions fastapi gcp google-cloud-platform model-inference
Language:Python
santosh / image-classifier
POC of image classification using scikit-learn.
computer-vision machine-learning model-inference model-training parking-lot scikit-learn
Language:Python
SayamAlt / English-to-Spanish-Language-Translation-using-Seq2Seq-and-Attention
Successfully established a Seq2Seq model with attention that performs English-to-Spanish translation with an accuracy of almost 97%.
attention-is-all-you-need attention-model bert-transformer exploratory-data-analysis fine-tuning-bert hugging-face-transformers language-translation luong-attention model-architecture-and-implementation model-inference model-training-and-evaluation natural-language-processing neural-machine-translation seq2seq-modeling text-generation text-preprocessing text-tokenization
Language:Jupyter Notebook
SayamAlt / Global-Equity-Forecasting-using-LSTM
Successfully established an LSTM model to forecast global equity based on more than 20 years of historical global equity data.
data-loaders deep-learning feature-scaling forecast-evaluation gradient-clipping lstm-neural-networks model-inference model-training-and-evaluation pytorch recurrent-neural-networks time-series-datasets time-series-forecasting xavier-initialization
Language:Jupyter Notebook
SayamAlt / Global-News-Headlines-Text-Summarization
Successfully established a text summarization model using Seq2Seq modeling with Luong Attention, which can give a short and concise summary of the global news headlines.
attention-mechanism data-exploration-and-preprocessing luong-attention model-architecture-and-implementation model-inference natural-language-processing seq2seq-model text-generation text-summarization text-tokenization
Language:Jupyter Notebook
SayamAlt / Grapevine-Leaves-Image-Classification-Using-CNNs
Successfully developed an image classification model using PyTorch to classify the species of grapevine leaves based on their corresponding images.
alexnet-pytorch convolutional-neural-networks data-loader deep-learning densenet169 fine-tuning-cnns image-classification mobilenetv3-large model-inference model-training-and-evaluation multiclass-classification resnet50 vgg19
Language:Jupyter Notebook
SayamAlt / Luxury-Apparel-Product-Category-Classification-using-fine-tuned-DistilBERT
Successfully developed a multiclass text classification model by fine-tuning a pretrained DistilBERT transformer model to classify distinct types of luxury apparel into their respective categories, e.g. pants, accessories, underwear, and shoes.
deep-learning distilbert-fine-tuning distilbert-model exploratory-data-analysis fine-tuning-bert model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization
Language:Jupyter Notebook
SayamAlt / Mental-Health-Classification-using-fine-tuned-DistilBERT
Successfully established a multiclass text classification model by fine-tuning a pretrained DistilBERT transformer model to classify several distinct mental health statuses, such as anxiety, stress, and personality disorder, with an accuracy of 77%.
data-visualization deep-learning distilbert-fine-tuning distilbert-model model-evaluation model-inference model-training-and-evaluation multiclass-text-classification natural-language-processing text-classification text-preprocessing text-tokenization
Language:Jupyter Notebook
SayamAlt / Natural-Scenes-Image-Classification-using-CNNs
Successfully established an image classification model using PyTorch to classify the images of several distinct natural sceneries such as mountains, glaciers, forests, seas, streets and buildings with an accuracy of 86%.
convolutional-neural-networks deep-learning fine-tuning-cnns flask-deployment image-classification image-transformations model-inference model-training-and-evaluation multiclass-classification pytorch resnet50-model torch-dataloader
Language:Jupyter Notebook
SayamAlt / Oral-Disease-Classification-using-CNN
Successfully developed an image classification model using PyTorch to classify two types of oral diseases, namely caries and gingivitis.
binary-classification convolutional-neural-networks data-loader deep-learning image-classification image-transformations model-inference model-training-and-evaluation pytorch
Language:Jupyter Notebook
SayamAlt / Symptoms-Disease-Text-Classification
Successfully developed a fine-tuned BERT transformer model that classifies symptoms to their corresponding diseases with an accuracy of up to 89%.
bert-fine-tuning data-exploration-and-preprocessing exploratory-data-analysis fine-tune-bert-tensorflow hugging-face-transformers model-architecture-and-implementation model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization
Language:Jupyter Notebook
SayamAlt / Wine-Cultivator-Classification-using-ANN
Successfully established an ANN model which can classify wine cultivators based on several characteristics of distinct wines.
artificial-neural-networks classification deep-learning model-inference model-training-and-evaluation multiclass-classification pytorch
Language:Jupyter Notebook
