ISSAI (IS2AI)

ISSAI

IS2AI

Geek Repo

Institute of Smart Systems and Artificial Intelligence

Location:Nur-Sultan, Kazakhstan

Home Page:issai.nu.edu.kz

Github PK Tool:Github PK Tool

ISSAI's repositories

TurkicASR

A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

Language:PythonLicense:CC-BY-4.0Stargazers:54Issues:6Issues:2

TurkicTTS

A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.

thermal-facial-landmarks-detection

SF-TL54: Thermal Facial Landmark Dataset with Visual Pairs.

Language:Jupyter NotebookLicense:MITStargazers:37Issues:2Issues:5

KazEmoTTS

An open-source Kazakh Emotional Text-to-Speech Dataset

telegram-bot-chatgpt

Telegram bot to interact with ChatGPT via voice messages

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

Central-Asian-Food-Dataset

42 food classes from Kazakh National and Central Asian cuisine

Language:PythonLicense:MITStargazers:13Issues:0Issues:0

OpenThermalPose

An Open-Source Annotated Thermal Human Pose Dataset and Initial YOLOv8-Pose Baselines

License:MITStargazers:10Issues:0Issues:0

faces-in-event-streams

This repo contains code and instructions for the detection of faces in event streams

Language:PythonLicense:MITStargazers:9Issues:2Issues:8

Kazakh-Speech-Commands-Dataset

Kazakh Speech Commands Dataset

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9Issues:1Issues:0
Language:PythonStargazers:7Issues:1Issues:0

AnyFace

Input-Agnostic Face Detection

Language:Jupyter NotebookLicense:MITStargazers:6Issues:1Issues:0

KazQAD

An open-source Kazakh Question Answering Dataset

License:CC-BY-SA-4.0Stargazers:6Issues:5Issues:0

KazParC

An open-source parallel corpus for machine translation across Kazakh, English, Russian, and Turkish

Language:Jupyter NotebookStargazers:5Issues:1Issues:0

KazSAnDRA

An open-source Kazakh Sentiment Analysis Dataset of Reviews and Attitudes (KazSAnDRA) and baseline sentiment classification models

Language:PythonStargazers:3Issues:1Issues:0

COHI-O365

The most diverse in number of images/labels/classes fisheye synthetic dataset with source codes and models. As well as a benchmarking testing real dataset.

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

Column-Design-Optimization

Column design optimization

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

Common-Objects-in-Hemispherical-Images-Dataset

39 classes of objects sampled from the MS COCO dataset captured with a hemispherical/fisheye camera

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

city-identification

This repo contains dataset and models for city classification

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

city-sustainability-indexes

This repo contains code and models for detecting city sustainability indexes

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

TatarTTS

TatarTTS: An Open-Source Text-to-Speech Synthesis Dataset for the Tatar Language

Vision-Language-Models-for-Activity-Recognition-and-Abnormality-Detection-for-Elderly

VLM PrismerZ model for recognition of emergency and non-emergneyc situations via vision and language transformers. PrismerZ is directed on understanding the contextual information and completing image captioning and visiom qiestion answering tasks.

docker-flask-api-template

This is docker Flask API template with GPU support. As an example the project has X-Ray disease classificator project in it.

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

talk-llm

Talk with ChatGPT

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

.github

ISSAI

Stargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

Enhancing-Ambient-Assisted-Living-with-Multi-Modal-Vision-and-Language-Models

This project is aimed at detecting the abnormal behaviour or emergency cases using vision-language model (VLM), large language model (LLM), human detection model, text-to-speech (TTS) and speech-to-text models (STT). The framework can detect the subtle sings of emergency and actively interact with the user to make an accurate decision.

Stargazers:0Issues:0Issues:0

HPE-depth-fisheye

This project used synthetic data created using Nvidia Omniverse to train a camera-view invariant multi-pose HPE model for depth and fisheye cameras.

License:MITStargazers:0Issues:0Issues:0

serge

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Language:SvelteLicense:Apache-2.0Stargazers:0Issues:0Issues:0

TatarSCR

An Open-Source Speech Commands Dataset for the Tatar Language

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0