ONKAR Susladkar's repositories

AudioText-Classififcation

Audio Aware Text Classification

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

MPII-Humankeypoint

Human key point distribution

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 1 | Issues: 0

ActionAI

Custom human activity recognition modules built on pose estimation and cascaded inference using the sklearn API

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

AdelaiDepth

This repo contains the projects 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They address monocular depth estimation and 3D scene reconstruction from a single image.

Language: Python | License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Airplane-Detection-for-Satellites

Airplanes are detected in images taken from satellites

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Cell-Detection

Uses OpenCV and TensorFlow to train on the images and segment the training and test results

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

CoFormer-WACV-2024

Source code of "Textual Alchemy: CoFormer for Scene Text Understanding", published in WACV 2024

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Concept-Segmentation

Medical Imaging

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Darklight-Pytorch

A CNN for dark recognition

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

efficient_net_v2

PyTorch implementation of the EfficientNetV2 backbone with Detectron2 for object detection (just for fun)

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

FEEDNet

This paper introduces a novel model, named FEEDNet, for accurately segmenting the nuclei in H&E-stained WSIs. FEEDNet is an encoder-decoder network that uses LSTM units and “feature enhancement blocks” (FE-blocks). The proposed FE-block avoids the loss of location information incurred by pooling layers by concatenating a downsampled version of the original image to preserve pixel intensities. FEEDNet uses an LSTM unit to capture multi-channel representations compactly.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

GlyphControl-release

[NeurIPS 2023] This is the official inference code for the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Handwriting-Synthesis

A GAN model that can generate handwritten samples

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

IIT-Jodhpu--Challange

IIT-Jodhpur challenge for Research Associate

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

ModernConvNets

Revisions and implementations of modern Convolutional Neural Network architectures in TensorFlow and Keras

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-llama

LLaMA 2 implemented from scratch in PyTorch

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

slbert

SLBERT: A Novel Pre-training Framework for Joint Speech and Language Modeling

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

VIACT_test

Clustering in C++

Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

yolov4

YOLOv4

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0