Awsaf (awsaf49)

awsaf49

Geek Repo

Company:@Google

Location:Dhaka, Bangladesh

Home Page:https://awsaf49.github.io

Twitter:@awsaf49

Github PK Tool:Github PK Tool

Awsaf's starred repositories

mlx

MLX: An array framework for Apple silicon

codellama

Inference code for CodeLlama models

Language:PythonLicense:NOASSERTIONStargazers:13860Issues:159Issues:169

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8202Issues:68Issues:187

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookLicense:MITStargazers:5132Issues:27Issues:25

DAMO-YOLO

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Language:PythonLicense:Apache-2.0Stargazers:3689Issues:137Issues:126

anomalib

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Language:PythonLicense:Apache-2.0Stargazers:3239Issues:37Issues:790

asitop

Perf monitoring CLI tool for Apple Silicon

Language:PythonLicense:MITStargazers:3018Issues:28Issues:53

ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Language:PythonLicense:NOASSERTIONStargazers:1747Issues:32Issues:0

OneFormer

OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023

Language:Jupyter NotebookLicense:MITStargazers:1351Issues:20Issues:104

ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Language:PythonLicense:Apache-2.0Stargazers:1139Issues:16Issues:170

Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

resource-stream

CUDA related news and material links

LLM-Training-Puzzles

What would you do with 1000 H100s...

Language:Jupyter NotebookLicense:MITStargazers:766Issues:12Issues:3

swin2sr

[ECCV] Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop ECCV 2022. Try it out! over 3.3M runs https://replicate.com/mv-lab/swin2sr

Language:PythonLicense:Apache-2.0Stargazers:537Issues:12Issues:0

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Language:PythonLicense:MITStargazers:443Issues:10Issues:13

awesome-vision-and-language

A curated list of awesome vision and language resources (still under construction... stay tuned!)

inst-inpaint

A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.

Language:PythonLicense:MITStargazers:329Issues:13Issues:14

caption-upsampling

This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.

Language:PythonLicense:Apache-2.0Stargazers:145Issues:2Issues:0

CFINet

The official implementation for ICCV'23 paper "Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning"

Language:PythonLicense:Apache-2.0Stargazers:100Issues:3Issues:21

attention

several types of attention modules written in PyTorch

Language:PythonStargazers:29Issues:3Issues:0

hila

Official PyTorch code for HILA

Language:PythonLicense:MITStargazers:28Issues:4Issues:2

cad

Content-Adaptive Downsampling in Convolutional Neural Networks (CVPR 2023 Workshop on Efficient Deep Learning for Computer Vision)

Language:PythonLicense:Apache-2.0Stargazers:21Issues:2Issues:0

VideoSwin

Keras Implementation of Video Swin Transformers for 3D Video Modeling

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:19Issues:2Issues:4

detect-fake-text

LLM - Detect AI Generated Text || Identify which essay was written by a large language model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:13Issues:2Issues:1

UniFormerV2

[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5Issues:2Issues:0

mldl-i

MLDL-I: Machine Learning and Deep Learning - I || Offered course at IRAB

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:3Issues:1Issues:0

synatt

Syn-Att: Synthetic Speech Attribution via Semi-Supervised Unknown Multi-Class Ensemble of CNNs

Language:PythonLicense:MITStargazers:3Issues:2Issues:1

yolov5-wandb

YOLOv5 🚀 in PyTorch > ONNX > CoreML / WandB > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:3Issues:1Issues:0

Video-FocalNets

Keras Implementation of Video-FocalNets

Language:Jupyter NotebookLicense:MITStargazers:1Issues:2Issues:1