Sham's starred repositories
supervision
We write your reusable computer vision tools. 💜
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
ai-video-summarizer-and-timestamp-generator-LLM-p
Summarize Youtube Videos and Generate Timestamps Efficiently using LLM [Google Gemini Pro, OpenAI ChatGPT]
Connect-Health
Virtual patient-doctor connections via video calls, with additional chatbot and AI doctor interaction options. Patients can conveniently upload reports for OCR-based summarization.
stable-headshot
Custom fork of stable-diffusion-webui for headshot photo generation
local-llms-analyse-finance
In this project, I explored how local LLMs can be used to label data and support analyses. Specifically, I used Llama2 model to automatically categorise my bank transaction data.
pdf-workdesk
A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
data-to-paper
data-to-paper: Backward-traceable AI-driven scientific research
video-editor
A simple video editor for trimming and resizing clips.
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
StyleSync_PyTorch
PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"