Sudarsun Santhiappan's repositories
cnlp_transformers
Transformers for Clinical NLP
ctakes-rest-package
Quick repo for setting up and running a cTAKES REST server and making calls to it
docker-spark-cluster
A simple Spark standalone cluster for your testing environment
PyPDF2
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
fabric
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
fastc
Simple and Lightweight Text Classifiers with LLM Embeddings
gemma-cookbook
A collection of guides and examples for the Gemma open models from Google.
GenAI-Projects
A wide curation of open-source projects and applications in the emerging field of generative AI.
hollama
A minimal web-UI for talking to Ollama servers
ISLP_labs
Up-to-date version of labs for ISLP
LivePortrait
Bring portraits to life!
LLM101n
LLM101n: Let's build a Storyteller
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
lm-evaluation-harness
A framework for few-shot evaluation of language models.
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
mimic3-benchmarks
Python suite to construct benchmark machine learning datasets from the MIMIC-III 💊 clinical database.
pdfs
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
philter-ucsf
Open source clinical text de-identification
pystatsml
Statistics and Machine Learning in Python
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
tamil-nlp-catalog
Awesome List of Tamil NLP & AI Resources
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation