Xiaorui Jiang's repositories
generative-ai__GoogleCloudPlatform
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
PMC-LLaMA
The official code for "PMC-LLaMA: Towards Building Open-source Language Models for Medicine"
scholarly
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
llama2__dataprofessor
This chatbot app is built using the Llama 2 open source LLM from Meta.
Transformers-Tutorials__NielsRogge_at_GitHub
This repository contains demos I made with the Transformers library by HuggingFace.
systematic-review-datasets__CSMeD_NeurIPS2023
[NeurIPS 2023] CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews
LLMmed_Nature_Communications_2024
Large Language Models Streamline Automated Machine Learning for Clinical Studies
Large-Language-Model-Notebooks-Course
Practical course about Large Language Models.
xmc.dspy
Extreme Multi-label Classification with DSPy. Code coming soon.
SDoH_NPJ_Digital_Medicine_2024
Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Paper: https://www.nature.com/articles/s41746-023-00970-0
BERTCSRS
BERT for Complex Systematic Review Screening
reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
synergy-dataset__asreview
SYNERGY - Open machine learning dataset on study selection in systematic reviews
MedCAT
Medical Concept Annotation Tool
mimic-code
MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases
rhetorical-role-baseline
OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve access to justice in India. BUILD is the first benchmark dataset created by OpenNyAI
causal-text-papers
Curated research at the intersection of causal inference and natural language processing.
OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.
gdsr_antonie_maastricht
🔗 A Graph-augmented Dense Statute Retriever. (EACL 2023)
SIGIR2017-SysRev-Collection
A Test Collection for Evaluating Retrieval of Studies for Inclusion in Systematic Reviews
few-shot-learning__GPT3_style
Few-shot Learning of GPT-3
pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.