Yiqiao Jin's starred repositories
awesome-instruction-learning
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
screen_annotation
The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format and describe the UI elements present on the screen: their type, location, OCR text, and a short description. It was introduced in the paper `ScreenAI: A Vision-Language Model for UI and Infographics Understanding`.
screen_qa
The ScreenQA dataset was introduced in the paper "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots". It contains ~86K question-answer pairs collected by human annotators for ~35K screenshots from Rico. It is intended for training and evaluating models capable of screen content understanding via question answering.
theme-academic-cv
🎓 Easily create a beautiful academic résumé or educational website using Hugo and GitHub, without writing any code.
Awesome-Code-LLM
A curated list of language modeling research for code and related datasets.
generate-adversarial-text
Bring your own dataset (BYOD) and generate adversarial text examples.
project-page-template
🧸 YAAPPT: Yet Another Academic Project Page Template.
Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
bioinformatics
🔬 Path to a free self-taught education in Bioinformatics!
semi-offline-RL
Semi-Offline Reinforcement Learning for Optimized Text Generation
twitter-scrapper-snscrape
Tutorial to scrape tweets from Twitter using snscrape
enron-formality
Code and data from the paper "Email formality in the workplace: A case study on the Enron corpus"
VERY-NEG-and-VERY-POS-Lexicons
Two lexicons for extreme words
chatgpt-prompt-engineering
Jupyter code notebooks of "ChatGPT Prompt Engineering for Developers" by DeepLearning.AI and OpenAI.
LLMsPracticalGuide
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)