Zhecheng Li's repositories
RAG-ChatBot
A basic application that uses LangChain, Streamlit, and large language models to build a Retrieval-Augmented Generation (RAG) system over documents; it also shows how to use Groq and deploy your own applications.
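At its core, the retrieval step of such a system ranks document chunks by similarity to the query and pastes the best match into the prompt. A minimal pure-Python sketch, with bag-of-words cosine similarity standing in for the embedding-based retriever a LangChain app would normally use (the example documents are made up for illustration):

```python
import math
from collections import Counter

def cosine(a, b):
    # Cosine similarity between two bag-of-words Counters.
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, docs, k=1):
    # Rank documents by similarity to the query, return the top k.
    q = Counter(query.lower().split())
    ranked = sorted(docs, key=lambda d: cosine(q, Counter(d.lower().split())),
                    reverse=True)
    return ranked[:k]

docs = [
    "Streamlit renders the chat interface in the browser.",
    "Groq serves low-latency LLM inference over an API.",
    "LangChain chains the retriever and the LLM together.",
]
context = retrieve("which library serves fast LLM inference", docs)[0]
# The retrieved context is then prepended to the user question
# before it is sent to the LLM.
prompt = f"Answer using this context:\n{context}\n\nQuestion: ..."
```

A production RAG app would replace the word-count vectors with dense embeddings and a vector store, but the retrieve-then-prompt shape stays the same.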
Kaggle-PII_Data_Detection
Implement named entity recognition (NER) using regex and a fine-tuned LLM, covering 15 categories in total. The ultimate goal is to apply the model to detect personally identifiable information (PII) in student writing.
Kaggle-Automated_Essay_Scoring_2.0
(1) Train large language models to help with automated essay scoring. (2) Extract essay features and train a new tokenizer to build tree models for score prediction.
Kaggle-Detect_Sleep_States
Predict changes in sleep states from sleep-monitoring data, mainly using the PrecTime model.
Kaggle-LLM-Detect_AI_Generated_Text
Detect whether text is AI-generated, either by training a new tokenizer and combining it with tree-based classifiers or by training language models on a large dataset of human- and AI-generated texts.
Kaggle-LLM_Science_Exam
Implementing science-related multiple-choice question answering based on LLMs and RAG.
Custom-ChatGPT
Use a question-answer dataset from Hugging Face to fine-tune ChatGPT and compare the fine-tuned model with the original ChatGPT.
Kaggle-CIBMTR
In this competition, you’ll develop models to improve the prediction of transplant survival rates for patients undergoing allogeneic Hematopoietic Cell Transplantation (HCT) — an important step in ensuring that every patient has a fair chance at a successful outcome, regardless of their background.
Kaggle-Eedi
Develop an NLP-based method to predict the affinity between misconceptions and incorrect answers (distractors) in multiple-choice questions.
Kaggle-LMSYS
Analyze a dataset of conversations from the Chatbot Arena, where various LLMs provide responses to user prompts. The goal is to develop a model that enhances chatbot interactions, ensuring they align more closely with human preferences.
Kaggle-Multilingual_Chatbot_Arena
This competition challenges you to predict which responses users will prefer in a head-to-head battle between chatbots powered by large language models (LLMs).
MultiModal
Basic implementation code for multimodal models and some applications or fine-tuning tasks based on them.
GUI-Python
Simple Python front-end mini-programs, mainly covering libraries such as Streamlit; helpful for understanding how to use various APIs.
Kaggle-CMI-Detect_Sleep_States
The goal of this competition is to detect sleep onset and wake. You will develop a model trained on wrist-worn accelerometer data in order to determine a person's sleep state.
Kaggle-Linking_Writing_Processes_to_Writing_Quality
Predict writing quality from statistical features of the writing process. The key lies in feature engineering and tree models.
Transformer-Compilation
Implementations of various transformer-architecture models, along with applications and fine-tuning code.
Kaggle-Dataset-API-Upload
How to use the Kaggle API to upload data from a server to Kaggle as a dataset.
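The upload flow can be sketched as follows; the directory name, title, and dataset slug below are placeholders, not values from the repo. The Kaggle CLI reads a `dataset-metadata.json` placed next to the files it should upload:

```python
import json
from pathlib import Path

# Hypothetical directory holding the files to upload; replace with your own.
dataset_dir = Path("my_dataset")
dataset_dir.mkdir(exist_ok=True)

# The Kaggle CLI expects this metadata file alongside the data files.
metadata = {
    "title": "My Server Outputs",             # display name on Kaggle
    "id": "your-username/my-server-outputs",  # <username>/<slug>
    "licenses": [{"name": "CC0-1.0"}],
}
(dataset_dir / "dataset-metadata.json").write_text(json.dumps(metadata, indent=2))

# With the metadata in place, the upload itself is done from the shell:
#   kaggle datasets create -p my_dataset               # first upload
#   kaggle datasets version -p my_dataset -m "update"  # later versions
```

`kaggle datasets init -p my_dataset` can also generate the metadata template for you instead of writing it by hand.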
Kaggle-LLM_Prompt_Recovery
LLMs are commonly used to rewrite or make stylistic changes to text. The goal is to recover the LLM prompt that was used to transform a given text.
Kaggle-The_Polytope_Permutation_Puzzle
Using reinforcement learning and recursive methods to solve three types of puzzles.
Lizhecheng02
Zhecheng Li's GitHub profile.
Reinforcement-Learning
Basic reinforcement learning code and small example programs.
UCSD-CSE256
CSE 256 LIGN 256 - Statistical Natural Lang Proc - Nakashole [FA24]
UCSD-CSE256-PA1
CSE 256 LIGN 256 - Statistical Natural Lang Proc - Nakashole [FA24] PA1
UCSD-CSE256-PA2
CSE 256 LIGN 256 - Statistical Natural Lang Proc - Nakashole [FA24] PA2
UCSD-CSE256-PA3
CSE 256 LIGN 256 - Statistical Natural Lang Proc - Nakashole [FA24] PA3
UCSD-CSE256-PA4
CSE 256 LIGN 256 - Statistical Natural Lang Proc - Nakashole [FA24] PA4
UCSD-CSE257-2048
Implement a game AI for the 2048 game based on expectimax search.
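The expectimax idea behind the 2048 AI can be sketched generically: the player's moves are max nodes, while random tile spawns are chance nodes whose child values are averaged. This is a minimal stand-in rather than the repo's actual code; a real 2048 agent would enumerate the four slide directions at max nodes and weight 2-tile and 4-tile spawns (typically 0.9/0.1) at chance nodes instead of averaging uniformly.

```python
def expectimax(node, maximizing):
    """Evaluate an explicit game tree: numbers are leaf heuristic
    values; nested lists alternate between max and chance levels."""
    if isinstance(node, (int, float)):
        return node  # leaf: heuristic evaluation of a board
    if maximizing:
        # Player (max) node: pick the move with the best expected value.
        return max(expectimax(child, False) for child in node)
    # Chance node: random tile spawn, averaged uniformly in this sketch.
    return sum(expectimax(child, True) for child in node) / len(node)

# Two candidate moves, each leading to a chance node over two spawns.
# The second move is better in expectation:
# max(avg(2, 10), avg(8, 8)) = max(6.0, 8.0) = 8.0
best = expectimax([[2, 10], [8, 8]], True)
```

Unlike minimax, the chance layer means pruning is limited, so practical 2048 agents cap the search depth and rely on a good board heuristic at the leaves.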
UCSD-CSE291J
UCSD CSE 291J: Fairness, Bias, and Transparency in Machine Learning (Winter 2025)