summer1278

Xia Cui's starred repositories

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python, Apache-2.0, 1417800

garak

LLM vulnerability scanner

Language: Python, Apache-2.0, 92200

MIUA2024.github.io

MIUA2024 Website

Language: HTML, MIT, 200

tweetnlp

TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/understand tweets such as sentiment analysis, emoji prediction, and named entity recognition, powered by state-of-the-art language models specialised on Twitter.

Language: Python, MIT, 28400

nova

NOVA is a tool for annotating and analyzing behaviours in social interactions. It supports annotators with machine learning during the coding process. It also features both discrete labels and continuous scores, and a visualization of streams recorded with the SSI Framework.

Language: C#, GPL-3.0, 17200

Cost-Sensitive_Bert_and_Transformers

Transformers for Cost-Sensitive BERT for Generalisable Sentence Classification on Imbalanced Data

Language: Python, Apache-2.0, 1800

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python, Apache-2.0, 2900300

DialogRPT

EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"

Language: Python, MIT, 33600

atom

:atom: The hackable text editor

Language: JavaScript, MIT, 6002500

OpenChatKit

Language: Python, Apache-2.0, 900800

annotators-agreement-dataset

800

awesome-human-label-variation

A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, accompanying The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation (EMNLP 2022)

6700

Text_Classification

Text Classification Algorithms: A Survey

Language: Python, MIT, 178500

gridspace-stanford-harper-valley

The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.

Language: Python, CC-BY-4.0, 3700

counsel-chat

This repository holds the code for working with data from counselchat.com

Language: Jupyter Notebook, MIT, 13700

AnnoMI

Official repository for the AnnoMI dataset: the first public collection of expert-annotated MI transcripts.

5400

django

The Web framework for perfectionists with deadlines.

Language: Python, BSD-3-Clause, 7740400

sentiment-predictor-for-stress-detection

Voice stress analysis (VSA) aims to differentiate between stressed and non-stressed outputs in response to stimuli (e.g., questions posed), with high stress seen as an indication of deception. In this work, we propose a deep learning-based psychological stress detection model using speech signals. With increasing demands for communication between humans and intelligent systems, automatic stress detection is becoming an interesting research topic. Stress can be reliably detected by measuring the level of specific hormones (e.g., cortisol), but this is not a convenient method for the detection of stress in human-machine interactions. The proposed algorithm first extracts Mel-filter bank coefficients using pre-processed speech data and then predicts the status of stress output using a binary decision criterion (i.e., stressed or unstressed) using CNN (Convolutional Neural Network) and dense fully connected layer networks.

Language: Jupyter Notebook, 8000

OpenPrompt

bert_nli

acl22-depression-phq9

mindspore

mlm-scoring

generalized-fairness-metrics

transformers-interpret

gym

gym-pcgrl

pyttsx3

pygame

google-research