cdx's repositories
alignment-scripts
Scripts to preprocess training and test data and to run fast_align and giza
alpaca-lora
Instruct-tune LLaMA on consumer hardware
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
CGMH
Codes for <CGMH: Constrained Sentence Generation by Metropolis-Hastings Sampling>
chatgpt-evaluation
This respository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity"
CodeXGLUE
CodeXGLUE
controlled-response-generation
Explicitly controlling style and content of response generation
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
financial-news-dataset
109,110 news from Reuters.
GuidedLDA
semi supervised guided topic model with custom guidedLDA
keras-transformer
Transformer implemented in Keras
LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
Microduino_Tutorials
Microduino Tutorials
pyLDAvis
Python library for interactive topic model visualization. Port of the R LDAvis package.
python-topic-model
Implementation of various topic models
lm-evaluation-harness
A framework for few-shot evaluation of language models.
RyuApps
Creates a simple Ryu app using the tutorials and then adds on to it.
sacreBLEU
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
text-feat-lib
Provide a comprehensive list of tokenizers, features, and general NLP things used for text analysis with examples. The initial focus is on features used for twitter data and sentiment analysis.