gururise/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
Stargazers: 1496
Watchers: 27
Issues: 25
Forks: 146
gururise/AlpacaDataCleaned Issues
Chinese sft data
Updated
5 months ago
How to format dataset fields in model prompt?
Closed
5 months ago
Comments count: 1
Where is the 9k cleaned alpaca data in the paper Alpagasus?
Updated
a year ago
Comments count: 2
Is there a boost in performance for full fine-tuning versus LoRA?
Closed
a year ago
Comments count: 2
The MNLI score in lm-evaluation-harness
Updated
a year ago
Is the "alpaca_data_cleaned_archive.json" file having all cleaned data?
Closed
a year ago
Comments count: 2
PIQA dataset's metric
Updated
a year ago
Command to run the evaluation
Updated
a year ago
Identify code snippet in "input" fields
Updated
a year ago
Comments count: 1
Evaluation Metric
Updated
a year ago
Comments count: 8
Contributing to the dataset curation with Argilla and the Alpaca Garbage collector
Closed
a year ago
Comments count: 2
Diffs as data
Closed
a year ago
Comments count: 1
80% of math outputs are wrong
Closed
a year ago
Comments count: 1
Correct or potentially to be cleaned?
Closed
a year ago
Comments count: 6
Separate instructions by functionality
Closed
a year ago
Any chance we could improve the dataset beyond fixing?
Updated
a year ago
Comments count: 41
good job
Closed
a year ago
Comments count: 1
What about starting a crowdfunding campaign to collect money to run the examples against GPT-4?
Updated
a year ago
Comments count: 5
How are you going about cleaning?
Updated
a year ago
Comments count: 4
overall approach
Closed
a year ago
Idea about better cleaning
Closed
a year ago
Comments count: 3
Incorrect key string in alpaca_data_cleaned.json
Closed
a year ago
Hosting your dataset on the Hugging Face Hub
Closed
a year ago
Comments count: 3
ModuleNotFoundError: No module named 'utils'
Closed
a year ago
Comments count: 1
Adding scripts for data cleaning
Closed
a year ago
Comments count: 3