
Install prerequisites
Install autotrain-advanced and wandb.
pass@1 on the GSM8k Benchmarks.
Prepare the dataset
Hugging Face AutoTrain expects your custom CSV dataset to follow a specific format.
- Your training file must contain a text column, which the training uses. For best results, the data in the text column should conform to the ### Human: Question?### Assistant: Answer. format. For a good example, see timdettmers/openassistant-guanaco.
However, the MetaMathQA dataset includes the columns query, response, and type. First, pre-process this dataset: remove the type column and combine the content of the query and response columns into a new text column in the ### Human: Query?### Assistant: Response. format. Training uses the resulting dataset, rishiraj/guanaco-style-metamath.
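The pre-processing step described above can be sketched in a few lines of standard-library Python. This is only an illustration, not the script used to build rishiraj/guanaco-style-metamath: the helper names (to_guanaco_text, convert_rows, write_train_csv) and the sample row are hypothetical, and the sketch assumes each input row is a dict with query, response, and type keys.

```python
import csv
import io


def to_guanaco_text(query: str, response: str) -> str:
    """Combine one query/response pair into a single training string."""
    return f"### Human: {query.strip()}### Assistant: {response.strip()}"


def convert_rows(rows):
    """Drop the type column and merge query + response into a text column."""
    return [{"text": to_guanaco_text(r["query"], r["response"])} for r in rows]


def write_train_csv(rows, handle):
    """Write the converted rows as a CSV with a single text column."""
    writer = csv.DictWriter(handle, fieldnames=["text"])
    writer.writeheader()
    writer.writerows(rows)


# Hypothetical sample row in the MetaMathQA column layout (not real data).
sample = [
    {
        "query": "What is 2 + 3?",
        "response": "2 + 3 = 5. The answer is: 5",
        "type": "example",
    },
]

converted = convert_rows(sample)

# An in-memory buffer stands in for the training CSV file here.
buffer = io.StringIO()
write_train_csv(converted, buffer)
print(buffer.getvalue())
```

To produce a real training file, the same functions could be pointed at an opened file handle instead of the in-memory buffer.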
Train using autotrain
You can start training with autotrain-advanced from the command line or a notebook. Pass the --log argument with the value wandb (--log wandb) to log your results to a W&B Run.
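As a rough sketch of how such a command might be assembled from a notebook or script, the snippet below builds an argument list and prints it without running anything. Only --log wandb comes from the text above. The autotrain llm subcommand and the --train, --model, --project-name, --data-path, and --text-column flags are assumptions based on one version of the CLI and may differ in yours, so check autotrain llm --help before relying on them. The model, project, and path values are placeholders.

```python
import shlex


def build_autotrain_command(model, project_name, data_path,
                            text_column="text", log="wandb"):
    """Return an argv list for a training run (flag names are assumptions)."""
    return [
        "autotrain", "llm", "--train",
        "--model", model,
        "--project-name", project_name,
        "--data-path", data_path,
        "--text-column", text_column,
        "--log", log,  # send training metrics to a W&B Run
    ]


# Placeholder values; substitute your own model, project name, and data folder.
command = build_autotrain_command(
    model="your-base-model",
    project_name="your-project",
    data_path="data/",
)

# Print the shell-quoted command for inspection instead of executing it.
print(shlex.join(command))
```

Once autotrain-advanced is installed and the flags are confirmed for your version, the list could be passed to subprocess.run(command, check=True).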
