1 Comment

Great one.

Regarding the overtraining of LLaMA 3 you mentionned, did you get some informations regarding a risk of overfitting when overtraining? I wonder if this is a risk to consider when training such a large language model...

Expand full comment