1 Comment

Why is it so frustrating to read this statement? three-digit addition is performed accurately by GPT-3 less than 1% of the time on any model with less than 6B parameters, but this jumps to 8% accuracy on a 13B parameter model and 80% accuracy on a 175B parameter model.

Expand full comment