This is wrong.
I asked GPT-4 right now to perform this task and it got 3/3 3-digit calculations correct, and on a 4-digit calculation was off by 10 (not 1!).
And note that artihmetic is seen as a weak spot of LLMs, it is not a good example to attack the claim that they have emergent properties.