@atoponce This is most likely an issue with tokenization, rather than something fundamental. That is, the model can't see the structure of the numbers the way we do.

If you write the numbers more unconventionally, they don't get tokenized, and the model can perform the task.