The very strategy of asking LLMs to “reason” or explain an answer tends to make them more accurate.
Because instead of the first token being “Yes” or “No”, it’s “That depends,” or If we look at…"
Thus increasing the number of tokens that determines the answer from 1, to theoretically hundreds or more.
Those are all one token. A token can be a whole sentence. Tokenization tends to be based on LZW compression which combines common phrases (of any length, e.g. “Once upon a time” could be a single token because it’s recurring)
“Yes” is almost always followed by an explanation of a single idea while “It depends” is followed by several possible explanations.