Transformer: Matching by tokens vs match by character; also whitespace sensitivity