Paper CANINE Pre-training an Efficient Tokenization-Free Encoder for Language Representation Paper Charformer Fast Character Transformers via Gradient-based Subword Tokenization