Custom Tokenizer

Learn about language model tokenization

Large language models process text using tokens, which are common sequences of characters found in a set of text. The models learn to understand the statistical relationships between these tokens, and excel at producing the next token in a sequence of tokens.

📝 Token text input

0 characters

đŸ“Ļ Encoding

🔍 Token visualization

Enter some text for visualization of tokens

â†Šī¸ Decoding

Resets the vocabulary and stored tokens