Japanese Tokenizer Comparison
This is a demo comparing Japanese tokenizers.
You can compare the tokenization results of tools that are available with just a pip install in Python.
Results
Examples
Examples
Pages:
How to install each library
pip install janome
pip install nagisa
pip install sudachipy sudachidict_core
pip install mecab-python3
pip install fugashi ipadic
pip install fugashi unidic-lite
pip install tiktoken
pip install tiktoken