Paper at Computational Linguistics
We have a paper accepted at Computational Linguistics.
-
Marco Cognetta and Naoaki Okazaki. Tokenization as Finite-State Transduction. Computational Linguistics, 51(4):1119–1149, December 2025. (doi: 10.1162/coli.a.23)