Publications

Awards

Language Resources Award 2025 (2025-03-07)

Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Shota Hirai, Sakae Mizuki, Rio Yokota, Naoaki Okazaki

Swallow LLM

URL
Sponsorship Award (Hitachi), the 31th Annual Meeting of The Association for Natural Language Processing (2025-03-13)

Xin Zhao, Naoki Yoshinaga, Daisuke Oba

Multifaceted Analysis of Recalling Factual Knowledge in Large Language Models

URL
Special Committee Award, the 31th Annual Meeting of The Association for Natural Language Processing (2025-03-13)

Eri Onami, Taiki Miyanishi, Koki Maeda, Shuhei Kurita

LegalViz: Legal Text Visualization by Text To Diagram Generation

URL
Special Committee Award, the 31th Annual Meeting of The Association for Natural Language Processing (2025-03-13)

Masanari Ohi, Masahiro Kaneko, Naoaki Okazaki, Nakamasa Inoue

Multi-modal, Multi-task, Multi-criteria Automatic Evaluation with Vision Language Models

URL
Special Committee Award, the 31th Annual Meeting of The Association for Natural Language Processing (2025-03-13)

Keito Sasagawa, Koki Maeda, Issa Sugiura, Shuhei Kurita, Naoaki Okazaki, Daisuke Kawahara

LLM-jp-3 VILA: Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model

URL
Special Committee Award, the 31th Annual Meeting of The Association for Natural Language Processing (2025-03-13)

Panatchakorn Anantaprayoon, Masahiro Kaneko, Naoaki Okazaki

Mitigating Social Bias in Large Language Models by Self-Correction

URL
Young Researcher’s Encouragement Award, the 31th Annual Meeting of The Association for Natural Language Processing (2025-03-13)

Koki Maeda

llm-jp-eval-mm: An Evaluation Suite For Japanese-centric Vision and Language Models

URL
Excellence Award, the Association for Natural Language Processing (2025-03-10)

Youmi Ma, An Wang, Naoaki Okazaki

DREEAM: Guiding Attention with Evidence for Improving Document-Level Relation Extraction

URL
Sponsorship Award (CyberAgent, Inc.), the 19th Symposium of Young Researcher Association for NLP Studies (2024-09-06)

Koshiro Saito, Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

Easily detectable LLMs without sacrificing its generative capability

URL
Encouragement Award, the 19th Symposium of Young Researcher Association for NLP Studies (2024-09-06)

Koshiro Saito, Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

Easily detectable LLMs without sacrificing its generative capability

URL
Best Paper Award, the 261th Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2024-09-03)

Koshiro Saito, Sakae Mizuki, Masanari Ohi, Taishi Nakamura, Taihei Shiotani, Koki Maeda, Youmi Ma, Kakeru Hattori, Kazuki Fujii, Takumi Okamoto, Shigeki Ishida, Hiroya Takamura, Rio Yokota, Naoaki Okazaki

Advantages of Training LLMs on Japanese Text

URL
Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

Masanari Ohi

Likelihood-based Mitigation of Evaluation Bias in Large Language Models

URL
Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

Yuki Wata

Sampling-based Membership Inference Attack to Large Language Models

URL
Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

Mengsay Loem

Enhancing Learning and Inference Capabilities of Language Models via Dicussions with Adversarial Utterances

URL
Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

Ayana Niwa

AmbiNLG: Instruction Text Disambiguation for Natural Language Generation

URL
Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

Shota Koyama

N-gram F-score between the Original Text, Reference Text, and Corrected Text for Automatic Evaluation of Grammatical Error Correction

URL
Best Paper Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

Naoaki Okazaki, Kakeru Hattori, Shota Hirai, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae Mizuki

Swallow Corpus: Japanese Large-Scale Web Corpus

URL
Best Paper Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Shota Hirai, Sakae Mizuki, Rio Yokota, Naoaki Okazaki

Constructing Large Language Models with Strong Japanese Capability Through Continual Pre-training

URL
Encouragement Award, the 18th Symposium of Young Researcher Association for NLP Studies (2023-08-31)

Youmi Ma, An Wang, Naoaki Okazaki

Constructing Document-Level Relation Extraction Corpora in Japanese

URL
Sponsorship Award (PKSHA Technology), the 18th Symposium of Young Researcher Association for NLP Studies (2023-08-31)

Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples

URL
Sponsorship Award (HAKUHODO Technologies), the 18th Symposium of Young Researcher Association for NLP Studies (2023-08-31)

Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples

URL
Best Paper Award (first place), the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

Youmi Ma, An Wang, Naoaki Okazaki

DREEAM: Guiding Attention with Evidence for Improving Document-Level Relation Extraction

URL
Best Paper Award, the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

Sakae Mizuki, Naoaki Okazaki

Semantic Specialization for Knowledge-based Word Sense Disambiguation

URL
Sponsorship Award (Hitachi), the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

Kakeru Hattori, Youmi Ma, Naoaki Okazaki

Query Suggestion and Summarization: Generating Query-Summary Pairs for Query-Focused Summarization

URL
Special Committee Award, the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

Masahiro Kaneko, Graham Neubig, Naoaki Okazaki

Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach

URL
Special Committee Award, the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

Kyosuke Nishida, Taku Hasegawa, Koki Maeda, Kuniko Saito

DueT: Foundation Model for Visual and Language based on Dual-adapter Tuning

URL
Best paper award (first place), the Association for Natural Language Processing (2022-03-17)

Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki

Optimizing Word Segmentation for Downstream Tasks by Weighting Text Vector

URL
Best Paper Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki

Vanishing Gradient Problem and its Solution for Multi-layer Transformer

URL
Best Paper Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

Koki Maeda, Masahiro Kaneko, Naoaki Okazaki

IMPARA: Impact-based Metrics for GEC using PARAllel Data

URL
Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

Ayana Niwa, Sho Takase, Naoaki Okazaki

Non-autoregressive Generation using the Nearest Neighbor

URL
Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

Hiyori Yoshikawa, Naoaki Okazaki

Selective Prediction for Evaluating Confidence of Knowledge in Language Models

URL
Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

Sayo Kada, Yosuke Yamano, Akane Niimi, Hideaki Tamori, Norito Kokai, Naoaki Okazaki, Kentaro Inui

An Automatic Selection Method for Thumbnail Image using Movie Title

URL
Outstanding Paper Award, AKBC2021 (2021-10-05)

Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, Naoaki Okazaki

Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction

URL
Best Paper Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

Sakae Mizuki, Naoaki Okazaki

Hyponymy Detection using Hierarchical Code Learning

URL
Young Researcher’s Encouragement Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

Tatsuya Hiraoka

Optimizing Word Segmentation using Loss Values of Downstream Tasks

URL
Young Researcher’s Encouragement Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

Youmi Ma

Named Entity Recognition and Relation Extraction by Table-Filling using BERT

URL
Committee Special Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

Kosuke Yamada ,Yuta Hitomi,Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

Headline Generation that Reliably Contains the Specified Words

URL
Sponsor Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

Kosuke Yamada ,Yuta Hitomi,Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

Headline Generation that Reliably Contains the Specified Words

URL
TokyoTech Education Award 2020 (2021-03-02)

Yoshihiro Miyake, Naoaki Okazaki, Takafumi Kanamori, Tsuyoshi Murata, Shin-ya Nishizaki, Kazuyuki Shudo, Kenji Kise, Masamichi Shimosaka, Masakazu Sekijima, Keisuke Yanagisawa, Masahiro Kuze, Mitsuji Sampei, Ichiro Yamanaka, Takehiko Itoh, Toru Takeuchi, Takeo Yamaguchi, Kei Sakaguchi

University-wide Education Program of Data Science and Artificial Intelligence for Graduate Students

URL
Presentation award, the 15th NTCIR (2020-12-17)

Yuichi Sasazawa, Naoaki Okazaki

WER99 at the NTCIR-15 QA Lab-PoliInfo-2 Classification Task

URL
Winning the Video-guided Machine Translation (VMT) Challenge 2020 (2020-07-13)

Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, and Naoaki Okazaki

Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020

URL
Language resource award, the 26th Annual Meeting of The Association for Natural Language Processing (2020-03-20)

Yuta Hitomi, Yuya Taguchi, Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

Style Transfer for Abstractive Summarization in a Small-scale Resource

URL
Young researcher’s encouragement award, the 26th Annual Meeting of The Association for Natural Language Processing (2020-03-20)

Kazuki Matsumaru

Improving Truthfulness of Headline Generation

URL
Young researcher’s encouragement award, the 242nd Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2019-10-25)

Tatsuya Hiraoka

HMM-based Neural Network Capturing Latent History with RNN

URL
Best Paper Award, the 240th Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2019-06-14)

Kazuki Matsumaru, Sho Takase, Naoaki Okazaki

Re-examinating the Task of Headline Generation based on Textual Entailment

URL
JSAI 2018 Best Paper Award (2018-06-27)

Sho Takase, Naoaki Okazaki, Kentaro Inui

Learning to Compose Distributed Representations of Relational Patterns

URL
Best Paper Award, the 24th Annual Meeting of The Association for Natural Language Processing (2018-03-15)

Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata

Reducing odd generation in neural headline generation

URL

Research Publications

Journal papers

An Wang, Huidong Jiang, Youmi Ma, Junfeng Jiang, Ao Liu, and Naoaki Okazaki. Improving Implicit Sentiments Analysis via Explanations of Multiple Perspectives. In IEEE Access, pages to appear, 2025.
Vijay Daultani, Hector Vazquez Martinez, and Naoaki Okazaki. Acceptability Evaluation of Naturally Written Sentences. Journal of Information Processing, 32:652–666, August 2024. (doi: 10.2197/ipsjjip.32.652)

DOI
Ao Liu, Congjian Luo, and Naoaki Okazaki. Improving Logical-Level Natural Language Generation with Topic-Conditioned Data Augmentation and Logical Form Generation. Journal of Information Processing, 31:332–343, April 2023. (doi: 10.2197/ipsjjip.31.332)

DOI
Ayana Niwa, Sho Takase, and Naoaki Okazaki. Nearest Neighbor Non-autoregressive Text Generation. Journal of Information Processing, 31:334–352, April 2023. (doi: 10.2197/ipsjjip.31.344)

DOI
Chunpeng Ma, Aili Shen, Hiyori Yoshikawa, Tomoya Iwakura, Daniel Beck, and Timothy Baldwin. On the Effectiveness of Images in Multi-Modal Text Classification: An Annotation Study. ACM Trans. Asian Low-Resour. Lang. Inf. Process., 22(3):1–19, March 2023. (doi: 10.1145/3565572)

URL DOI
Tosho Hirasawa, Masahiro Kaneko, Aizhan Imankulova, and Mamoru Komachi. Pre-Trained Word Embedding and Language Model Improve Multimodal Machine Translation: A Case Study in Multi30K. IEEE Access, 10:67653–67668, 2022. (doi: 10.1109/ACCESS.2022.3185243)

DOI
Zhishen Yang, Tosho Hirasawa, Mamoru Komachi, and Naoaki Okazaki. Why videos do not guide translations in video-guided machine translation? An empirical evaluation of video-guided machine translation dataset. Journal of Information Processing, 30:388–396, May 2022. (doi: 10.2197/ipsjjip.30.388)

DOI
Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Named Entity Recognition and Relation Extraction Using Enhanced Table Filling by Contextualized Representations. 自然言語処理, 29(1):187–223, March 2022. (doi: 10.5715/jnlp.29.187)

DOI
Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Recurrent Neural Hidden Markov Model for High-Order Transition. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 21(2):1–15, March 2022. (doi: 10.1145/3476511)

URL DOI
Emanuele Bugliarello, Ryan Cotterell, Naoaki Okazaki, and Desmond Elliott. Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs. Transactions of the Association for Computational Linguistics, 9:978–994, September 2021. (doi: 10.1162/tacl_a_00408)

URL DOI
Ayana Niwa, Naoaki Okazaki, Kohei Wakimoto, Keisuke Nishiguchi, and Masataka Mouri. Construction of a Corpus of Rhetorical Devices in Slogans and Structural Analysis of Antitheses. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 20(6), November 2021. (doi: 10.1145/3465218)

DOI
Sangwhan Moon and Naoaki Okazaki. The Effects and Mitigation of Out-of-Vocabulary in Universal Language Models. Journal of Information Processing, 29:490–503, July 2021. (doi: 10.2197/ipsjjip.29.490)

DOI
Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation for 48 Low-resource Japanese Dialects. Journal of Natural Language Processing, 27(4):781–800, December 2020. (doi: 10.5715/jnlp.27.781)

DOI
Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. Journal of Natural Language Processing, 27(3):599–626, September 2020. (doi: 10.5715/jnlp.27.599)

DOI
Diana Galvan-Sosa, Koji Matsuda, Naoaki Okazaki, and Kentaro Inui. Empirical Exploration of the Challenges in Temporal Relation Extraction from Clinical Text. Journal of Natural Language Processing, 27(2):383–409, June 2020. (doi: 10.5715/jnlp.27.383)

DOI
Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. Stance Detection Attending External Knowledge from Wikipedia. Journal of Information Processing, 27:499–506, August 2019. (doi: 10.2197/ipsjjip.27.499)

DOI
Masatoshi Suzuki, Koji Matsuda, Satoshi Sekine, Naoaki Okazaki, and Kentaro Inui. A Joint Neural Model for Fine-Grained Named Entity Classification of Wikipedia Articles. IEICE Transactions on Information and Systems, Special Section on Semantic Web and Linked Data, E101.D(1):73–81, January 2018. (doi: 10.1587/transinf.2017SWP0005)

DOI
Ran Tian, Naoaki Okazaki, and Kentaro Inui. The mechanism of additive composition. Machine Learning, 106(7):1083–1130, July 2017. (doi: 10.1007/s10994-017-5634-8)

DOI
Shuangshuang Zhou, Naoaki Okazaki, Koji Matsuda, Ran Tian, and Kentaro Inui. Supervised Approaches for Japanese Wikification. Journal of Information Processing, 25:341–350, April 2017. (doi: 10.2197/ipsjjip.25.341)

DOI

International conferences

Marco Cognetta, David Pohl, Junyoung Lee, and Naoaki Okazaki. Pitfalls, Subtleties, and Techniques in Automata-Based Subword-Level Constrained Generation. In Tokenization Workshop, pages (to appear), Vancouver, Canada, July 2025.

URL
Masahiro Kaneko, Youmi Ma, Yuki Wata, and Naoaki Okazaki. Sampling-based Pseudo-Likelihood for Membership Inference Attacks. In Findings of the Association for Computational Linguistics: ACL 2025 (ACL), pages (to appear), Vienna, Austria, July 2025.
Eri Onami, Taiki Miyanishi, Koki Maeda, and Shuhei Kurita. LegalViz: Legal Text Visualization by Text To Diagram Generation. In Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), pages 6657–6676, Albuquerque, New Mexico, May 2025.
Keito Sasagawa, Koki Maeda, Issa Sugiura, Shuhei Kurita, Naoaki Okazaki, and Daisuke Kawahara. Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model. In Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Demo Track (NAACL), pages (to appear), Albuquerque, New Mexico, May 2025.
Junyoung Lee, Marco Cognetta, Sangwhan Moon, and Naoaki Okazaki. Jamo-Level Subword Tokenization in Low-Resource Korean Machine Translation. In The Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT), pages (to appear), Albuquerque, USA, May 2025.
Ryuto Koike, Masahiro Kaneko, and Naoaki Okazaki. How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection. In Findings of the Association for Computational Linguistics: EMNLP 2024 (EMNLP), pages 14384–14395, Miami, Florida, USA, November 2024. (doi: 10.18653/v1/2024.findings-emnlp.841)

URL DOI
Marco Cognetta, Vilém Zouhar, and Naoaki Okazaki. Distributional Properties of Subword Regularization. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 10753–10763, Miami, Florida, USA, November 2024. (doi: 10.18653/v1/2024.emnlp-main.600)

URL DOI
Shota Koyama, Ryo Nagata, Hiroya Takamura, and Naoaki Okazaki. n-gram F-score for Evaluating Grammatical Error Correction. In Proceedings of the 17th International Natural Language Generation Conference (INLG), pages 303–313, Tokyo, Japan, September 2024.

URL
Naoaki Okazaki, Kakeru Hattori, Hirai Shota, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, and Sakae Mizuki. Building a Large Japanese Web Corpus for Large Language Models. In Proceedings of the First Conference on Language Modeling (COLM), pages (18 pages), University of Pennsylvania, USA, October 2024.

URL
Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Hirai Shota, Sakae Mizuki, Rio Yokota, and Naoaki Okazaki. Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities. In Proceedings of the First Conference on Language Modeling (COLM), pages (25 pages), University of Pennsylvania, USA, October 2024.

URL
Mengsay Loem, Masahiro Kaneko, and Naoaki Okazaki. SAIE Framework: Support Alone Isn’t Enough - Advancing LLM Training with Adversarial Remarks. In Proceedings of the 27th European Conference on Artificial Intelligence (ECAI), pages 3717–3724, Santiago de Compostela, Spain, October 2024. (doi: 10.3233/FAIA240931)

URL DOI
Koki Maeda, Tosho Hirasawa, Atsushi Hashimoto, Jun Harashima, Leszek Rybicki, Fukasawa Yusuke, and Yoshitaka Ushiku. COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset a Vision-Language Benchmark. In Proceedings of the European Conference on Computer Vision (ECCV), pages (to appear), Milan, Italy, September 2024.
Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, and Naoaki Okazaki. Likelihood-based Mitigation of Evaluation Bias in Large Language Models. In Lun-Wei Ku, Andre Martins, and Vivek Srikumar, editors, Findings of the Association for Computational Linguistics ACL 2024 (ACL 2024), pages 3237–3245, Bangkok, Thailand and virtual meeting, August 2024. (doi: 10.18653/v1/2024.findings-acl.193)

URL DOI
Marco Cognetta, Tatsuya Hiraoka, Rico Sennrich, Yuval Pinter, and Naoaki Okazaki. An Analysis of BPE Vocabulary Trimming in Neural Machine Translation. In Shabnam Tafreshi, Arjun Akula, João Sedoc, Aleksandr Drozd, Anna Rogers, and Anna Rumshisky, editors, Proceedings of the Fifth Workshop on Insights from Negative Results in NLP, pages 48–50, Mexico City, Mexico, June 2024. (doi: 10.18653/v1/2024.insights-1.7)

URL DOI
Marco Cognetta, Vilém Zouhar, Sangwhan Moon, and Naoaki Okazaki. Two Counterexamples to Tokenization and the Noiseless Channel. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 16897–16906, Torino, Italia, May 2024.

URL
Panatchakorn Anantaprayoon, Masahiro Kaneko, and Naoaki Okazaki. Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 6395–6408, Torino, Italia, May 2024.

URL
Youmi Ma, An Wang, and Naoaki Okazaki. Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 2567–2579, Torino, Italia, May 2024.

URL
Masahiro Kaneko and Naoaki Okazaki. Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 3955–3961, Torino, Italia, May 2024.

URL
Ryuto Koike, Masahiro Kaneko, and Naoaki Okazaki. OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples. In The 38th Annual AAAI Conference on Artificial Intelligence (AAAI), pages 21258–21266, February 2024.
Koki Maeda, Shuhei Kurita, Taiki Miyanishi, and Naoaki Okazaki. Query-based Image Captioning from Multi-context 360° Images. In Findings of the Association for Computational Linguistics: EMNLP 2023 (EMNLP), pages 6940–6954, Singapore, December 2023. (doi: 10.18653/v1/2023.findings-emnlp.463)

URL DOI
Taku Hasegawa, Kyosuke Nishida, Koki Maeda, and Kuniko Saito. DueT: Image-Text Contrastive Transfer Learning with Dual-adapter Tuning. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 13607–13624, Singapore, December 2023. (doi: 10.18653/v1/2023.emnlp-main.839)

URL DOI
Trang Nguyen and Naoaki Okazaki. Causal Reasoning through Two Layers of Cognition for Improving Generalization in Visual Question Answering. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 9221–9236, Singapore, December 2023. (doi: 10.18653/v1/2023.emnlp-main.573)

URL DOI
Masahiro Kaneko and Naoaki Okazaki. Reducing Sequence Length by Predicting Edit Operations with Large Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 10017–10029, Singapore, December 2023. (doi: 10.18653/v1/2023.emnlp-main.619)

URL DOI
Youmi Ma, Bhushan Kotnis, Carolin Lawrance, Goran Glavaš, and Naoaki Okazaki. Improving Cross-Lingual Transfer for Open Information Extraction with Linguistic Feature Projection. In Proceedings of the 3rd Workshop on Multi-lingual Representation Learning (MRL), pages 125–138, Singapore, December 2023. (doi: 10.18653/v1/2023.mrl-1.11)

URL DOI
Trang Nguyen, Amin Mansouri, Kanika Madan, Khuong Duy Nguyen, Kartik Ahuja, Dianbo Liu, and Yoshua Bengio. Reusable Slotwise Mechanisms. In A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine, editors, Advances in Neural Information Processing Systems (NeurIPS), volume 36, pages 23533–23556, 2023.

URL
Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated. In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 2: Short Papers) (AACL), pages 29–36, Nusa Dua, Bali, November 2023. (doi: 10.18653/v1/2023.ijcnlp-short.4)

URL DOI
Masayasu Muraoka, Bishwaranjan Bhattacharjee, Michele Merler, Graeme Blackwood, Yulong Li, and Yang Zhao. Cross-Lingual Transfer of Large Language Model by Visually-Derived Supervision Toward Low-Resource Languages. In Proceedings of the 31th ACM International Conference on Multimedia (MM ’23), pages 3637–3646, October 2023. (doi: 10.1145/3581783.3611992)

DOI
Yang Zhao, Tetsuya Nasukawa, Masayasu Muraoka, and Bishwaranjan Bhattacharjee. A Simple Yet Strong Domain-Agnostic De-bias Method for Zero-Shot Sentiment Classification. In Findings of the Association for Computational Linguistics: ACL 2023, pages 3923–3931, Toronto, Canada, July 2023.

URL
Mengsay Loem, Masahiro Kaneko, Sho Takase, and Naoaki Okazaki. Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods. In Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023) (BEA), pages 205–219, Toronto, Canada, July 2023.

URL
An Wang, Junfeng Jiang, Youmi Ma, Ao Liu, and Naoaki Okazaki. Generative Data Augmentation for Aspect Sentiment Quad Prediction. In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM), pages 128–140, Toronto, Canada, July 2023. (doi: 10.18653/v1/2023.starsem-1.12)

URL DOI
Marco Cognetta, Sangwhan Moon, Lawrence Wolf-Sonkin, and Naoaki Okazaki. Parameter-Efficient Korean Character-Level Language Modeling. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 2350–2356, Dubrovnik, Croatia, May 2023.

URL
Hiyori Yoshikawa and Naoaki Okazaki. Selective-LAMA: Selective Prediction for Confidence-Aware Evaluation of Language Models. In Findings of the Association for Computational Linguistics: EACL 2023 (Findings of EACL), pages 2017–2028, Dubrovnik, Croatia, May 2023.

URL
Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Comparing Intrinsic Gender Bias Evaluation Measures without using Human Annotated Examples. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 2857–2863, Dubrovnik, Croatia, May 2023.

URL
Sakae Mizuki and Naoaki Okazaki. Semantic Specialization for Knowledge-based Word Sense Disambiguation. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 3457–3470, Dubrovnik, Croatia, May 2023.

URL
Youmi Ma, An Wang, and Naoaki Okazaki. DREEAM: Guiding Attention with Evidence for Improving Document-Level Relation Extraction. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 1971–1983, Dubrovnik, Croatia, May 2023.

URL
Zhishen Yang, Raj Dabre, Hideki Tanaka, and Naoaki Okazaki. SciCap+: A Knowledge Augmented Dataset to Study the Challenges of Scientific Figure Captioning. In Proceedings of the Workshop on Scientific Document Understanding, co-located with 37th AAAI Conference on Artificial Intelligence (CEUR Workshop Proceedings), page (Paper13), Washington DC, USA, February 2023.

URL
Ao Liu, Haoyu Dong, Naoaki Okazaki, Shi Han, and Dongmei Zhang. PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 5531–5546, Abu Dhabi, United Arab Emirates, December 2022.

URL
Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Gender Bias in Meta-Embeddings. In Findings of the Association for Computational Linguistics: EMNLP 2022 (EMNLP), pages 3118–3133, Abu Dhabi, United Arab Emirates, December 2022.

URL
Hiroki Iida and Naoaki Okazaki. Unsupervised Domain Adaptation for Sparse Retrieval by Filling Vocabulary and Word Frequency Gaps. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (AACL), pages 752–765, Online, November 2022.

URL
Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Debiasing Isn’t Enough! – on the Effectiveness of Debiasing MLMs and Their Social Biases in Downstream Tasks. In Proceedings of the 29th International Conference on Computational Linguistics (COLING), pages 1299–1310, Gyeongju, Republic of Korea, October 2022.

URL
Koki Maeda, Masahiro Kaneko, and Naoaki Okazaki. IMPARA: Impact based Metric for GEC using Parallel Data. In Proceedings of the 29th International Conference on Computational Linguistics (COLING), pages 3578–3588, Gyeongju, Republic of Korea, October 2022.

URL
Yidong Wang, Hao Wu, Ao Liu, Wenxin Hou, Zhen Wu, Jindong Wang, Takahiro Shinozaki, Manabu Okumura, and Yue Zhang. Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction. In Proceedings of the 29th International Conference on Computational Linguistics (COLING), pages 7075–7085, Gyeongju, Republic of Korea, October 2022.

URL
Hsuan-Yu Kuo, Youmi Ma, and Naoaki Okazaki. Annotating Entity and Causal Relationships on Japanese Vehicle Recall Information. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 783–791, Manila, Philippines, October 2022.

URL
Vijay Daultani and Naoaki Okazaki. Improving Automatic Evaluation of Acceptability Based on Language Models with a Coarse Sentence Representation. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 109–118, Manila, Philippines, October 2022.

URL
Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, and Karin Verspoor. Overview of ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents. In International Conference of the Cross-Language Evaluation Forum for European Languages (CLEF), pages 521–540, September 2022.
Mengsay Loem, Sho Takase, Masahiro Kaneko, and Naoaki Okazaki. ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop (NAACL SRW), pages 16–24, Hybrid: Seattle, Washington + Online, July 2022. (doi: 10.18653/v1/2022.naacl-srw.3)

URL DOI
Haoyu Dong, Zhoujun Cheng, Xinyi He, Mengyu Zhou, Anda Zhou, Fan Zhou, Ao Liu, Shi Han, and Dongmei Zhang. Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (IJCAI), pages 5426–5435, July 2022. (doi: 10.24963/ijcai.2022/761)

URL DOI
Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, and Naoaki Okazaki. Gender Bias in Masked Language Models for Multiple Languages. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 2740–2750, Seattle, United States, July 2022. (doi: 10.18653/v1/2022.naacl-main.197)

URL Code DOI
Yu Pan, Zeyong Su, Ao Liu, Wang Jingquan, Nannan Li, and Zenglin Xu. A Unified Weight Initialization Paradigm for Tensorial Convolutional Neural Networks. In International Conference on Machine Learning (ICML), pages 17238–17257, Baltimore, Maryland, United States, July 2022.

URL
Won Ik Cho, Sangwhan Moon, Jongin Kim, Seokmin Kim, and Nam Soo Kim. StyleKQC: A Style-Variant Paraphrase Corpus for Korean Questions and Commands. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 7122–7128, Marseille, France, June 2022.

URL
Hwichan Kim, Sangwhan Moon, Naoaki Okazaki, and Mamoru Komachi. Learning How to Translate North Korean through South Korean. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 6711–6718, Marseille, France, June 2022.

URL
Sangwhan Moon, Won Ik Cho, Hye Joo Han, Naoaki Okazaki, and Nam Soo Kim. OpenKorPOS: Democratizing Korean Tokenization with Voting-Based Open Corpus Annotation. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 4975–4983, Marseille, France, June 2022.

URL
Sho Takase and Naoaki Okazaki. Multi-Task Learning for Cross-Lingual Abstractive Summarization. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 3008–3016, Marseille, France, June 2022.

URL
Yujin Takahashi, Masahiro Kaneko, Masato Mita, and Mamoru Komachi. ProQE: Proficiency-wise Quality Estimation dataset for Grammatical Error Correction. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 5994–6000, Marseille, France, June 2022.

URL
Masahiro Kaneko, Sho Takase, Ayana Niwa, and Naoaki Okazaki. Interpretability for Language Learners Using Example-Based Grammatical Error Correction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 7176–7187, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.496)

URL Code DOI
Ao Liu, An Wang, and Naoaki Okazaki. Semi-Supervised Formality Style Transfer with Consistency Training. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 4689–4701, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.321)

URL Code DOI
Yi Zhou, Masahiro Kaneko, and Danushka Bollegala. Sense Embeddings are also Biased – Evaluating Social Biases in Static and Contextualised Sense Embeddings. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 1924–1935, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.135)

URL DOI
Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Word-level Perturbation Considering Word Length and Compositional Subwords. In Findings of the Association for Computational Linguistics: ACL 2022 (Findings of ACL), pages 3268–3275, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.findings-acl.258)

URL Code DOI
Sho Takase, Tatsuya Hiraoka, and Naoaki Okazaki. Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation. In Findings of the Association for Computational Linguistics: ACL 2022 (Findings of ACL), pages 2536–2541, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.findings-acl.199)

URL DOI
Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks. In Proceedings of the Sixth Workshop on Structured Prediction for NLP (SPNLP), pages 11–21, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.spnlp-1.2)

URL Code DOI
Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zenan Zhai, Zubair Afzal, Trevor Cohn, Timothy Baldwin, and Karin Verspoor. The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents. In European Conference on Information Retrieval (ECIR), pages 400–407, April 2022.
Masahiro Kaneko and Danushka Bollegala. Unmasking the Mask – Evaluating Social Biases in Masked Language Models. In Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI), pages 11954–11962, Vancouver, BC, Canada, February 2022. (doi: 10.1609/aaai.v36i11.21453)

URL DOI
Qian Sun, Aili Shen, Hiyori Yoshikawa, Chunpeng Ma, Daniel Beck, Tomoya Iwakura, and Timothy Baldwin. Evaluating Hierarchical Document Categorisation. In Proceedings of the The 19th Annual Workshop of the Australasian Language Technology Association (ALTA), pages 179–184, December 2021.
Hiroki Iida and Naoaki Okazaki. Incorporating Semantic Textual Similarity and Lexical Matching for Information Retrieval. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 582–591, Shanghai, China, November 2021.

URL
Shota Koyama, Hiroya Takamura, and Naoaki Okazaki. Various Errors Improve Neural Grammatical Error Correction. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 251–261, Shanghai, China, November 2021.

URL
Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, and Karin Verspoor. Overview of ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents. In Experimental IR Meets Multilinguality, Multimodality, and Interaction: 12th International Conference of the CLEF Association (CLEF), September 2021. (doi: 10.1007/978-3-030-85251-1_20)

URL DOI
Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, and Karin Verspoor. Extended Overview of ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents. In Proceedings of the Working Notes of CLEF 2021, volume 2936, pages 693–709, September 2021.

URL
Kosuke Yamada, Yuta Hitomi, Hideaki Tamori, Ryohei Sasano, Naoaki Okazaki, Kentaro Inui, and Koichi Takeda. Transformer-based Lexically Constrained Headline Generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4085–4090, Online and Punta Cana, Dominican Republic, November 2021. (doi: 10.18653/v1/2021.emnlp-main.335)

URL Code DOI
Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, and Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. In Proceedings of the 3rd Conference on Automated Knowledge Base Construction (AKBC), pages (19 pages), October 2021.

URL Slides
Hiyori Yoshikawa, Tomoya Iwakura, Kimi Kaneko, Hiroaki Yoshida, Yasutaka Kumano, Kazutaka Shimada, Rafal Rzepka, and Patrycja Swieczkowska. Tell Me What You Read: Automatic Expertise-Based Annotator Assignment for Text Annotation in Expert Domains. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 1575–1585, Held Online, September 2021.

URL
Ayana Niwa, Keisuke Nishiguchi, and Naoaki Okazaki. Predicting Antonyms in Context using BERT. In Proceedings of the 14th International Conference on Natural Language Generation (INLG), pages 48–54, Aberdeen, Scotland, UK, August 2021.

URL
Keiji Yasuda, Ichiro Yamada, Naoaki Okazaki, Hideki Tanaka, Hidehiro Asaka, Takeshi Anzai, and Fumiaki Sugaya. Field Experiments of Real Time Foreign News Distribution Powered by MT. In Proceedings of Machine Translation Summit XVIII: Users and Providers Track (MT Summit), pages 227–232, Virtual, August 2021.

URL
Raj Dabre, Aizhan Imankulova, and Masahiro Kaneko. Studying The Impact Of Document-level Context On Simultaneous Neural Machine Translation. In Proceedings of the 18th Biennial Machine Translation Summit (Volume 1: Research Track) (MT Summit), pages 202–214, Virtual, August 2021.

URL
Hiyori Yoshikawa, Saber A. Akhondi, Camilo Thorne, Christian Druckenbrodt, Ralph Hoessel, Zenan Zhai, Jiayuan He, Timothy Baldwin, and Karin Verspoor. Chemical Reaction Reference Resolution in Patents. In Proceedings of the 2nd Workshop on on Patent Text Mining and Semantic Technologies, pages 10–17, July 2021.

URL
Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Joint Optimization of Tokenization and Downstream Model. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (Findings of ACL), pages 244–255, Online, August 2021. (doi: 10.18653/v1/2021.findings-acl.21)

URL Code DOI
Aomi Koyama, Kengo Hotate, Masahiro Kaneko, and Mamoru Komachi. Comparison of Grammatical Error Correction Using Back-Translation Models. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop (NAACL SRW), pages 126–135, Online, June 2021. (doi: 10.18653/v1/2021.naacl-srw.16)

URL Video DOI
Seiichiro Kondo, Kengo Hotate, Tosho Hirasawa, Masahiro Kaneko, and Mamoru Komachi. Sentence Concatenation Approach to Data Augmentation for Neural Machine Translation. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop (NAACL SRW), pages 143–149, Online, June 2021. (doi: 10.18653/v1/2021.naacl-srw.18)

URL DOI
Sho Takase and Shun Kiyono. Rethinking Perturbations in Encoder-Decoders for Fast Training. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 5767–5780, Online, June 2021. (doi: 10.18653/v1/2021.naacl-main.460)

URL DOI
Chunpeng Ma, Aili Shen, Hiyori Yoshikawa, Tomoya Iwakura, Daniel Beck, and Timothy Baldwin. On the (In)Effectiveness of Images for Text Classification. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 42–48, Online, April 2021. (doi: 10.18653/v1/2021.eacl-main.4)

URL DOI
Masahiro Kaneko and Danushka Bollegala. Debiasing Pre-trained Contextualised Embeddings. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 1256–1266, Online, April 2021.

URL Code
Masahiro Kaneko and Danushka Bollegala. Dictionary-based Debiasing of Pre-trained Word Embeddings. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 212–223, Online, April 2021. (doi: 10.18653/v1/2021.eacl-main.16)

URL Code DOI
Zhishen Yang and Naoaki Okazaki. Image Caption Generation for News Articles. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), pages 1941–1951, Barcelona, Spain (Online), December 2020. (doi: 10.18653/v1/2020.coling-main.176)

URL Code DOI
Sho Takase and Sosuke Kobayashi. All Word Embeddings from One Embedding. In Proceedings of the 34th Conference on Neural Information Processing System (NeurIPS), pages 3775–3785, December 2020.

URL arXiv Code
Won Ik Cho, Sangwhan Moon, and Youngsook Song. Open Korean Corpora: A Practical Report. In Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS), pages 85–93, Online, November 2020. (doi: 10.18653/v1/2020.nlposs-1.12)

URL DOI
Shin Kanouchi, Masato Neishi, Yuta Hayashibe, Hiroki Ouchi, and Naoaki Okazaki. You May Like This Hotel Because ...: Identifying Evidence for Explainable Recommendations. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP), pages 890–899, Suzhou, China, December 2020.

URL
Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Optimizing Word Segmentation for Downstream Task. In Findings of the Association for Computational Linguistics: EMNLP 2020 (Findings of EMNLP), pages 1341–1351, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.120)

URL DOI
Won Ik Cho, Youngki Moon, Sangwhan Moon, Seok Min Kim, and Nam Soo Kim. Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives. In Findings of the Association for Computational Linguistics: EMNLP 2020 (Findings of EMNLP), pages 329–339, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.31)

URL DOI
Sangwhan Moon and Naoaki Okazaki. PatchBERT: Just-in-Time, Out-of-Vocabulary Patching. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7846–7852, Online, November 2020. (doi: 10.18653/v1/2020.emnlp-main.631)

URL DOI
Wiem Ben Rim and Naoaki Okazaki. SWAGex at SemEval-2020 Task 4: Commonsense Explanation as Next Event Prediction. In Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval), pages 422–429, Barcelona (online), December 2020.

URL
Zhishen Yang, Lars Wolfsteller, and Naoaki Okazaki. TextLearner at SemEval-2020 Task 10: A Contextualized Ranking System in Solving Emphasis Selection in Text. In Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval), pages 1691–1697, Barcelona (online), December 2020.

URL
Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, and Naoaki Okazaki. It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1640–1649, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.149)

URL DOI
Emanuele Bugliarello and Naoaki Okazaki. Enhancing Machine Translation with Dependency-Aware Self-Attention. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1618–1627, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.147)

URL DOI
Zixia Jia, Youmi Ma, Jiong Cai, and Kewei Tu. Semi-Supervised Semantic Dependency Parsing Using CRF Autoencoders. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 6795–6805, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.607)

URL DOI
Kazuki Matsumaru, Sho Takase, and Naoaki Okazaki. Improving Truthfulness of Headline Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1335–1346, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.123)

URL DOI
Matsuno Shogo, Mizuki Sakae, and Sakaki Takeshi. Improved Advertisement Targeting via Fine-grained Location Prediction using Twitter. In Companion of The 2020 Web Conference 2020 (WWW), pages 527–532, Taipei, Taiwan, 2020. (doi: 10.1145/3366424.3382118)

URL DOI
Sangwhan Moon and Naoaki Okazaki. Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword Tokenization. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), pages 3490–3497, Marseille, France, May 2020.

URL
Sho Shimazu, Sho Takase, Toshiaki Nakazawa, and Naoaki Okazaki. Evaluation Dataset for Zero Pronoun in Japanese to English Translation. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), pages 3630–3634, Marseille, France, May 2020.

URL
Sakae Mizuki and Naoaki Okazaki. Analyzing the Variation Property of Contextualized Word Representations. In AI 2019: Advances in Artificial Intelligence, pages 393–405, December 2019. (doi: 10.1007/978-3-030-35288-2_32)

URL DOI
Yuichi Sasazawa, Sho Takase, and Naoaki Okazaki. Neural Question Generation using Interrogative Phrases. In Proceedings of the 12th International Conference on Natural Language Generation (INLG), pages 106–111, Tokyo, Japan, October 2019. (doi: 10.18653/v1/W19-8613)

URL DOI
Emanuele Bugliarello, Swayambhoo Jain, and Vineeth Rakesh. Matrix Completion in the Unit Hypercube via Structured Matrix Factorization. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI), pages 2038–2044, August 2019. (doi: 10.24963/ijcai.2019/282)

URL DOI
Tatsuya Hiraoka, Hiroyuki Shindo, and Yuji Matsumoto. Stochastic Tokenization with a Language Model for Neural Text Classification. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1620–1629, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1158)

URL DOI
Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 2102–2113, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1202)

URL DOI
Sho Takase and Naoaki Okazaki. Positional Encoding to Control Output Sequence Length. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (NAACL), pages 3999–4004, Minneapolis, Minnesota, June 2019. (doi: 10.18653/v1/N19-1401)

URL DOI
Zhishen Yang, Sam Vijlbrief, and Naoaki Okazaki. TokyoTech_NLP at SemEval-2019 Task 3: Emotion-related Symbols in Emotion Detection. In Proceedings of the 13th International Workshop on Semantic Evaluation (SemEval), pages 350–354, Minneapolis, Minnesota, USA, June 2019. (doi: 10.18653/v1/S19-2061)

URL DOI
Sho Takase, Jun Suzuki, and Masaaki Nagata. Character n-gram Embeddings to Improve RNN Language Models. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), pages 5074–5082, January 2019.

arXiv
Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Reducing Odd Generation from Neural Headline Generation. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC), Hong Kong, December 2018.

URL
Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation and Dialectometry. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC), Hong Kong, December 2018.

URL
Sho Takase, Jun Suzuki, and Masaaki Nagata. Direct Output Connection for a High-Rank Language Model. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4599–4609, Brussels, Belgium, October 2018. (doi: 10.18653/v1/D18-1489)

URL DOI
Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models. In Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pages 74–81, Brussels, Belgium, November 2018. (doi: 10.18653/v1/W18-5410)

URL DOI
Diana Galvan, Naoaki Okazaki, Koji Matsuda, and Kentaro Inui. Investigating the Challenges of Temporal Relation Extraction from Clinical Text. In Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis (Louhi), pages 55–64, Brussels, Belgium, October 2018. (doi: 10.18653/v1/W18-5607)

URL DOI
Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Predicting Stances from Social Media Posts using Factorization Machines. In Proceedings of the 27th International Conference on Computational Linguistics (COLING), pages 3381–3390, August 2018.

URL
Yuta Hitomi, Hideaki Tamori, Naoaki Okazaki, and Kentaro Inui. Proofread Sentence Generation as Multi-Task Learning with Editing Operation Prediction. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP), pages 436–441, November 2017.

URL
Sosuke Kobayashi, Naoaki Okazaki, and Kentaro Inui. A Neural Language Model for Dynamically Representing the Meanings of Unknown Words and Entities in a Discourse. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP), pages 473–483, November 2017.

URL
Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. A Crowdsourcing Approach for Annotating Causal Relation Instances in Wikipedia. In Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 336–345, November 2017.

URL
Shota Sasaki, Sho Takase, Naoya Inoue, Naoaki Okazaki, and Kentaro Inui. Handling Multiword Expressions in Causality Estimation. In IWCS 2017 — 12th International Conference on Computational Semantics — Short papers, pages (6 pages), 2017.

URL
Hideaki Tamori, Yuta Hitomi, Naoaki Okazaki, and Kentaro Inui. Analyzing the Revision Logs of a Japanese Newspaper for Article Quality Assessment. In Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, pages 46–50, Copenhagen, Denmark, September 2017. (doi: 10.18653/v1/W17-4208)

URL DOI
Sho Yokoi, Daichi Mochihashi, Ryo Takahashi, Naoaki Okazaki, and Kentaro Inui. Learning Co-Substructures by Kernel Dependence Maximization. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), pages 3329–3335, August 2017.

URL
Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 398–408, Vancouver, Canada, July 2017. (doi: 10.18653/v1/P17-1037)

URL DOI

Invited talks

Jun Suzuki, Kyosuke Nishida, and Naoaki Okazaki. A Gentle Introduction to Technologies Behind Language Models and Recent Achievement in ChatGPT. In Tutorial 2, the 27nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), May 2023.

URL Slides
Naoaki Okazaki. Neural Machine Translation and Summarization for News. In International Workshop on Speech to Speech Machine Translation (IWSSMT), November 2020.

URL
Naoaki Okazaki. Towards Natural Language Processing that Understands Context. In AI Shooting Stars Session, Artificial Intelligence — International Research and Applications: 1st Japanese-German-French DWIH Symposium, November 2018.

URL
Naoaki Okazaki. How Deep Learning Changes Natural Language Processing. In Fourth Asia Pacific Corpus Linguistics Conference (APCLC 2018), September 2018.

URL
Naoaki Okazaki. Bridging Knowledge and Text with Deep Neural Networks. In Second International Workshop on Symbolic-Neural Learning (SNL-2018), July 2018.

URL
Naoaki Okazaki. Generating Text with Deep Neural Networks. In Deep Learning: Theory, Algorithms, and Applications, March 2018.

URL

Non-refereed papers

Keito Sasagawa, Koki Maeda, Issa Sugiura, Shuhei Kurita, Naoaki Okazaki, and Daisuke Kawahara. Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model, 2024.

URL arXiv
Masahiro Kaneko, Youmi Ma, Yuki Wata, and Naoaki Okazaki. Sampling-based Pseudo-Likelihood for Membership Inference Attacks, 2024.

arXiv
Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, and Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. In Proceedings of the Fifth Widening Natural Language Processing Workshop (WiNLP2021), November 2021.
Zhishen Yang, Tosho Hirasawa, Mamoru Komachi, and Naoaki Okazaki. Do Videos Guide Translations? Evaluation on Video-guided Machine Translation dataset. In Visually Grounded Interaction and Language (ViGIL), 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2021) workshop, June 2021.

URL
Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, and Naoaki Okazaki. Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020. In First Workshop on Advances in Language and Vision Research (ALVR 2020), ACL 2020, July 2020.

arXiv
Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Named Entity Recognition and Relation Extraction using Enhanced Table Filling by Contextualized Representations, 2020.

arXiv