Publications

Awards

  1. Sponsorship Award (CyberAgent, Inc.), the 19th Symposium of Young Researcher Association for NLP Studies (2024-09-06)

    Koshiro Saito, Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

    Easily detectable LLMs without sacrificing its generative capability

    URL

  2. Encouragement Award, the 19th Symposium of Young Researcher Association for NLP Studies (2024-09-06)

    Koshiro Saito, Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

    Easily detectable LLMs without sacrificing its generative capability

    URL

  3. Best Paper Award, the 261th Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2024-09-03)

    Koshiro Saito, Sakae Mizuki, Masanari Ohi, Taishi Nakamura, Taihei Shiotani, Koki Maeda, Youmi Ma, Kakeru Hattori, Kazuki Fujii, Takumi Okamoto, Shigeki Ishida, Hiroya Takamura, Rio Yokota, Naoaki Okazaki

    Advantages of Training LLMs on Japanese Text

    URL

  4. Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

    Masanari Ohi

    Likelihood-based Mitigation of Evaluation Bias in Large Language Models

    URL

  5. Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

    Yuki Wata

    Sampling-based Membership Inference Attack to Large Language Models

    URL

  6. Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

    Mengsay Loem

    Enhancing Learning and Inference Capabilities of Language Models via Dicussions with Adversarial Utterances

    URL

  7. Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

    Ayana Niwa

    AmbiNLG: Instruction Text Disambiguation for Natural Language Generation

    URL

  8. Young Researcher’s Encouragement Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

    Shota Koyama

    N-gram F-score between the Original Text, Reference Text, and Corrected Text for Automatic Evaluation of Grammatical Error Correction

    URL

  9. Best Paper Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

    Naoaki Okazaki, Kakeru Hattori, Shota Hirai, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae Mizuki

    Swallow Corpus: Japanese Large-Scale Web Corpus

    URL

  10. Best Paper Award, the 30th Annual Meeting of The Association for Natural Language Processing (2024-03-14)

    Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Shota Hirai, Sakae Mizuki, Rio Yokota, Naoaki Okazaki

    Constructing Large Language Models with Strong Japanese Capability Through Continual Pre-training

    URL

  11. Encouragement Award, the 18th Symposium of Young Researcher Association for NLP Studies (2023-08-31)

    Youmi Ma, An Wang, Naoaki Okazaki

    Constructing Document-Level Relation Extraction Corpora in Japanese

    URL

  12. Sponsorship Award (PKSHA Technology), the 18th Symposium of Young Researcher Association for NLP Studies (2023-08-31)

    Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

    OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples

    URL

  13. Sponsorship Award (HAKUHODO Technologies), the 18th Symposium of Young Researcher Association for NLP Studies (2023-08-31)

    Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

    OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples

    URL

  14. Best Paper Award (first place), the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

    Youmi Ma, An Wang, Naoaki Okazaki

    DREEAM: Guiding Attention with Evidence for Improving Document-Level Relation Extraction

    URL

  15. Best Paper Award, the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

    Sakae Mizuki, Naoaki Okazaki

    Semantic Specialization for Knowledge-based Word Sense Disambiguation

    URL

  16. Sponsorship Award (Hitachi), the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

    Kakeru Hattori, Youmi Ma, Naoaki Okazaki

    Query Suggestion and Summarization: Generating Query-Summary Pairs for Query-Focused Summarization

    URL

  17. Special Committee Award, the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

    Masahiro Kaneko, Graham Neubig, Naoaki Okazaki

    Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach

    URL

  18. Special Committee Award, the 29th Annual Meeting of The Association for Natural Language Processing (2023-03-16)

    Kyosuke Nishida, Taku Hasegawa, Koki Maeda, Kuniko Saito

    DueT: Foundation Model for Visual and Language based on Dual-adapter Tuning

    URL

  19. Best paper award (first place), the Association for Natural Language Processing (2022-03-17)

    Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki

    Optimizing Word Segmentation for Downstream Tasks by Weighting Text Vector

    URL

  20. Best Paper Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki

    Vanishing Gradient Problem and its Solution for Multi-layer Transformer

    URL

  21. Best Paper Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Koki Maeda, Masahiro Kaneko, Naoaki Okazaki

    IMPARA: Impact-based Metrics for GEC using PARAllel Data

    URL

  22. Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Ayana Niwa, Sho Takase, Naoaki Okazaki

    Non-autoregressive Generation using the Nearest Neighbor

    URL

  23. Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Hiyori Yoshikawa, Naoaki Okazaki

    Selective Prediction for Evaluating Confidence of Knowledge in Language Models

    URL

  24. Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Sayo Kada, Yosuke Yamano, Akane Niimi, Hideaki Tamori, Norito Kokai, Naoaki Okazaki, Kentaro Inui

    An Automatic Selection Method for Thumbnail Image using Movie Title

    URL

  25. Outstanding Paper Award, AKBC2021 (2021-10-05)

    Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, Naoaki Okazaki

    Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction

    URL

  26. Best Paper Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Sakae Mizuki, Naoaki Okazaki

    Hyponymy Detection using Hierarchical Code Learning

    URL

  27. Young Researcher’s Encouragement Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Tatsuya Hiraoka

    Optimizing Word Segmentation using Loss Values of Downstream Tasks

    URL

  28. Young Researcher’s Encouragement Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Youmi Ma

    Named Entity Recognition and Relation Extraction by Table-Filling using BERT

    URL

  29. Committee Special Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Kosuke Yamada ,Yuta Hitomi,Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Headline Generation that Reliably Contains the Specified Words

    URL

  30. Sponsor Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Kosuke Yamada ,Yuta Hitomi,Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Headline Generation that Reliably Contains the Specified Words

    URL

  31. TokyoTech Education Award 2020 (2021-03-02)

    Yoshihiro Miyake, Naoaki Okazaki, Takafumi Kanamori, Tsuyoshi Murata, Shin-ya Nishizaki, Kazuyuki Shudo, Kenji Kise, Masamichi Shimosaka, Masakazu Sekijima, Keisuke Yanagisawa, Masahiro Kuze, Mitsuji Sampei, Ichiro Yamanaka, Takehiko Itoh, Toru Takeuchi, Takeo Yamaguchi, Kei Sakaguchi

    University-wide Education Program of Data Science and Artificial Intelligence for Graduate Students

    URL

  32. Presentation award, the 15th NTCIR (2020-12-17)

    Yuichi Sasazawa, Naoaki Okazaki

    WER99 at the NTCIR-15 QA Lab-PoliInfo-2 Classification Task

    URL

  33. Winning the Video-guided Machine Translation (VMT) Challenge 2020 (2020-07-13)

    Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, and Naoaki Okazaki

    Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020

    URL

  34. Language resource award, the 26th Annual Meeting of The Association for Natural Language Processing (2020-03-20)

    Yuta Hitomi, Yuya Taguchi, Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Style Transfer for Abstractive Summarization in a Small-scale Resource

    URL

  35. Young researcher’s encouragement award, the 26th Annual Meeting of The Association for Natural Language Processing (2020-03-20)

    Kazuki Matsumaru

    Improving Truthfulness of Headline Generation

    URL

  36. Young researcher’s encouragement award, the 242nd Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2019-10-25)

    Tatsuya Hiraoka

    HMM-based Neural Network Capturing Latent History with RNN

    URL

  37. Best Paper Award, the 240th Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2019-06-14)

    Kazuki Matsumaru, Sho Takase, Naoaki Okazaki

    Re-examinating the Task of Headline Generation based on Textual Entailment

    URL

  38. JSAI 2018 Best Paper Award (2018-06-27)

    Sho Takase, Naoaki Okazaki, Kentaro Inui

    Learning to Compose Distributed Representations of Relational Patterns

    URL

  39. Best Paper Award, the 24th Annual Meeting of The Association for Natural Language Processing (2018-03-15)

    Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata

    Reducing odd generation in neural headline generation

    URL

Research Publications

Journal papers

  1. Vijay Daultani, Hector Vazquez Martinez, and Naoaki Okazaki. Acceptability Evaluation of Naturally Written Sentences. Journal of Information Processing, 17(3):to appear, 2024.

  2. Ao Liu, Congjian Luo, and Naoaki Okazaki. Improving Logical-Level Natural Language Generation with Topic-Conditioned Data Augmentation and Logical Form Generation. Journal of Information Processing, 31:332–343, April 2023. (doi: 10.2197/ipsjjip.31.332)

    DOI

  3. Ayana Niwa, Sho Takase, and Naoaki Okazaki. Nearest Neighbor Non-autoregressive Text Generation. Journal of Information Processing, 31:334–352, April 2023. (doi: 10.2197/ipsjjip.31.344)

    DOI

  4. Chunpeng Ma, Aili Shen, Hiyori Yoshikawa, Tomoya Iwakura, Daniel Beck, and Timothy Baldwin. On the Effectiveness of Images in Multi-Modal Text Classification: An Annotation Study. ACM Trans. Asian Low-Resour. Lang. Inf. Process., 22(3):1–19, March 2023. (doi: 10.1145/3565572)

    URL DOI

  5. Tosho Hirasawa, Masahiro Kaneko, Aizhan Imankulova, and Mamoru Komachi. Pre-Trained Word Embedding and Language Model Improve Multimodal Machine Translation: A Case Study in Multi30K. IEEE Access, 10:67653–67668, 2022. (doi: 10.1109/ACCESS.2022.3185243)

    DOI

  6. Zhishen Yang, Tosho Hirasawa, Mamoru Komachi, and Naoaki Okazaki. Why videos do not guide translations in video-guided machine translation? An empirical evaluation of video-guided machine translation dataset. Journal of Information Processing, 30:388–396, May 2022. (doi: 10.2197/ipsjjip.30.388)

    DOI

  7. Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Named Entity Recognition and Relation Extraction Using Enhanced Table Filling by Contextualized Representations. 自然言語処理, 29(1):187–223, March 2022. (doi: 10.5715/jnlp.29.187)

    DOI

  8. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Recurrent Neural Hidden Markov Model for High-Order Transition. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 21(2):1–15, March 2022. (doi: 10.1145/3476511)

    URL DOI

  9. Emanuele Bugliarello, Ryan Cotterell, Naoaki Okazaki, and Desmond Elliott. Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs. Transactions of the Association for Computational Linguistics, 9:978–994, September 2021. (doi: 10.1162/tacl_a_00408)

    URL DOI

  10. Ayana Niwa, Naoaki Okazaki, Kohei Wakimoto, Keisuke Nishiguchi, and Masataka Mouri. Construction of a Corpus of Rhetorical Devices in Slogans and Structural Analysis of Antitheses. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 20(6), November 2021. (doi: 10.1145/3465218)

    DOI

  11. Sangwhan Moon and Naoaki Okazaki. The Effects and Mitigation of Out-of-Vocabulary in Universal Language Models. Journal of Information Processing, 29:490–503, July 2021. (doi: 10.2197/ipsjjip.29.490)

    DOI

  12. Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation for 48 Low-resource Japanese Dialects. Journal of Natural Language Processing, 27(4):781–800, December 2020. (doi: 10.5715/jnlp.27.781)

    DOI

  13. Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. Journal of Natural Language Processing, 27(3):599–626, September 2020. (doi: 10.5715/jnlp.27.599)

    DOI

  14. Diana Galvan-Sosa, Koji Matsuda, Naoaki Okazaki, and Kentaro Inui. Empirical Exploration of the Challenges in Temporal Relation Extraction from Clinical Text. Journal of Natural Language Processing, 27(2):383–409, June 2020. (doi: 10.5715/jnlp.27.383)

    DOI

  15. Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. Stance Detection Attending External Knowledge from Wikipedia. Journal of Information Processing, 27:499–506, August 2019. (doi: 10.2197/ipsjjip.27.499)

    DOI

  16. Masatoshi Suzuki, Koji Matsuda, Satoshi Sekine, Naoaki Okazaki, and Kentaro Inui. A Joint Neural Model for Fine-Grained Named Entity Classification of Wikipedia Articles. IEICE Transactions on Information and Systems, Special Section on Semantic Web and Linked Data, E101.D(1):73–81, January 2018. (doi: 10.1587/transinf.2017SWP0005)

    DOI

  17. Ran Tian, Naoaki Okazaki, and Kentaro Inui. The mechanism of additive composition. Machine Learning, 106(7):1083–1130, July 2017. (doi: 10.1007/s10994-017-5634-8)

    DOI

  18. Shuangshuang Zhou, Naoaki Okazaki, Koji Matsuda, Ran Tian, and Kentaro Inui. Supervised Approaches for Japanese Wikification. Journal of Information Processing, 25:341–350, April 2017. (doi: 10.2197/ipsjjip.25.341)

    DOI

International conferences

  1. Ryuto Koike, Masahiro Kaneko, and Naoaki Okazaki. How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection. In Findings of the Association for Computational Linguistics: EMNLP 2024 (EMNLP), pages (to appear), Miami, USA, November 2024.

  2. Marco Cognetta, Vilém Zouhar, and Naoaki Okazaki. Distributional Properties of Subword Regularization. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages (to appear), Miami, USA, November 2024.

  3. Shota Koyama, Ryo Nagata, Hiroya Takamura, and Naoaki Okazaki. n-gram F-score for Evaluating Grammatical Error Correction. In Proceedings of the 17th International Natural Language Generation Conference, pages (to appear), Tokyo, Japan, September 2024.

  4. Naoaki Okazaki, Kakeru Hattori, Hirai Shota, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, and Sakae Mizuki. Building a Large Japanese Web Corpus for Large Language Models. In Proceedings of the First Conference on Language Modeling (COLM), pages (to appear), University of Pennsylvania, USA, October 2024.

  5. Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Hirai Shota, Sakae Mizuki, Rio Yokota, and Naoaki Okazaki. Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities. In Proceedings of the First Conference on Language Modeling (COLM), pages (to appear), University of Pennsylvania, USA, October 2024.

  6. Mengsay Loem, Masahiro Kaneko, and Naoaki Okazaki. SAIE Framework: Support Alone Isn’t Enough - Advancing LLM Training with Adversarial Remarks. In Proceedings of the 27th European Conference on Artificial Intelligence (ECAI), pages (to appear), Santiago de Compostela, Spain, October 2024.

  7. Koki Maeda, Tosho Hirasawa, Atsushi Hashimoto, Jun Harashima, Leszek Rybicki, Fukasawa Yusuke, and Yoshitaka Ushiku. COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset a Vision-Language Benchmark. In Proceedings of the European Conference on Computer Vision (ECCV), pages (to appear), Milan, Italy, September 2024.

  8. Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, and Naoaki Okazaki. Likelihood-based Mitigation of Evaluation Bias in Large Language Models. In Lun-Wei Ku, Andre Martins, and Vivek Srikumar, editors, Findings of the Association for Computational Linguistics ACL 2024 (ACL 2024), pages 3237–3245, Bangkok, Thailand and virtual meeting, August 2024. (doi: 10.18653/v1/2024.findings-acl.193)

    URL DOI

  9. Marco Cognetta, Tatsuya Hiraoka, Rico Sennrich, Yuval Pinter, and Naoaki Okazaki. An Analysis of BPE Vocabulary Trimming in Neural Machine Translation. In Shabnam Tafreshi, Arjun Akula, João Sedoc, Aleksandr Drozd, Anna Rogers, and Anna Rumshisky, editors, Proceedings of the Fifth Workshop on Insights from Negative Results in NLP, pages 48–50, Mexico City, Mexico, June 2024. (doi: 10.18653/v1/2024.insights-1.7)

    URL DOI

  10. Marco Cognetta, Vilém Zouhar, Sangwhan Moon, and Naoaki Okazaki. Two Counterexamples to Tokenization and the Noiseless Channel. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 16897–16906, Torino, Italia, May 2024.

    URL

  11. Panatchakorn Anantaprayoon, Masahiro Kaneko, and Naoaki Okazaki. Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 6395–6408, Torino, Italia, May 2024.

    URL

  12. Youmi Ma, An Wang, and Naoaki Okazaki. Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 2567–2579, Torino, Italia, May 2024.

    URL

  13. Masahiro Kaneko and Naoaki Okazaki. Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 3955–3961, Torino, Italia, May 2024.

    URL

  14. Ryuto Koike, Masahiro Kaneko, and Naoaki Okazaki. OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples. In The 38th Annual AAAI Conference on Artificial Intelligence (AAAI), pages 21258–21266, February 2024.

  15. Koki Maeda, Shuhei Kurita, Taiki Miyanishi, and Naoaki Okazaki. Query-based Image Captioning from Multi-context 360° Images. In Findings of the Association for Computational Linguistics: EMNLP 2023 (EMNLP), pages 6940–6954, Singapore, December 2023. (doi: 10.18653/v1/2023.findings-emnlp.463)

    URL DOI

  16. Taku Hasegawa, Kyosuke Nishida, Koki Maeda, and Kuniko Saito. DueT: Image-Text Contrastive Transfer Learning with Dual-adapter Tuning. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 13607–13624, Singapore, December 2023. (doi: 10.18653/v1/2023.emnlp-main.839)

    URL DOI

  17. Trang Nguyen and Naoaki Okazaki. Causal Reasoning through Two Layers of Cognition for Improving Generalization in Visual Question Answering. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 9221–9236, Singapore, December 2023. (doi: 10.18653/v1/2023.emnlp-main.573)

    URL DOI

  18. Masahiro Kaneko and Naoaki Okazaki. Reducing Sequence Length by Predicting Edit Operations with Large Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 10017–10029, Singapore, December 2023. (doi: 10.18653/v1/2023.emnlp-main.619)

    URL DOI

  19. Youmi Ma, Bhushan Kotnis, Carolin Lawrance, Goran Glavaš, and Naoaki Okazaki. Improving Cross-Lingual Transfer for Open Information Extraction with Linguistic Feature Projection. In Proceedings of the 3rd Workshop on Multi-lingual Representation Learning (MRL), pages 125–138, Singapore, December 2023. (doi: 10.18653/v1/2023.mrl-1.11)

    URL DOI

  20. Trang Nguyen, Amin Mansouri, Kanika Madan, Khuong Duy Nguyen, Kartik Ahuja, Dianbo Liu, and Yoshua Bengio. Reusable Slotwise Mechanisms. In A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine, editors, Advances in Neural Information Processing Systems (NeurIPS), volume 36, pages 23533–23556, 2023.

    URL

  21. Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated. In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 2: Short Papers) (AACL), pages 29–36, Nusa Dua, Bali, November 2023. (doi: 10.18653/v1/2023.ijcnlp-short.4)

    URL DOI

  22. Masayasu Muraoka, Bishwaranjan Bhattacharjee, Michele Merler, Graeme Blackwood, Yulong Li, and Yang Zhao. Cross-Lingual Transfer of Large Language Model by Visually-Derived Supervision Toward Low-Resource Languages. In Proceedings of the 31th ACM International Conference on Multimedia (MM ’23), pages 3637–3646, October 2023. (doi: 10.1145/3581783.3611992)

    DOI

  23. Yang Zhao, Tetsuya Nasukawa, Masayasu Muraoka, and Bishwaranjan Bhattacharjee. A Simple Yet Strong Domain-Agnostic De-bias Method for Zero-Shot Sentiment Classification. In Findings of the Association for Computational Linguistics: ACL 2023, pages 3923–3931, Toronto, Canada, July 2023.

    URL

  24. Mengsay Loem, Masahiro Kaneko, Sho Takase, and Naoaki Okazaki. Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods. In Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023) (BEA), pages 205–219, Toronto, Canada, July 2023.

    URL

  25. An Wang, Junfeng Jiang, Youmi Ma, Ao Liu, and Naoaki Okazaki. Generative Data Augmentation for Aspect Sentiment Quad Prediction. In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM), pages 128–140, Toronto, Canada, July 2023. (doi: 10.18653/v1/2023.starsem-1.12)

    URL DOI

  26. Marco Cognetta, Sangwhan Moon, Lawrence Wolf-Sonkin, and Naoaki Okazaki. Parameter-Efficient Korean Character-Level Language Modeling. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 2350–2356, Dubrovnik, Croatia, May 2023.

    URL

  27. Hiyori Yoshikawa and Naoaki Okazaki. Selective-LAMA: Selective Prediction for Confidence-Aware Evaluation of Language Models. In Findings of the Association for Computational Linguistics: EACL 2023 (Findings of EACL), pages 2017–2028, Dubrovnik, Croatia, May 2023.

    URL

  28. Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Comparing Intrinsic Gender Bias Evaluation Measures without using Human Annotated Examples. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 2857–2863, Dubrovnik, Croatia, May 2023.

    URL

  29. Sakae Mizuki and Naoaki Okazaki. Semantic Specialization for Knowledge-based Word Sense Disambiguation. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 3457–3470, Dubrovnik, Croatia, May 2023.

    URL

  30. Youmi Ma, An Wang, and Naoaki Okazaki. DREEAM: Guiding Attention with Evidence for Improving Document-Level Relation Extraction. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 1971–1983, Dubrovnik, Croatia, May 2023.

    URL

  31. Zhishen Yang, Raj Dabre, Hideki Tanaka, and Naoaki Okazaki. SciCap+: A Knowledge Augmented Dataset to Study the Challenges of Scientific Figure Captioning. In Proceedings of the Workshop on Scientific Document Understanding, co-located with 37th AAAI Conference on Artificial Intelligence (CEUR Workshop Proceedings), page (Paper13), Washington DC, USA, February 2023.

    URL

  32. Ao Liu, Haoyu Dong, Naoaki Okazaki, Shi Han, and Dongmei Zhang. PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 5531–5546, Abu Dhabi, United Arab Emirates, December 2022.

    URL

  33. Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Gender Bias in Meta-Embeddings. In Findings of the Association for Computational Linguistics: EMNLP 2022 (EMNLP), pages 3118–3133, Abu Dhabi, United Arab Emirates, December 2022.

    URL

  34. Hiroki Iida and Naoaki Okazaki. Unsupervised Domain Adaptation for Sparse Retrieval by Filling Vocabulary and Word Frequency Gaps. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (AACL), pages 752–765, Online, November 2022.

    URL

  35. Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Debiasing Isn’t Enough! – on the Effectiveness of Debiasing MLMs and Their Social Biases in Downstream Tasks. In Proceedings of the 29th International Conference on Computational Linguistics (COLING), pages 1299–1310, Gyeongju, Republic of Korea, October 2022.

    URL

  36. Koki Maeda, Masahiro Kaneko, and Naoaki Okazaki. IMPARA: Impact based Metric for GEC using Parallel Data. In Proceedings of the 29th International Conference on Computational Linguistics (COLING), pages 3578–3588, Gyeongju, Republic of Korea, October 2022.

    URL

  37. Yidong Wang, Hao Wu, Ao Liu, Wenxin Hou, Zhen Wu, Jindong Wang, Takahiro Shinozaki, Manabu Okumura, and Yue Zhang. Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction. In Proceedings of the 29th International Conference on Computational Linguistics (COLING), pages 7075–7085, Gyeongju, Republic of Korea, October 2022.

    URL

  38. Hsuan-Yu Kuo, Youmi Ma, and Naoaki Okazaki. Annotating Entity and Causal Relationships on Japanese Vehicle Recall Information. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 783–791, Manila, Philippines, October 2022.

    URL

  39. Vijay Daultani and Naoaki Okazaki. Improving Automatic Evaluation of Acceptability Based on Language Models with a Coarse Sentence Representation. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 109–118, Manila, Philippines, October 2022.

    URL

  40. Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, and Karin Verspoor. Overview of ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents. In International Conference of the Cross-Language Evaluation Forum for European Languages (CLEF), pages 521–540, September 2022.

  41. Mengsay Loem, Sho Takase, Masahiro Kaneko, and Naoaki Okazaki. ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop (NAACL SRW), pages 16–24, Hybrid: Seattle, Washington + Online, July 2022. (doi: 10.18653/v1/2022.naacl-srw.3)

    URL DOI

  42. Haoyu Dong, Zhoujun Cheng, Xinyi He, Mengyu Zhou, Anda Zhou, Fan Zhou, Ao Liu, Shi Han, and Dongmei Zhang. Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (IJCAI), pages 5426–5435, July 2022. (doi: 10.24963/ijcai.2022/761)

    URL DOI

  43. Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, and Naoaki Okazaki. Gender Bias in Masked Language Models for Multiple Languages. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 2740–2750, Seattle, United States, July 2022. (doi: 10.18653/v1/2022.naacl-main.197)

    URL Code DOI

  44. Yu Pan, Zeyong Su, Ao Liu, Wang Jingquan, Nannan Li, and Zenglin Xu. A Unified Weight Initialization Paradigm for Tensorial Convolutional Neural Networks. In International Conference on Machine Learning (ICML), pages 17238–17257, Baltimore, Maryland, United States, July 2022.

    URL

  45. Won Ik Cho, Sangwhan Moon, Jongin Kim, Seokmin Kim, and Nam Soo Kim. StyleKQC: A Style-Variant Paraphrase Corpus for Korean Questions and Commands. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 7122–7128, Marseille, France, June 2022.

    URL

  46. Hwichan Kim, Sangwhan Moon, Naoaki Okazaki, and Mamoru Komachi. Learning How to Translate North Korean through South Korean. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 6711–6718, Marseille, France, June 2022.

    URL

  47. Sangwhan Moon, Won Ik Cho, Hye Joo Han, Naoaki Okazaki, and Nam Soo Kim. OpenKorPOS: Democratizing Korean Tokenization with Voting-Based Open Corpus Annotation. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 4975–4983, Marseille, France, June 2022.

    URL

  48. Sho Takase and Naoaki Okazaki. Multi-Task Learning for Cross-Lingual Abstractive Summarization. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 3008–3016, Marseille, France, June 2022.

    URL

  49. Yujin Takahashi, Masahiro Kaneko, Masato Mita, and Mamoru Komachi. ProQE: Proficiency-wise Quality Estimation dataset for Grammatical Error Correction. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 5994–6000, Marseille, France, June 2022.

    URL

  50. Masahiro Kaneko, Sho Takase, Ayana Niwa, and Naoaki Okazaki. Interpretability for Language Learners Using Example-Based Grammatical Error Correction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 7176–7187, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.496)

    URL Code DOI

  51. Ao Liu, An Wang, and Naoaki Okazaki. Semi-Supervised Formality Style Transfer with Consistency Training. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 4689–4701, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.321)

    URL Code DOI

  52. Yi Zhou, Masahiro Kaneko, and Danushka Bollegala. Sense Embeddings are also Biased – Evaluating Social Biases in Static and Contextualised Sense Embeddings. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 1924–1935, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.135)

    URL DOI

  53. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Word-level Perturbation Considering Word Length and Compositional Subwords. In Findings of the Association for Computational Linguistics: ACL 2022 (Findings of ACL), pages 3268–3275, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.findings-acl.258)

    URL Code DOI

  54. Sho Takase, Tatsuya Hiraoka, and Naoaki Okazaki. Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation. In Findings of the Association for Computational Linguistics: ACL 2022 (Findings of ACL), pages 2536–2541, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.findings-acl.199)

    URL DOI

  55. Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks. In Proceedings of the Sixth Workshop on Structured Prediction for NLP (SPNLP), pages 11–21, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.spnlp-1.2)

    URL Code DOI

  56. Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zenan Zhai, Zubair Afzal, Trevor Cohn, Timothy Baldwin, and Karin Verspoor. The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents. In European Conference on Information Retrieval (ECIR), pages 400–407, April 2022.

  57. Masahiro Kaneko and Danushka Bollegala. Unmasking the Mask – Evaluating Social Biases in Masked Language Models. In Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI), pages 11954–11962, Vancouver, BC, Canada, February 2022. (doi: 10.1609/aaai.v36i11.21453)

    URL DOI

  58. Qian Sun, Aili Shen, Hiyori Yoshikawa, Chunpeng Ma, Daniel Beck, Tomoya Iwakura, and Timothy Baldwin. Evaluating Hierarchical Document Categorisation. In Proceedings of the The 19th Annual Workshop of the Australasian Language Technology Association (ALTA), pages 179–184, December 2021.

  59. Hiroki Iida and Naoaki Okazaki. Incorporating Semantic Textual Similarity and Lexical Matching for Information Retrieval. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 582–591, Shanghai, China, November 2021.

    URL

  60. Shota Koyama, Hiroya Takamura, and Naoaki Okazaki. Various Errors Improve Neural Grammatical Error Correction. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 251–261, Shanghai, China, November 2021.

    URL

  61. Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, and Karin Verspoor. Overview of ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents. In Experimental IR Meets Multilinguality, Multimodality, and Interaction: 12th International Conference of the CLEF Association (CLEF), September 2021. (doi: 10.1007/978-3-030-85251-1_20)

    URL DOI

  62. Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, and Karin Verspoor. Extended Overview of ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents. In Proceedings of the Working Notes of CLEF 2021, volume 2936, pages 693–709, September 2021.

    URL

  63. Kosuke Yamada, Yuta Hitomi, Hideaki Tamori, Ryohei Sasano, Naoaki Okazaki, Kentaro Inui, and Koichi Takeda. Transformer-based Lexically Constrained Headline Generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4085–4090, Online and Punta Cana, Dominican Republic, November 2021. (doi: 10.18653/v1/2021.emnlp-main.335)

    URL Code DOI

  64. Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, and Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. In Proceedings of the 3rd Conference on Automated Knowledge Base Construction (AKBC), pages (19 pages), October 2021.

    URL Slides

  65. Hiyori Yoshikawa, Tomoya Iwakura, Kimi Kaneko, Hiroaki Yoshida, Yasutaka Kumano, Kazutaka Shimada, Rafal Rzepka, and Patrycja Swieczkowska. Tell Me What You Read: Automatic Expertise-Based Annotator Assignment for Text Annotation in Expert Domains. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 1575–1585, Held Online, September 2021.

    URL

  66. Ayana Niwa, Keisuke Nishiguchi, and Naoaki Okazaki. Predicting Antonyms in Context using BERT. In Proceedings of the 14th International Conference on Natural Language Generation (INLG), pages 48–54, Aberdeen, Scotland, UK, August 2021.

    URL

  67. Keiji Yasuda, Ichiro Yamada, Naoaki Okazaki, Hideki Tanaka, Hidehiro Asaka, Takeshi Anzai, and Fumiaki Sugaya. Field Experiments of Real Time Foreign News Distribution Powered by MT. In Proceedings of Machine Translation Summit XVIII: Users and Providers Track (MT Summit), pages 227–232, Virtual, August 2021.

    URL

  68. Raj Dabre, Aizhan Imankulova, and Masahiro Kaneko. Studying The Impact Of Document-level Context On Simultaneous Neural Machine Translation. In Proceedings of the 18th Biennial Machine Translation Summit (Volume 1: Research Track) (MT Summit), pages 202–214, Virtual, August 2021.

    URL

  69. Hiyori Yoshikawa, Saber A. Akhondi, Camilo Thorne, Christian Druckenbrodt, Ralph Hoessel, Zenan Zhai, Jiayuan He, Timothy Baldwin, and Karin Verspoor. Chemical Reaction Reference Resolution in Patents. In Proceedings of the 2nd Workshop on on Patent Text Mining and Semantic Technologies, pages 10–17, July 2021.

    URL

  70. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Joint Optimization of Tokenization and Downstream Model. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (Findings of ACL), pages 244–255, Online, August 2021. (doi: 10.18653/v1/2021.findings-acl.21)

    URL Code DOI

  71. Aomi Koyama, Kengo Hotate, Masahiro Kaneko, and Mamoru Komachi. Comparison of Grammatical Error Correction Using Back-Translation Models. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop (NAACL SRW), pages 126–135, Online, June 2021. (doi: 10.18653/v1/2021.naacl-srw.16)

    URL Video DOI

  72. Seiichiro Kondo, Kengo Hotate, Tosho Hirasawa, Masahiro Kaneko, and Mamoru Komachi. Sentence Concatenation Approach to Data Augmentation for Neural Machine Translation. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop (NAACL SRW), pages 143–149, Online, June 2021. (doi: 10.18653/v1/2021.naacl-srw.18)

    URL DOI

  73. Sho Takase and Shun Kiyono. Rethinking Perturbations in Encoder-Decoders for Fast Training. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 5767–5780, Online, June 2021. (doi: 10.18653/v1/2021.naacl-main.460)

    URL DOI

  74. Chunpeng Ma, Aili Shen, Hiyori Yoshikawa, Tomoya Iwakura, Daniel Beck, and Timothy Baldwin. On the (In)Effectiveness of Images for Text Classification. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 42–48, Online, April 2021. (doi: 10.18653/v1/2021.eacl-main.4)

    URL DOI

  75. Masahiro Kaneko and Danushka Bollegala. Debiasing Pre-trained Contextualised Embeddings. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 1256–1266, Online, April 2021.

    URL Code

  76. Masahiro Kaneko and Danushka Bollegala. Dictionary-based Debiasing of Pre-trained Word Embeddings. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 212–223, Online, April 2021. (doi: 10.18653/v1/2021.eacl-main.16)

    URL Code DOI

  77. Zhishen Yang and Naoaki Okazaki. Image Caption Generation for News Articles. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), pages 1941–1951, Barcelona, Spain (Online), December 2020. (doi: 10.18653/v1/2020.coling-main.176)

    URL Code DOI

  78. Sho Takase and Sosuke Kobayashi. All Word Embeddings from One Embedding. In Proceedings of the 34th Conference on Neural Information Processing System (NeurIPS), pages 3775–3785, December 2020.

    URL arXiv Code

  79. Won Ik Cho, Sangwhan Moon, and Youngsook Song. Open Korean Corpora: A Practical Report. In Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS), pages 85–93, Online, November 2020. (doi: 10.18653/v1/2020.nlposs-1.12)

    URL DOI

  80. Shin Kanouchi, Masato Neishi, Yuta Hayashibe, Hiroki Ouchi, and Naoaki Okazaki. You May Like This Hotel Because ...: Identifying Evidence for Explainable Recommendations. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP), pages 890–899, Suzhou, China, December 2020.

    URL

  81. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Optimizing Word Segmentation for Downstream Task. In Findings of the Association for Computational Linguistics: EMNLP 2020 (Findings of EMNLP), pages 1341–1351, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.120)

    URL DOI

  82. Won Ik Cho, Youngki Moon, Sangwhan Moon, Seok Min Kim, and Nam Soo Kim. Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives. In Findings of the Association for Computational Linguistics: EMNLP 2020 (Findings of EMNLP), pages 329–339, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.31)

    URL DOI

  83. Sangwhan Moon and Naoaki Okazaki. PatchBERT: Just-in-Time, Out-of-Vocabulary Patching. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7846–7852, Online, November 2020. (doi: 10.18653/v1/2020.emnlp-main.631)

    URL DOI

  84. Wiem Ben Rim and Naoaki Okazaki. SWAGex at SemEval-2020 Task 4: Commonsense Explanation as Next Event Prediction. In Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval), pages 422–429, Barcelona (online), December 2020.

    URL

  85. Zhishen Yang, Lars Wolfsteller, and Naoaki Okazaki. TextLearner at SemEval-2020 Task 10: A Contextualized Ranking System in Solving Emphasis Selection in Text. In Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval), pages 1691–1697, Barcelona (online), December 2020.

    URL

  86. Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, and Naoaki Okazaki. It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1640–1649, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.149)

    URL DOI

  87. Emanuele Bugliarello and Naoaki Okazaki. Enhancing Machine Translation with Dependency-Aware Self-Attention. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1618–1627, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.147)

    URL DOI

  88. Zixia Jia, Youmi Ma, Jiong Cai, and Kewei Tu. Semi-Supervised Semantic Dependency Parsing Using CRF Autoencoders. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 6795–6805, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.607)

    URL DOI

  89. Kazuki Matsumaru, Sho Takase, and Naoaki Okazaki. Improving Truthfulness of Headline Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1335–1346, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.123)

    URL DOI

  90. Matsuno Shogo, Mizuki Sakae, and Sakaki Takeshi. Improved Advertisement Targeting via Fine-grained Location Prediction using Twitter. In Companion of The 2020 Web Conference 2020 (WWW), pages 527–532, Taipei, Taiwan, 2020. (doi: 10.1145/3366424.3382118)

    URL DOI

  91. Sangwhan Moon and Naoaki Okazaki. Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword Tokenization. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), pages 3490–3497, Marseille, France, May 2020.

    URL

  92. Sho Shimazu, Sho Takase, Toshiaki Nakazawa, and Naoaki Okazaki. Evaluation Dataset for Zero Pronoun in Japanese to English Translation. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), pages 3630–3634, Marseille, France, May 2020.

    URL

  93. Sakae Mizuki and Naoaki Okazaki. Analyzing the Variation Property of Contextualized Word Representations. In AI 2019: Advances in Artificial Intelligence, pages 393–405, December 2019. (doi: 10.1007/978-3-030-35288-2_32)

    URL DOI

  94. Yuichi Sasazawa, Sho Takase, and Naoaki Okazaki. Neural Question Generation using Interrogative Phrases. In Proceedings of the 12th International Conference on Natural Language Generation (INLG), pages 106–111, Tokyo, Japan, October 2019. (doi: 10.18653/v1/W19-8613)

    URL DOI

  95. Emanuele Bugliarello, Swayambhoo Jain, and Vineeth Rakesh. Matrix Completion in the Unit Hypercube via Structured Matrix Factorization. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI), pages 2038–2044, August 2019. (doi: 10.24963/ijcai.2019/282)

    URL DOI

  96. Tatsuya Hiraoka, Hiroyuki Shindo, and Yuji Matsumoto. Stochastic Tokenization with a Language Model for Neural Text Classification. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1620–1629, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1158)

    URL DOI

  97. Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 2102–2113, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1202)

    URL DOI

  98. Sho Takase and Naoaki Okazaki. Positional Encoding to Control Output Sequence Length. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (NAACL), pages 3999–4004, Minneapolis, Minnesota, June 2019. (doi: 10.18653/v1/N19-1401)

    URL DOI

  99. Zhishen Yang, Sam Vijlbrief, and Naoaki Okazaki. TokyoTech_NLP at SemEval-2019 Task 3: Emotion-related Symbols in Emotion Detection. In Proceedings of the 13th International Workshop on Semantic Evaluation (SemEval), pages 350–354, Minneapolis, Minnesota, USA, June 2019. (doi: 10.18653/v1/S19-2061)

    URL DOI

  100. Sho Takase, Jun Suzuki, and Masaaki Nagata. Character n-gram Embeddings to Improve RNN Language Models. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), pages 5074–5082, January 2019.

    arXiv

  101. Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Reducing Odd Generation from Neural Headline Generation. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC), Hong Kong, December 2018.

    URL

  102. Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation and Dialectometry. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC), Hong Kong, December 2018.

    URL

  103. Sho Takase, Jun Suzuki, and Masaaki Nagata. Direct Output Connection for a High-Rank Language Model. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4599–4609, Brussels, Belgium, October 2018. (doi: 10.18653/v1/D18-1489)

    URL DOI

  104. Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models. In Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pages 74–81, Brussels, Belgium, November 2018. (doi: 10.18653/v1/W18-5410)

    URL DOI

  105. Diana Galvan, Naoaki Okazaki, Koji Matsuda, and Kentaro Inui. Investigating the Challenges of Temporal Relation Extraction from Clinical Text. In Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis (Louhi), pages 55–64, Brussels, Belgium, October 2018. (doi: 10.18653/v1/W18-5607)

    URL DOI

  106. Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Predicting Stances from Social Media Posts using Factorization Machines. In Proceedings of the 27th International Conference on Computational Linguistics (COLING), pages 3381–3390, August 2018.

    URL

  107. Yuta Hitomi, Hideaki Tamori, Naoaki Okazaki, and Kentaro Inui. Proofread Sentence Generation as Multi-Task Learning with Editing Operation Prediction. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP), pages 436–441, November 2017.

    URL

  108. Sosuke Kobayashi, Naoaki Okazaki, and Kentaro Inui. A Neural Language Model for Dynamically Representing the Meanings of Unknown Words and Entities in a Discourse. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP), pages 473–483, November 2017.

    URL

  109. Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. A Crowdsourcing Approach for Annotating Causal Relation Instances in Wikipedia. In Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 336–345, November 2017.

    URL

  110. Shota Sasaki, Sho Takase, Naoya Inoue, Naoaki Okazaki, and Kentaro Inui. Handling Multiword Expressions in Causality Estimation. In IWCS 2017 — 12th International Conference on Computational Semantics — Short papers, pages (6 pages), 2017.

    URL

  111. Hideaki Tamori, Yuta Hitomi, Naoaki Okazaki, and Kentaro Inui. Analyzing the Revision Logs of a Japanese Newspaper for Article Quality Assessment. In Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, pages 46–50, Copenhagen, Denmark, September 2017. (doi: 10.18653/v1/W17-4208)

    URL DOI

  112. Sho Yokoi, Daichi Mochihashi, Ryo Takahashi, Naoaki Okazaki, and Kentaro Inui. Learning Co-Substructures by Kernel Dependence Maximization. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), pages 3329–3335, August 2017.

    URL

  113. Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 398–408, Vancouver, Canada, July 2017. (doi: 10.18653/v1/P17-1037)

    URL DOI

Invited talks

  1. Jun Suzuki, Kyosuke Nishida, and Naoaki Okazaki. A Gentle Introduction to Technologies Behind Language Models and Recent Achievement in ChatGPT. In Tutorial 2, the 27nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), May 2023.

    URL Slides

  2. Naoaki Okazaki. Neural Machine Translation and Summarization for News. In International Workshop on Speech to Speech Machine Translation (IWSSMT), November 2020.

    URL

  3. Naoaki Okazaki. Towards Natural Language Processing that Understands Context. In AI Shooting Stars Session, Artificial Intelligence — International Research and Applications: 1st Japanese-German-French DWIH Symposium, November 2018.

    URL

  4. Naoaki Okazaki. How Deep Learning Changes Natural Language Processing. In Fourth Asia Pacific Corpus Linguistics Conference (APCLC 2018), September 2018.

    URL

  5. Naoaki Okazaki. Bridging Knowledge and Text with Deep Neural Networks. In Second International Workshop on Symbolic-Neural Learning (SNL-2018), July 2018.

    URL

  6. Naoaki Okazaki. Generating Text with Deep Neural Networks. In Deep Learning: Theory, Algorithms, and Applications, March 2018.

    URL

Non-refereed papers

  1. Masahiro Kaneko, Youmi Ma, Yuki Wata, and Naoaki Okazaki. Sampling-based Pseudo-Likelihood for Membership Inference Attacks, 2024.

    arXiv

  2. Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, and Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. In Proceedings of the Fifth Widening Natural Language Processing Workshop (WiNLP2021), November 2021.

  3. Zhishen Yang, Tosho Hirasawa, Mamoru Komachi, and Naoaki Okazaki. Do Videos Guide Translations? Evaluation on Video-guided Machine Translation dataset. In Visually Grounded Interaction and Language (ViGIL), 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2021) workshop, June 2021.

    URL

  4. Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, and Naoaki Okazaki. Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020. In First Workshop on Advances in Language and Vision Research (ALVR 2020), ACL 2020, July 2020.

    arXiv

  5. Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Named Entity Recognition and Relation Extraction using Enhanced Table Filling by Contextualized Representations, 2020.

    arXiv