Publications

Awards

  1. Best paper award (first place), the Association for Natural Language Processing (2022-03-17)

    Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki

    Optimizing Word Segmentation for Downstream Tasks by Weighting Text Vector

    URL

  2. Best Paper Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki

    Vanishing Gradient Problem and its Solution for Multi-layer Transformer

    URL

  3. Best Paper Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Koki Maeda, Masahiro Kaneko, Naoaki Okazaki

    IMPARA: Impact-based Metrics for GEC using PARAllel Data

    URL

  4. Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Ayana Niwa, Sho Takase, Naoaki Okazaki

    Non-autoregressive Generation using the Nearest Neighbor

    URL

  5. Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Hiyori Yoshikawa, Naoaki Okazaki

    Selective Prediction for Evaluating Confidence of Knowledge in Language Models

    URL

  6. Special Committee Award, the 28th Annual Meeting of The Association for Natural Language Processing (2022-03-17)

    Sayo Kada, Yosuke Yamano, Akane Niimi, Hideaki Tamori, Norito Kokai, Naoaki Okazaki, Kentaro Inui

    An Automatic Selection Method for Thumbnail Image using Movie Title

    URL

  7. Outstanding Paper Award, AKBC2021 (2021-10-05)

    Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, Naoaki Okazaki

    Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction

    URL

  8. Best Paper Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Sakae Mizuki, Naoaki Okazaki

    Hyponymy Detection using Hierarchical Code Learning

    URL

  9. Young Researcher’s Encouragement Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Tatsuya Hiraoka

    Optimizing Word Segmentation using Loss Values of Downstream Tasks

    URL

  10. Young Researcher’s Encouragement Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Youmi Ma

    Named Entity Recognition and Relation Extraction by Table-Filling using BERT

    URL

  11. Committee Special Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Kosuke Yamada ,Yuta Hitomi,Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Headline Generation that Reliably Contains the Specified Words

    URL

  12. Sponsor Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Kosuke Yamada ,Yuta Hitomi,Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Headline Generation that Reliably Contains the Specified Words

    URL

  13. TokyoTech Education Award 2020 (2021-03-02)

    Yoshihiro Miyake, Naoaki Okazaki, Takafumi Kanamori, Tsuyoshi Murata, Shin-ya Nishizaki, Kazuyuki Shudo, Kenji Kise, Masamichi Shimosaka, Masakazu Sekijima, Keisuke Yanagisawa, Masahiro Kuze, Mitsuji Sampei, Ichiro Yamanaka, Takehiko Itoh, Toru Takeuchi, Takeo Yamaguchi, Kei Sakaguchi

    University-wide Education Program of Data Science and Artificial Intelligence for Graduate Students

    URL

  14. Presentation award, the 15th NTCIR (2020-12-17)

    Yuichi Sasazawa, Naoaki Okazaki

    WER99 at the NTCIR-15 QA Lab-PoliInfo-2 Classification Task

    URL

  15. Winning the Video-guided Machine Translation (VMT) Challenge 2020 (2020-07-13)

    Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, and Naoaki Okazaki

    Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020

    URL

  16. Language resource award, the 26th Annual Meeting of The Association for Natural Language Processing (2020-03-20)

    Yuta Hitomi, Yuya Taguchi, Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Style Transfer for Abstractive Summarization in a Small-scale Resource

    URL

  17. Young researcher’s encouragement award, the 26th Annual Meeting of The Association for Natural Language Processing (2020-03-20)

    Kazuki Matsumaru

    Improving Truthfulness of Headline Generation

    URL

  18. Young researcher’s encouragement award, the 242nd Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2019-10-25)

    Tatsuya Hiraoka

    HMM-based Neural Network Capturing Latent History with RNN

    URL

  19. Best Paper Award, the 240th Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2019-06-14)

    Kazuki Matsumaru, Sho Takase, Naoaki Okazaki

    Re-examinating the Task of Headline Generation based on Textual Entailment

    URL

  20. JSAI 2018 Best Paper Award (2018-06-27)

    Sho Takase, Naoaki Okazaki, Kentaro Inui

    Learning to Compose Distributed Representations of Relational Patterns

    URL

  21. Best Paper Award, the 24th Annual Meeting of The Association for Natural Language Processing (2018-03-15)

    Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata

    Reducing odd generation in neural headline generation

    URL

Research Publications

Journal papers

  1. Tosho Hirasawa, Masahiro Kaneko, Aizhan Imankulova, and Mamoru Komachi. Pre-trained Word Embedding and Language Model Improve Multimodal Machine Translation: A Case Study in Multi30K. IEEE Access:(to appear), 2022. (doi: 10.1109/ACCESS.2022.3185243)

    DOI

  2. Zhishen Yang, Tosho Hirasawa, Mamoru Komachi, and Naoaki Okazaki. Why videos do not guide translations in video-guided machine translation? An empirical evaluation of video-guided machine translation dataset. Journal of Information Processing, 30:388–396, May 2022. (doi: 10.2197/ipsjjip.30.388)

    DOI

  3. Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Named Entity Recognition and Relation Extraction Using Enhanced Table Filling by Contextualized Representations. 自然言語処理, 29(1):187–223, March 2022. (doi: 10.5715/jnlp.29.187)

    DOI

  4. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Recurrent Neural Hidden Markov Model for High-Order Transition. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 21(2):1–15, March 2022. (doi: 10.1145/3476511)

    URL DOI

  5. Emanuele Bugliarello, Ryan Cotterell, Naoaki Okazaki, and Desmond Elliott. Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs. Transactions of the Association for Computational Linguistics, 9:978–994, September 2021. (doi: 10.1162/tacl_a_00408)

    URL DOI

  6. Ayana Niwa, Naoaki Okazaki, Kohei Wakimoto, Keisuke Nishiguchi, and Masataka Mouri. Construction of a Corpus of Rhetorical Devices in Slogans and Structural Analysis of Antitheses. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 20(6), November 2021. (doi: 10.1145/3465218)

    DOI

  7. Sangwhan Moon and Naoaki Okazaki. The Effects and Mitigation of Out-of-Vocabulary in Universal Language Models. Journal of Information Processing, 29:490–503, July 2021. (doi: 10.2197/ipsjjip.29.490)

    DOI

  8. Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation for 48 Low-resource Japanese Dialects. Journal of Natural Language Processing, 27(4):781–800, December 2020. (doi: 10.5715/jnlp.27.781)

    DOI

  9. Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. Journal of Natural Language Processing, 27(3):599–626, September 2020. (doi: 10.5715/jnlp.27.599)

    DOI

  10. Diana Galvan-Sosa, Koji Matsuda, Naoaki Okazaki, and Kentaro Inui. Empirical Exploration of the Challenges in Temporal Relation Extraction from Clinical Text. Journal of Natural Language Processing, 27(2):383–409, June 2020. (doi: 10.5715/jnlp.27.383)

    DOI

  11. Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. Stance Detection Attending External Knowledge from Wikipedia. Journal of Information Processing, 27:499–506, August 2019. (doi: 10.2197/ipsjjip.27.499)

    DOI

  12. Masatoshi Suzuki, Koji Matsuda, Satoshi Sekine, Naoaki Okazaki, and Kentaro Inui. A Joint Neural Model for Fine-Grained Named Entity Classification of Wikipedia Articles. IEICE Transactions on Information and Systems, Special Section on Semantic Web and Linked Data, E101.D(1):73–81, January 2018. (doi: 10.1587/transinf.2017SWP0005)

    DOI

  13. Ran Tian, Naoaki Okazaki, and Kentaro Inui. The mechanism of additive composition. Machine Learning, 106(7):1083–1130, July 2017. (doi: 10.1007/s10994-017-5634-8)

    DOI

  14. Shuangshuang Zhou, Naoaki Okazaki, Koji Matsuda, Ran Tian, and Kentaro Inui. Supervised Approaches for Japanese Wikification. Journal of Information Processing, 25:341–350, April 2017. (doi: 10.2197/ipsjjip.25.341)

    DOI

International conferences

  1. Hsuan-Yu Kuo, Youmi Ma, and Naoaki Okazaki. Annotating Entity and Causal Relationships on Japanese Vehicle Recall Information. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages (to appear), October 2022.

  2. Vijay Daultani and Naoaki Okazaki. Improving Automatic Evaluation of Acceptability Based on Language Models with a Coarse Sentence Representation. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages (to appear), October 2022.

  3. Mengsay Loem, Sho Takase, Masahiro Kaneko, and Naoaki Okazaki. ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop (NAACL SRW), pages 16–24, Hybrid: Seattle, Washington + Online, July 2022. (doi: 10.18653/v1/2022.naacl-srw.3)

    URL DOI

  4. Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, and Naoaki Okazaki. Gender Bias in Masked Language Models for Multiple Languages. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 2740–2750, Seattle, United States, July 2022. (doi: 10.18653/v1/2022.naacl-main.197)

    URL Code DOI

  5. Won Ik Cho, Sangwhan Moon, Jong In Kim, Seokmin Kim, and Nam Soo Kim. StyleKQC: A Style-Variant Paraphrase Corpus for Korean Questions and Commands. In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), pages (to appear), Marseille, June 2022.

  6. Hwichan Kim, Sangwhan Moon, Naoaki Okazaki, and Mamoru Komachi. Learning How to Translate North Korean through South Korean. In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), pages (to appear), Marseille, June 2022.

  7. Sangwhan Moon, Won Ik Cho, Hye Joo Han, Naoaki Okazaki, and Nam Soo Kim. OpenKorPOS: Democratizing Korean Tokenization with Voting-Based Open Corpus Annotation. In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), pages (to appear), Marseille, June 2022.

  8. Sho Takase and Naoaki Okazaki. Multi-Task Learning for Cross-Lingual Abstractive Summarization. In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), pages (to appear), Marseille, June 2022.

  9. Yujin Takahashi, Masahiro Kaneko, Masato Mita, and Mamoru Komachi. Proficiency Matters Quality Estimation in Grammatical Error Correction. In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), pages (to appear), Marseille, June 2022.

  10. Masahiro Kaneko, Sho Takase, Ayana Niwa, and Naoaki Okazaki. Interpretability for Language Learners Using Example-Based Grammatical Error Correction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 7176–7187, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.496)

    URL Code DOI

  11. Ao Liu, An Wang, and Naoaki Okazaki. Semi-Supervised Formality Style Transfer with Consistency Training. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 4689–4701, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.321)

    URL Code DOI

  12. Yi Zhou, Masahiro Kaneko, and Danushka Bollegala. Sense Embeddings are also Biased – Evaluating Social Biases in Static and Contextualised Sense Embeddings. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 1924–1935, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.135)

    URL DOI

  13. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Word-level Perturbation Considering Word Length and Compositional Subwords. In Findings of the Association for Computational Linguistics: ACL 2022 (Findings of ACL), pages 3268–3275, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.findings-acl.258)

    URL Code DOI

  14. Sho Takase, Tatsuya Hiraoka, and Naoaki Okazaki. Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation. In Findings of the Association for Computational Linguistics: ACL 2022 (Findings of ACL), pages 2536–2541, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.findings-acl.199)

    URL DOI

  15. Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks. In Proceedings of the Sixth Workshop on Structured Prediction for NLP (SPNLP), pages 11–21, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.spnlp-1.2)

    URL Code DOI

  16. Masahiro Kaneko and Danushka Bollegala. Unmasking the Mask – Evaluating Social Biases in Masked Language Models. In Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI), pages 11954–11962, Vancouver, BC, Canada, February 2022. (doi: 10.1609/aaai.v36i11.21453)

    URL DOI

  17. Qian Sun, Aili Shen, Hiyori Yoshikawa, Chunpeng Ma, Daniel Beck, Tomoya Iwakura, and Timothy Baldwin. Evaluating Hierarchical Document Categorisation. In Proceedings of the The 19th Annual Workshop of the Australasian Language Technology Association (ALTA), pages 179–184, December 2021.

  18. Hiroki Iida and Naoaki Okazaki. Incorporating Semantic Textual Similarity and Lexical Matching for Information Retrieval. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 582–591, Shanghai, China, November 2021.

    URL

  19. Shota Koyama, Hiroya Takamura, and Naoaki Okazaki. Various Errors Improve Neural Grammatical Error Correction. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 251–261, Shanghai, China, November 2021.

    URL

  20. Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, and Karin Verspoor. Overview of ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents. In Experimental IR Meets Multilinguality, Multimodality, and Interaction: 12th International Conference of the CLEF Association (CLEF), September 2021. (doi: 10.1007/978-3-030-85251-1_20)

    URL DOI

  21. Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, and Karin Verspoor. Extended Overview of ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents. In Proceedings of the Working Notes of CLEF 2021, volume 2936, pages 693–709, September 2021.

    URL

  22. Kosuke Yamada, Yuta Hitomi, Hideaki Tamori, Ryohei Sasano, Naoaki Okazaki, Kentaro Inui, and Koichi Takeda. Transformer-based Lexically Constrained Headline Generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4085–4090, Online and Punta Cana, Dominican Republic, November 2021. (doi: 10.18653/v1/2021.emnlp-main.335)

    URL Code DOI

  23. Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, and Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. In Proceedings of the 3rd Conference on Automated Knowledge Base Construction (AKBC), pages (19 pages), October 2021.

    URL Slides

  24. Hiyori Yoshikawa, Tomoya Iwakura, Kimi Kaneko, Hiroaki Yoshida, Yasutaka Kumano, Kazutaka Shimada, Rafal Rzepka, and Patrycja Swieczkowska. Tell Me What You Read: Automatic Expertise-Based Annotator Assignment for Text Annotation in Expert Domains. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 1575–1585, Held Online, September 2021.

    URL

  25. Ayana Niwa, Keisuke Nishiguchi, and Naoaki Okazaki. Predicting Antonyms in Context using BERT. In Proceedings of the 14th International Conference on Natural Language Generation (INLG), pages 48–54, Aberdeen, Scotland, UK, August 2021.

    URL

  26. Keiji Yasuda, Ichiro Yamada, Naoaki Okazaki, Hideki Tanaka, Hidehiro Asaka, Takeshi Anzai, and Fumiaki Sugaya. Field Experiments of Real Time Foreign News Distribution Powered by MT. In Proceedings of Machine Translation Summit XVIII: Users and Providers Track (MT Summit), pages 227–232, Virtual, August 2021.

    URL

  27. Raj Dabre, Aizhan Imankulova, and Masahiro Kaneko. Studying The Impact Of Document-level Context On Simultaneous Neural Machine Translation. In Proceedings of the 18th Biennial Machine Translation Summit (Volume 1: Research Track) (MT Summit), pages 202–214, Virtual, August 2021.

    URL

  28. Hiyori Yoshikawa, Saber A. Akhondi, Camilo Thorne, Christian Druckenbrodt, Ralph Hoessel, Zenan Zhai, Jiayuan He, Timothy Baldwin, and Karin Verspoor. Chemical Reaction Reference Resolution in Patents. In Proceedings of the 2nd Workshop on on Patent Text Mining and Semantic Technologies, pages 10–17, July 2021.

    URL

  29. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Joint Optimization of Tokenization and Downstream Model. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (Findings of ACL), pages 244–255, Online, August 2021. (doi: 10.18653/v1/2021.findings-acl.21)

    URL Code DOI

  30. Aomi Koyama, Kengo Hotate, Masahiro Kaneko, and Mamoru Komachi. Comparison of Grammatical Error Correction Using Back-Translation Models. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop (NAACL SRW), pages 126–135, Online, June 2021. (doi: 10.18653/v1/2021.naacl-srw.16)

    URL Video DOI

  31. Seiichiro Kondo, Kengo Hotate, Tosho Hirasawa, Masahiro Kaneko, and Mamoru Komachi. Sentence Concatenation Approach to Data Augmentation for Neural Machine Translation. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop (NAACL SRW), pages 143–149, Online, June 2021. (doi: 10.18653/v1/2021.naacl-srw.18)

    URL DOI

  32. Sho Takase and Shun Kiyono. Rethinking Perturbations in Encoder-Decoders for Fast Training. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 5767–5780, Online, June 2021. (doi: 10.18653/v1/2021.naacl-main.460)

    URL DOI

  33. Chunpeng Ma, Aili Shen, Hiyori Yoshikawa, Tomoya Iwakura, Daniel Beck, and Timothy Baldwin. On the (In)Effectiveness of Images for Text Classification. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 42–48, Online, April 2021. (doi: 10.18653/v1/2021.eacl-main.4)

    URL DOI

  34. Masahiro Kaneko and Danushka Bollegala. Debiasing Pre-trained Contextualised Embeddings. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 1256–1266, Online, April 2021.

    URL Code

  35. Masahiro Kaneko and Danushka Bollegala. Dictionary-based Debiasing of Pre-trained Word Embeddings. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 212–223, Online, April 2021. (doi: 10.18653/v1/2021.eacl-main.16)

    URL Code DOI

  36. Zhishen Yang and Naoaki Okazaki. Image Caption Generation for News Articles. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), pages 1941–1951, Barcelona, Spain (Online), December 2020. (doi: 10.18653/v1/2020.coling-main.176)

    URL Code DOI

  37. Sho Takase and Sosuke Kobayashi. All Word Embeddings from One Embedding. In Proceedings of the 34th Conference on Neural Information Processing System (NeurIPS), pages 3775–3785, December 2020.

    URL arXiv Code

  38. Won Ik Cho, Sangwhan Moon, and Youngsook Song. Open Korean Corpora: A Practical Report. In Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS), pages 85–93, Online, November 2020. (doi: 10.18653/v1/2020.nlposs-1.12)

    URL DOI

  39. Shin Kanouchi, Masato Neishi, Yuta Hayashibe, Hiroki Ouchi, and Naoaki Okazaki. You May Like This Hotel Because ...: Identifying Evidence for Explainable Recommendations. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP), pages 890–899, Suzhou, China, December 2020.

    URL

  40. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Optimizing Word Segmentation for Downstream Task. In Findings of the Association for Computational Linguistics: EMNLP 2020 (Findings of EMNLP), pages 1341–1351, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.120)

    URL DOI

  41. Won Ik Cho, Youngki Moon, Sangwhan Moon, Seok Min Kim, and Nam Soo Kim. Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives. In Findings of the Association for Computational Linguistics: EMNLP 2020 (Findings of EMNLP), pages 329–339, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.31)

    URL DOI

  42. Sangwhan Moon and Naoaki Okazaki. PatchBERT: Just-in-Time, Out-of-Vocabulary Patching. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7846–7852, Online, November 2020. (doi: 10.18653/v1/2020.emnlp-main.631)

    URL DOI

  43. Wiem Ben Rim and Naoaki Okazaki. SWAGex at SemEval-2020 Task 4: Commonsense Explanation as Next Event Prediction. In Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval), pages 422–429, Barcelona (online), December 2020.

    URL

  44. Zhishen Yang, Lars Wolfsteller, and Naoaki Okazaki. TextLearner at SemEval-2020 Task 10: A Contextualized Ranking System in Solving Emphasis Selection in Text. In Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval), pages 1691–1697, Barcelona (online), December 2020.

    URL

  45. Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, and Naoaki Okazaki. It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1640–1649, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.149)

    URL DOI

  46. Emanuele Bugliarello and Naoaki Okazaki. Enhancing Machine Translation with Dependency-Aware Self-Attention. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1618–1627, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.147)

    URL DOI

  47. Zixia Jia, Youmi Ma, Jiong Cai, and Kewei Tu. Semi-Supervised Semantic Dependency Parsing Using CRF Autoencoders. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 6795–6805, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.607)

    URL DOI

  48. Kazuki Matsumaru, Sho Takase, and Naoaki Okazaki. Improving Truthfulness of Headline Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1335–1346, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.123)

    URL DOI

  49. Matsuno Shogo, Mizuki Sakae, and Sakaki Takeshi. Improved Advertisement Targeting via Fine-grained Location Prediction using Twitter. In Companion of The 2020 Web Conference 2020 (WWW), pages 527–532, Taipei, Taiwan, 2020. (doi: 10.1145/3366424.3382118)

    URL DOI

  50. Sangwhan Moon and Naoaki Okazaki. Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword Tokenization. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), pages 3490–3497, Marseille, France, May 2020.

    URL

  51. Sho Shimazu, Sho Takase, Toshiaki Nakazawa, and Naoaki Okazaki. Evaluation Dataset for Zero Pronoun in Japanese to English Translation. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), pages 3630–3634, Marseille, France, May 2020.

    URL

  52. Sakae Mizuki and Naoaki Okazaki. Analyzing the Variation Property of Contextualized Word Representations. In AI 2019: Advances in Artificial Intelligence, pages 393–405, December 2019. (doi: 10.1007/978-3-030-35288-2_32)

    URL DOI

  53. Yuichi Sasazawa, Sho Takase, and Naoaki Okazaki. Neural Question Generation using Interrogative Phrases. In Proceedings of the 12th International Conference on Natural Language Generation (INLG), pages 106–111, Tokyo, Japan, October 2019. (doi: 10.18653/v1/W19-8613)

    URL DOI

  54. Emanuele Bugliarello, Swayambhoo Jain, and Vineeth Rakesh. Matrix Completion in the Unit Hypercube via Structured Matrix Factorization. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI), pages 2038–2044, August 2019. (doi: 10.24963/ijcai.2019/282)

    URL DOI

  55. Tatsuya Hiraoka, Hiroyuki Shindo, and Yuji Matsumoto. Stochastic Tokenization with a Language Model for Neural Text Classification. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1620–1629, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1158)

    URL DOI

  56. Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 2102–2113, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1202)

    URL DOI

  57. Sho Takase and Naoaki Okazaki. Positional Encoding to Control Output Sequence Length. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (NAACL), pages 3999–4004, Minneapolis, Minnesota, June 2019. (doi: 10.18653/v1/N19-1401)

    URL DOI

  58. Zhishen Yang, Sam Vijlbrief, and Naoaki Okazaki. TokyoTech_NLP at SemEval-2019 Task 3: Emotion-related Symbols in Emotion Detection. In Proceedings of the 13th International Workshop on Semantic Evaluation (SemEval), pages 350–354, Minneapolis, Minnesota, USA, June 2019. (doi: 10.18653/v1/S19-2061)

    URL DOI

  59. Sho Takase, Jun Suzuki, and Masaaki Nagata. Character n-gram Embeddings to Improve RNN Language Models. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), pages 5074–5082, January 2019.

    arXiv

  60. Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Reducing Odd Generation from Neural Headline Generation. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC), Hong Kong, December 2018.

    URL

  61. Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation and Dialectometry. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC), Hong Kong, December 2018.

    URL

  62. Sho Takase, Jun Suzuki, and Masaaki Nagata. Direct Output Connection for a High-Rank Language Model. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4599–4609, Brussels, Belgium, October 2018. (doi: 10.18653/v1/D18-1489)

    URL DOI

  63. Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models. In Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pages 74–81, Brussels, Belgium, November 2018. (doi: 10.18653/v1/W18-5410)

    URL DOI

  64. Diana Galvan, Naoaki Okazaki, Koji Matsuda, and Kentaro Inui. Investigating the Challenges of Temporal Relation Extraction from Clinical Text. In Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis (Louhi), pages 55–64, Brussels, Belgium, October 2018. (doi: 10.18653/v1/W18-5607)

    URL DOI

  65. Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Predicting Stances from Social Media Posts using Factorization Machines. In Proceedings of the 27th International Conference on Computational Linguistics (COLING), pages 3381–3390, August 2018.

    URL

  66. Yuta Hitomi, Hideaki Tamori, Naoaki Okazaki, and Kentaro Inui. Proofread Sentence Generation as Multi-Task Learning with Editing Operation Prediction. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP), pages 436–441, November 2017.

    URL

  67. Sosuke Kobayashi, Naoaki Okazaki, and Kentaro Inui. A Neural Language Model for Dynamically Representing the Meanings of Unknown Words and Entities in a Discourse. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP), pages 473–483, November 2017.

    URL

  68. Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. A Crowdsourcing Approach for Annotating Causal Relation Instances in Wikipedia. In Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 336–345, November 2017.

    URL

  69. Shota Sasaki, Sho Takase, Naoya Inoue, Naoaki Okazaki, and Kentaro Inui. Handling Multiword Expressions in Causality Estimation. In IWCS 2017 — 12th International Conference on Computational Semantics — Short papers, pages (6 pages), 2017.

    URL

  70. Hideaki Tamori, Yuta Hitomi, Naoaki Okazaki, and Kentaro Inui. Analyzing the Revision Logs of a Japanese Newspaper for Article Quality Assessment. In Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, pages 46–50, Copenhagen, Denmark, September 2017. (doi: 10.18653/v1/W17-4208)

    URL DOI

  71. Sho Yokoi, Daichi Mochihashi, Ryo Takahashi, Naoaki Okazaki, and Kentaro Inui. Learning Co-Substructures by Kernel Dependence Maximization. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), pages 3329–3335, August 2017.

    URL

  72. Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 398–408, Vancouver, Canada, July 2017. (doi: 10.18653/v1/P17-1037)

    URL DOI

Invited talks

  1. Naoaki Okazaki. Neural Machine Translation and Summarization for News. In International Workshop on Speech to Speech Machine Translation (IWSSMT), November 2020.

    URL

  2. Naoaki Okazaki. Towards Natural Language Processing that Understands Context. In AI Shooting Stars Session, Artificial Intelligence — International Research and Applications: 1st Japanese-German-French DWIH Symposium, November 2018.

    URL

  3. Naoaki Okazaki. How Deep Learning Changes Natural Language Processing. In Fourth Asia Pacific Corpus Linguistics Conference (APCLC 2018), September 2018.

    URL

  4. Naoaki Okazaki. Bridging Knowledge and Text with Deep Neural Networks. In Second International Workshop on Symbolic-Neural Learning (SNL-2018), July 2018.

    URL

  5. Naoaki Okazaki. Generating Text with Deep Neural Networks. In Deep Learning: Theory, Algorithms, and Applications, March 2018.

    URL

Non-refereed papers

  1. Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, and Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. In Proceedings of the Fifth Widening Natural Language Processing Workshop (WiNLP2021), November 2021.

  2. Zhishen Yang, Tosho Hirasawa, Mamoru Komachi, and Naoaki Okazaki. Do Videos Guide Translations? Evaluation on Video-guided Machine Translation dataset. In Visually Grounded Interaction and Language (ViGIL), 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2021) workshop, June 2021.

    URL

  3. Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, and Naoaki Okazaki. Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020. In First Workshop on Advances in Language and Vision Research (ALVR 2020), ACL 2020, July 2020.

    arXiv

  4. Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Named Entity Recognition and Relation Extraction using Enhanced Table Filling by Contextualized Representations, 2020.

    arXiv