Publications

Awards

  1. Best Paper Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Sakae Mizuki, Naoaki Okazaki

    Hyponymy Detection using Hierarchical Code Learning

    URL

  2. Young Researcher’s Encouragement Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Tatsuya Hiraoka

    Optimizing Word Segmentation using Loss Values of Downstream Tasks

    URL

  3. Young Researcher’s Encouragement Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Youmi Ma

    Named Entity Recognition and Relation Extraction by Table-Filling using BERT

    URL

  4. Committee Special Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Kosuke Yamada ,Yuta Hitomi,Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Headline Generation that Reliably Contains the Specified Words

    URL

  5. Sponsor Award, the 27th Annual Meeting of The Association for Natural Language Processing (2021-03-18)

    Kosuke Yamada ,Yuta Hitomi,Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Headline Generation that Reliably Contains the Specified Words

    URL

  6. TokyoTech Education Award 2020 (2021-03-02)

    Yoshihiro Miyake, Naoaki Okazaki, Takafumi Kanamori, Tsuyoshi Murata, Shin-ya Nishizaki, Kazuyuki Shudo, Kenji Kise, Masamichi Shimosaka, Masakazu Sekijima, Keisuke Yanagisawa, Masahiro Kuze, Mitsuji Sampei, Ichiro Yamanaka, Takehiko Itoh, Toru Takeuchi, Takeo Yamaguchi, Kei Sakaguchi

    University-wide Education Program of Data Science and Artificial Intelligence for Graduate Students

    URL

  7. Presentation award, the 15th NTCIR (2020-12-17)

    Yuichi Sasazawa, Naoaki Okazaki

    WER99 at the NTCIR-15 QA Lab-PoliInfo-2 Classification Task

    URL

  8. Winning the Video-guided Machine Translation (VMT) Challenge 2020 (2020-07-13)

    Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, and Naoaki Okazaki

    Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020

    URL

  9. Language resource award, the 26th Annual Meeting of The Association for Natural Language Processing (2020-03-20)

    Yuta Hitomi, Yuya Taguchi, Hideaki Tamori, Naoaki Okazaki, Kentaro Inui

    Style Transfer for Abstractive Summarization in a Small-scale Resource

    URL

  10. Young researcher’s encouragement award, the 26th Annual Meeting of The Association for Natural Language Processing (2020-03-20)

    Kazuki Matsumaru

    Improving Truthfulness of Headline Generation

    URL

  11. Young researcher’s encouragement award, the 242nd Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2019-10-25)

    Tatsuya Hiraoka

    HMM-based Neural Network Capturing Latent History with RNN

    URL

  12. Best Paper Award, the 240th Meeting of Special Interest Group of Natural Language Processing (SIGNL), Information Processing Society of Japan (IPSJ) (2019-06-14)

    Kazuki Matsumaru, Sho Takase, Naoaki Okazaki

    Re-examinating the Task of Headline Generation based on Textual Entailment

    URL

  13. JSAI 2018 Best Paper Award (2018-06-27)

    Sho Takase, Naoaki Okazaki, Kentaro Inui

    Learning to Compose Distributed Representations of Relational Patterns

    URL

  14. Best Paper Award, the 24th Annual Meeting of The Association for Natural Language Processing (2018-03-15)

    Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata

    Reducing odd generation in neural headline generation

    URL

Research Publications

Journal papers

  1. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Recurrent Neural Hidden Markov Model for High-Order Transition. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 20:(to appear), 2021.

  2. Emanuele Bugliarello, Ryan Cotterell, Naoaki Okazaki, and Desmond Elliott. Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs. Transactions of the Association for Computational Linguistics, 9:(to appear), 2021.

    arXiv

  3. Ayana Niwa, Naoaki Okazaki, Kohei Wakimoto, Keisuke Nishiguchi, and Masataka Mouri. Construction of a Corpus of Rhetorical Devices in Slogans and Structural Analysis of Antitheses. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 20:(to appear), 2021.

  4. Sangwhan Moon and Naoaki Okazaki. The Effects and Mitigation of Out-of-Vocabulary in Universal Language Models. Journal of Information Processing, 90(to appear), July 2021.

  5. Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation for 48 Low-resource Japanese Dialects. Journal of Natural Language Processing, 27(4):781–800, December 2020. (doi: 10.5715/jnlp.27.781)

    DOI

  6. Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. Journal of Natural Language Processing, 27(3):599–626, September 2020. (doi: 10.5715/jnlp.27.599)

    DOI

  7. Diana Galvan-Sosa, Koji Matsuda, Naoaki Okazaki, and Kentaro Inui. Empirical Exploration of the Challenges in Temporal Relation Extraction from Clinical Text. Journal of Natural Language Processing, 27(2):383–409, June 2020. (doi: 10.5715/jnlp.27.383)

    DOI

  8. image
    Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. Stance Detection Attending External Knowledge from Wikipedia. Journal of Information Processing, 27:499–506, August 2019. (doi: 10.2197/ipsjjip.27.499)

    DOI

  9. Masatoshi Suzuki, Koji Matsuda, Satoshi Sekine, Naoaki Okazaki, and Kentaro Inui. A Joint Neural Model for Fine-Grained Named Entity Classification of Wikipedia Articles. IEICE Transactions on Information and Systems, Special Section on Semantic Web and Linked Data, E101.D(1):73–81, January 2018. (doi: 10.1587/transinf.2017SWP0005)

    DOI

  10. Ran Tian, Naoaki Okazaki, and Kentaro Inui. The mechanism of additive composition. Machine Learning, 106(7):1083–1130, July 2017. (doi: 10.1007/s10994-017-5634-8)

    DOI

  11. Sho Takase, Naoaki Okazaki, and Kentaro Inui. Learning to Compose Distributed Representations of Relational Patterns. Transactions of the Japanese Society for Artificial Intelligence (in Japanese), 32(4):D-G96_1-11, July 2017. (doi: 10.1527/tjsai.D-G96)

    DOI

  12. Shuangshuang Zhou, Naoaki Okazaki, Koji Matsuda, Ran Tian, and Kentaro Inui. Supervised Approaches for Japanese Wikification. Journal of Information Processing, 25:341–350, April 2017. (doi: 10.2197/ipsjjip.25.341)

    DOI

International conferences

  1. Ayana Niwa, Keisuke Nishiguchi, and Naoaki Okazaki. Predicting Antonyms in Context using BERT. In Proceedings of the 14th International Conference on Natural Language Generation, pages (to appear), 2021.

  2. Hiyori Yoshikawa, Saber A. Akhondi, Camilo Thorne, Christian Druckenbrodt, Ralph Hoessel, Zenan Zhai, Jiayuan He, Timothy Baldwin, and Karin Verspoor. Chemical Reaction Reference Resolution in Patents. In Proceedings of the 2nd Workshop on on Patent Text Mining and Semantic Technologies, 2021.

  3. Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, and Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. In Proceedings of the Fifth Widening Natural Language Processing Workshop (WiNLP2021), pages (to appear), November 2021.

  4. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Joint Optimization of Tokenization and Downstream Model. In Findings of the Association for Computational Linguistics: ACL 2021, pages (to appear), August 2021.

    arXiv

  5. Sho Takase and Shun Kiyono. Rethinking Perturbations in Encoder-Decoders for Fast Training. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 5767–5780, Online, June 2021. (doi: 10.18653/v1/2021.naacl-main.460)

    URL DOI

  6. Zhishen Yang, Tosho Hirasawa, Mamoru Komachi, and Naoaki Okazaki. Did Videos Guide Translations? Evaluation on Video-guided Machine Translation dataset. In Proceedings of Visually Grounded Interaction and Language (ViGIL), 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2021) workshop, pages (to appear), June 2021.

    URL

  7. Masahiro Kaneko and Danushka Bollegala. Debiasing Pre-trained Contextualised Embeddings. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 1256–1266, Online, April 2021.

    URL

  8. Masahiro Kaneko and Danushka Bollegala. Dictionary-based Debiasing of Pre-trained Word Embeddings. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 212–223, Online, April 2021.

    URL

  9. Zhishen Yang and Naoaki Okazaki. Image Caption Generation for News Articles. In Proceedings of the 28th International Conference on Computational Linguistics, pages 1941–1951, Barcelona, Spain (Online), December 2020. (doi: 10.18653/v1/2020.coling-main.176)

    URL DOI

  10. Sho Takase and Sosuke Kobayashi. All Word Embeddings from One Embedding. In Proceedings of the Thirty-fourth Conference on Neural Information Processing System (NeurIPS 2020), pages 3775–3785, December 2020.

    arXiv

  11. Won Ik Cho, Sangwhan Moon, and Youngsook Song. Open Korean Corpora: A Practical Report. In Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS), pages 85–93, Online, November 2020. (doi: 10.18653/v1/2020.nlposs-1.12)

    URL DOI

  12. Shin Kanouchi, Masato Neishi, Yuta Hayashibe, Hiroki Ouchi, and Naoaki Okazaki. You May Like This Hotel Because ...: Identifying Evidence for Explainable Recommendations. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, pages 890–899, Suzhou, China, December 2020.

    URL

  13. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Optimizing Word Segmentation for Downstream Task. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1341–1351, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.120)

    URL DOI

  14. Won Ik Cho, Youngki Moon, Sangwhan Moon, Seok Min Kim, and Nam Soo Kim. Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 329–339, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.31)

    URL DOI

  15. Sangwhan Moon and Naoaki Okazaki. PatchBERT: Just-in-Time, Out-of-Vocabulary Patching. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7846–7852, Online, November 2020. (doi: 10.18653/v1/2020.emnlp-main.631)

    URL DOI

  16. Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, and Naoaki Okazaki. Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020. In First Workshop on Advances in Language and Vision Research (ALVR 2020), ACL 2020, July 2020.

    arXiv

  17. Wiem Ben Rim and Naoaki Okazaki. SWAGex at SemEval-2020 Task 4: Commonsense Explanation as Next Event Prediction. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 422–429, Barcelona (online), December 2020.

    URL

  18. Zhishen Yang, Lars Wolfsteller, and Naoaki Okazaki. TextLearner at SemEval-2020 Task 10: A Contextualized Ranking System in Solving Emphasis Selection in Text. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1691–1697, Barcelona (online), December 2020.

    URL

  19. Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, and Naoaki Okazaki. It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1640–1649, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.149)

    URL DOI

  20. Emanuele Bugliarello and Naoaki Okazaki. Enhancing Machine Translation with Dependency-Aware Self-Attention. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1618–1627, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.147)

    URL DOI

  21. Zixia Jia, Youmi Ma, Jiong Cai, and Kewei Tu. Semi-Supervised Semantic Dependency Parsing Using CRF Autoencoders. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 6795–6805, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.607)

    URL DOI

  22. Kazuki Matsumaru, Sho Takase, and Naoaki Okazaki. Improving Truthfulness of Headline Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1335–1346, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.123)

    URL DOI

  23. Sangwhan Moon and Naoaki Okazaki. Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword Tokenization. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 3490–3497, Marseille, France, May 2020.

    URL

  24. Sho Shimazu, Sho Takase, Toshiaki Nakazawa, and Naoaki Okazaki. Evaluation Dataset for Zero Pronoun in Japanese to English Translation. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 3630–3634, Marseille, France, May 2020.

    URL

  25. Sakae Mizuki and Naoaki Okazaki. Analyzing the Variation Property of Contextualized Word Representations. In AI 2019: Advances in Artificial Intelligence, pages 393–405, December 2019. (doi: 10.1007/978-3-030-35288-2_32)

    URL DOI

  26. Yuichi Sasazawa, Sho Takase, and Naoaki Okazaki. Neural Question Generation using Interrogative Phrases. In Proceedings of the 12th International Conference on Natural Language Generation, pages 106–111, Tokyo, Japan, October 2019. (doi: 10.18653/v1/W19-8613)

    URL DOI

  27. Emanuele Bugliarello, Swayambhoo Jain, and Vineeth Rakesh. Matrix Completion in the Unit Hypercube via Structured Matrix Factorization. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19), pages 2038–2044, August 2019. (doi: 10.24963/ijcai.2019/282)

    URL DOI

  28. Tatsuya Hiraoka, Hiroyuki Shindo, and Yuji Matsumoto. Stochastic Tokenization with a Language Model for Neural Text Classification. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 1620–1629, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1158)

    URL DOI

  29. Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2102–2113, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1202)

    URL DOI

  30. Sho Takase and Naoaki Okazaki. Positional Encoding to Control Output Sequence Length. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 3999–4004, Minneapolis, Minnesota, June 2019. (doi: 10.18653/v1/N19-1401)

    URL DOI

  31. Zhishen Yang, Sam Vijlbrief, and Naoaki Okazaki. TokyoTech_NLP at SemEval-2019 Task 3: Emotion-related Symbols in Emotion Detection. In Proceedings of the 13th International Workshop on Semantic Evaluation, pages 350–354, Minneapolis, Minnesota, USA, June 2019. (doi: 10.18653/v1/S19-2061)

    URL DOI

  32. Sho Takase, Jun Suzuki, and Masaaki Nagata. Character n-gram Embeddings to Improve RNN Language Models. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019), pages 5074–5082, January 2019.

    arXiv

  33. Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Reducing Odd Generation from Neural Headline Generation. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, Hong Kong, 2018.

    URL

  34. Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation and Dialectometry. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, Hong Kong, 2018.

    URL

  35. Sho Takase, Jun Suzuki, and Masaaki Nagata. Direct Output Connection for a High-Rank Language Model. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 4599–4609, Brussels, Belgium, October 2018. (doi: 10.18653/v1/D18-1489)

    URL DOI

  36. Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models. In Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pages 74–81, Brussels, Belgium, November 2018. (doi: 10.18653/v1/W18-5410)

    URL DOI

  37. Diana Galvan, Naoaki Okazaki, Koji Matsuda, and Kentaro Inui. Investigating the Challenges of Temporal Relation Extraction from Clinical Text. In Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis, pages 55–64, Brussels, Belgium, October 2018. (doi: 10.18653/v1/W18-5607)

    URL DOI

  38. Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Predicting Stances from Social Media Posts using Factorization Machines. In Proceedings of the 27th International Conference on Computational Linguistics (Coling 2018), pages 3381–3390, August 2018.

    URL

  39. Yuta Hitomi, Hideaki Tamori, Naoaki Okazaki, and Kentaro Inui. Proofread Sentence Generation as Multi-Task Learning with Editing Operation Prediction. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP 2017), pages 436–441, November 2017.

    URL

  40. Sosuke Kobayashi, Naoaki Okazaki, and Kentaro Inui. A Neural Language Model for Dynamically Representing the Meanings of Unknown Words and Entities in a Discourse. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP 2017), pages 473–483, November 2017.

    URL

  41. Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. A Crowdsourcing Approach for Annotating Causal Relation Instances in Wikipedia. In Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation (PACLIC 2017), pages 336–345, November 2017.

    URL

  42. Shota Sasaki, Sho Takase, Naoya Inoue, Naoaki Okazaki, and Kentaro Inui. Handling Multiword Expressions in Causality Estimation. In Proceedings of the 12th International Conference on Computational Semantics (IWCS 2017), pages (6 pages), September 2017.

    URL

  43. Hideaki Tamori, Yuta Hitomi, Naoaki Okazaki, and Kentaro Inui. Analyzing the Revision Logs of a Japanese Newspaper for Article Quality Assessment. In Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, pages 46–50, Copenhagen, Denmark, September 2017. (doi: 10.18653/v1/W17-4208)

    URL DOI

  44. Sho Yokoi, Daichi Mochihashi, Ryo Takahashi, Naoaki Okazaki, and Kentaro Inui. Learning Co-Substructures by Kernel Dependence Maximization. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI 2017), pages 3329–3335, August 2017.

    URL

  45. Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 398–408, Vancouver, Canada, July 2017. (doi: 10.18653/v1/P17-1037)

    URL DOI

Invited talks

  1. Naoaki Okazaki. Neural Machine Translation and Summarization for News. In International Workshop on Speech to Speech Machine Translation (IWSSMT), November 2020.

    URL

  2. Naoaki Okazaki. Towards Natural Language Processing that Understands Context. In AI Shooting Stars Session, Artificial Intelligence — International Research and Applications: 1st Japanese-German-French DWIH Symposium, November 2018.

    URL

  3. Naoaki Okazaki. How Deep Learning Changes Natural Language Processing. In Fourth Asia Pacific Corpus Linguistics Conference (APCLC 2018), September 2018.

    URL

  4. Naoaki Okazaki. Bridging Knowledge and Text with Deep Neural Networks. In Second International Workshop on Symbolic-Neural Learning (SNL-2018), July 2018.

    URL

  5. Naoaki Okazaki. Generating Text with Deep Neural Networks. In Deep Learning: Theory, Algorithms, and Applications, March 2018.

    URL