Biography - Taro Yamada

Taro Yamada is a master course student of computer science at Okazaki Laboratory. His research interests include natural language generation and summarization. He develops a headline generation system, where an encoder-decoder model generates a headline for a given news article.

Education

MSc, School of Computing, Tokyo Institute of Technology (2020)
BSc, Department of Information Science, Tokyo University of Science (2018)

Job Experience

Publications

Journal papers

Invited talks

Jun Suzuki, Kyosuke Nishida, and Naoaki Okazaki. A Gentle Introduction to Technologies Behind Language Models and Recent Achievement in ChatGPT. In Tutorial 2, the 27nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), May 2023.

URL Slides
Naoaki Okazaki. Neural Machine Translation and Summarization for News. In International Workshop on Speech to Speech Machine Translation (IWSSMT), November 2020.

URL
Naoaki Okazaki. Towards Natural Language Processing that Understands Context. In AI Shooting Stars Session, Artificial Intelligence — International Research and Applications: 1st Japanese-German-French DWIH Symposium, November 2018.

URL
Naoaki Okazaki. How Deep Learning Changes Natural Language Processing. In Fourth Asia Pacific Corpus Linguistics Conference (APCLC 2018), September 2018.

URL
Naoaki Okazaki. Bridging Knowledge and Text with Deep Neural Networks. In Second International Workshop on Symbolic-Neural Learning (SNL-2018), July 2018.

URL
Naoaki Okazaki. Generating Text with Deep Neural Networks. In Deep Learning: Theory, Algorithms, and Applications, March 2018.

URL

International conferences

Marco Cognetta, David Pohl, Junyoung Lee, and Naoaki Okazaki. Pitfalls, Subtleties, and Techniques in Automata-Based Subword-Level Constrained Generation. In Tokenization Workshop, pages (to appear), Vancouver, Canada, July 2025.

URL
Masahiro Kaneko, Youmi Ma, Yuki Wata, and Naoaki Okazaki. Sampling-based Pseudo-Likelihood for Membership Inference Attacks. In Findings of the Association for Computational Linguistics: ACL 2025 (ACL), pages (to appear), Vienna, Austria, July 2025.
Keito Sasagawa, Koki Maeda, Issa Sugiura, Shuhei Kurita, Naoaki Okazaki, and Daisuke Kawahara. Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model. In Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Demo Track (NAACL), pages (to appear), Albuquerque, New Mexico, May 2025.
Junyoung Lee, Marco Cognetta, Sangwhan Moon, and Naoaki Okazaki. Jamo-Level Subword Tokenization in Low-Resource Korean Machine Translation. In The Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT), pages (to appear), Albuquerque, USA, May 2025.
Ryuto Koike, Masahiro Kaneko, and Naoaki Okazaki. How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection. In Findings of the Association for Computational Linguistics: EMNLP 2024 (EMNLP), pages 14384–14395, Miami, Florida, USA, November 2024. (doi: 10.18653/v1/2024.findings-emnlp.841)

URL DOI
Marco Cognetta, Vilém Zouhar, and Naoaki Okazaki. Distributional Properties of Subword Regularization. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 10753–10763, Miami, Florida, USA, November 2024. (doi: 10.18653/v1/2024.emnlp-main.600)

URL DOI
Shota Koyama, Ryo Nagata, Hiroya Takamura, and Naoaki Okazaki. n-gram F-score for Evaluating Grammatical Error Correction. In Proceedings of the 17th International Natural Language Generation Conference (INLG), pages 303–313, Tokyo, Japan, September 2024.

URL
Naoaki Okazaki, Kakeru Hattori, Hirai Shota, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, and Sakae Mizuki. Building a Large Japanese Web Corpus for Large Language Models. In Proceedings of the First Conference on Language Modeling (COLM), pages (18 pages), University of Pennsylvania, USA, October 2024.

URL
Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Hirai Shota, Sakae Mizuki, Rio Yokota, and Naoaki Okazaki. Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities. In Proceedings of the First Conference on Language Modeling (COLM), pages (25 pages), University of Pennsylvania, USA, October 2024.

URL
Mengsay Loem, Masahiro Kaneko, and Naoaki Okazaki. SAIE Framework: Support Alone Isn’t Enough - Advancing LLM Training with Adversarial Remarks. In Proceedings of the 27th European Conference on Artificial Intelligence (ECAI), pages 3717–3724, Santiago de Compostela, Spain, October 2024. (doi: 10.3233/FAIA240931)

URL DOI
Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, and Naoaki Okazaki. Likelihood-based Mitigation of Evaluation Bias in Large Language Models. In Lun-Wei Ku, Andre Martins, and Vivek Srikumar, editors, Findings of the Association for Computational Linguistics ACL 2024 (ACL 2024), pages 3237–3245, Bangkok, Thailand and virtual meeting, August 2024. (doi: 10.18653/v1/2024.findings-acl.193)

URL DOI
Marco Cognetta, Tatsuya Hiraoka, Rico Sennrich, Yuval Pinter, and Naoaki Okazaki. An Analysis of BPE Vocabulary Trimming in Neural Machine Translation. In Shabnam Tafreshi, Arjun Akula, João Sedoc, Aleksandr Drozd, Anna Rogers, and Anna Rumshisky, editors, Proceedings of the Fifth Workshop on Insights from Negative Results in NLP, pages 48–50, Mexico City, Mexico, June 2024. (doi: 10.18653/v1/2024.insights-1.7)

URL DOI
Marco Cognetta, Vilém Zouhar, Sangwhan Moon, and Naoaki Okazaki. Two Counterexamples to Tokenization and the Noiseless Channel. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 16897–16906, Torino, Italia, May 2024.

URL
Panatchakorn Anantaprayoon, Masahiro Kaneko, and Naoaki Okazaki. Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 6395–6408, Torino, Italia, May 2024.

URL
Youmi Ma, An Wang, and Naoaki Okazaki. Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 2567–2579, Torino, Italia, May 2024.

URL
Masahiro Kaneko and Naoaki Okazaki. Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 3955–3961, Torino, Italia, May 2024.

URL
Ryuto Koike, Masahiro Kaneko, and Naoaki Okazaki. OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples. In The 38th Annual AAAI Conference on Artificial Intelligence (AAAI), pages 21258–21266, February 2024.
Koki Maeda, Shuhei Kurita, Taiki Miyanishi, and Naoaki Okazaki. Query-based Image Captioning from Multi-context 360° Images. In Findings of the Association for Computational Linguistics: EMNLP 2023 (EMNLP), pages 6940–6954, Singapore, December 2023. (doi: 10.18653/v1/2023.findings-emnlp.463)

URL DOI
Trang Nguyen and Naoaki Okazaki. Causal Reasoning through Two Layers of Cognition for Improving Generalization in Visual Question Answering. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 9221–9236, Singapore, December 2023. (doi: 10.18653/v1/2023.emnlp-main.573)

URL DOI
Masahiro Kaneko and Naoaki Okazaki. Reducing Sequence Length by Predicting Edit Operations with Large Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 10017–10029, Singapore, December 2023. (doi: 10.18653/v1/2023.emnlp-main.619)

URL DOI
Youmi Ma, Bhushan Kotnis, Carolin Lawrance, Goran Glavaš, and Naoaki Okazaki. Improving Cross-Lingual Transfer for Open Information Extraction with Linguistic Feature Projection. In Proceedings of the 3rd Workshop on Multi-lingual Representation Learning (MRL), pages 125–138, Singapore, December 2023. (doi: 10.18653/v1/2023.mrl-1.11)

URL DOI
Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated. In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 2: Short Papers) (AACL), pages 29–36, Nusa Dua, Bali, November 2023. (doi: 10.18653/v1/2023.ijcnlp-short.4)

URL DOI
Mengsay Loem, Masahiro Kaneko, Sho Takase, and Naoaki Okazaki. Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods. In Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023) (BEA), pages 205–219, Toronto, Canada, July 2023.

URL
An Wang, Junfeng Jiang, Youmi Ma, Ao Liu, and Naoaki Okazaki. Generative Data Augmentation for Aspect Sentiment Quad Prediction. In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM), pages 128–140, Toronto, Canada, July 2023. (doi: 10.18653/v1/2023.starsem-1.12)

URL DOI
Marco Cognetta, Sangwhan Moon, Lawrence Wolf-Sonkin, and Naoaki Okazaki. Parameter-Efficient Korean Character-Level Language Modeling. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 2350–2356, Dubrovnik, Croatia, May 2023.

URL
Hiyori Yoshikawa and Naoaki Okazaki. Selective-LAMA: Selective Prediction for Confidence-Aware Evaluation of Language Models. In Findings of the Association for Computational Linguistics: EACL 2023 (Findings of EACL), pages 2017–2028, Dubrovnik, Croatia, May 2023.

URL
Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Comparing Intrinsic Gender Bias Evaluation Measures without using Human Annotated Examples. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 2857–2863, Dubrovnik, Croatia, May 2023.

URL
Sakae Mizuki and Naoaki Okazaki. Semantic Specialization for Knowledge-based Word Sense Disambiguation. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 3457–3470, Dubrovnik, Croatia, May 2023.

URL
Youmi Ma, An Wang, and Naoaki Okazaki. DREEAM: Guiding Attention with Evidence for Improving Document-Level Relation Extraction. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (EACL), pages 1971–1983, Dubrovnik, Croatia, May 2023.

URL
Zhishen Yang, Raj Dabre, Hideki Tanaka, and Naoaki Okazaki. SciCap+: A Knowledge Augmented Dataset to Study the Challenges of Scientific Figure Captioning. In Proceedings of the Workshop on Scientific Document Understanding, co-located with 37th AAAI Conference on Artificial Intelligence (CEUR Workshop Proceedings), page (Paper13), Washington DC, USA, February 2023.

URL
Ao Liu, Haoyu Dong, Naoaki Okazaki, Shi Han, and Dongmei Zhang. PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 5531–5546, Abu Dhabi, United Arab Emirates, December 2022.

URL
Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Gender Bias in Meta-Embeddings. In Findings of the Association for Computational Linguistics: EMNLP 2022 (EMNLP), pages 3118–3133, Abu Dhabi, United Arab Emirates, December 2022.

URL
Hiroki Iida and Naoaki Okazaki. Unsupervised Domain Adaptation for Sparse Retrieval by Filling Vocabulary and Word Frequency Gaps. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (AACL), pages 752–765, Online, November 2022.

URL
Masahiro Kaneko, Danushka Bollegala, and Naoaki Okazaki. Debiasing Isn’t Enough! – on the Effectiveness of Debiasing MLMs and Their Social Biases in Downstream Tasks. In Proceedings of the 29th International Conference on Computational Linguistics (COLING), pages 1299–1310, Gyeongju, Republic of Korea, October 2022.

URL
Koki Maeda, Masahiro Kaneko, and Naoaki Okazaki. IMPARA: Impact based Metric for GEC using Parallel Data. In Proceedings of the 29th International Conference on Computational Linguistics (COLING), pages 3578–3588, Gyeongju, Republic of Korea, October 2022.

URL
Hsuan-Yu Kuo, Youmi Ma, and Naoaki Okazaki. Annotating Entity and Causal Relationships on Japanese Vehicle Recall Information. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 783–791, Manila, Philippines, October 2022.

URL
Vijay Daultani and Naoaki Okazaki. Improving Automatic Evaluation of Acceptability Based on Language Models with a Coarse Sentence Representation. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 109–118, Manila, Philippines, October 2022.

URL
Mengsay Loem, Sho Takase, Masahiro Kaneko, and Naoaki Okazaki. ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop (NAACL SRW), pages 16–24, Hybrid: Seattle, Washington + Online, July 2022. (doi: 10.18653/v1/2022.naacl-srw.3)

URL DOI
Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, and Naoaki Okazaki. Gender Bias in Masked Language Models for Multiple Languages. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 2740–2750, Seattle, United States, July 2022. (doi: 10.18653/v1/2022.naacl-main.197)

URL Code DOI
Hwichan Kim, Sangwhan Moon, Naoaki Okazaki, and Mamoru Komachi. Learning How to Translate North Korean through South Korean. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 6711–6718, Marseille, France, June 2022.

URL
Sangwhan Moon, Won Ik Cho, Hye Joo Han, Naoaki Okazaki, and Nam Soo Kim. OpenKorPOS: Democratizing Korean Tokenization with Voting-Based Open Corpus Annotation. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 4975–4983, Marseille, France, June 2022.

URL
Sho Takase and Naoaki Okazaki. Multi-Task Learning for Cross-Lingual Abstractive Summarization. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), pages 3008–3016, Marseille, France, June 2022.

URL
Masahiro Kaneko, Sho Takase, Ayana Niwa, and Naoaki Okazaki. Interpretability for Language Learners Using Example-Based Grammatical Error Correction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 7176–7187, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.496)

URL Code DOI
Ao Liu, An Wang, and Naoaki Okazaki. Semi-Supervised Formality Style Transfer with Consistency Training. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 4689–4701, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.acl-long.321)

URL Code DOI
Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Word-level Perturbation Considering Word Length and Compositional Subwords. In Findings of the Association for Computational Linguistics: ACL 2022 (Findings of ACL), pages 3268–3275, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.findings-acl.258)

URL Code DOI
Sho Takase, Tatsuya Hiraoka, and Naoaki Okazaki. Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation. In Findings of the Association for Computational Linguistics: ACL 2022 (Findings of ACL), pages 2536–2541, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.findings-acl.199)

URL DOI
Youmi Ma, Tatsuya Hiraoka, and Naoaki Okazaki. Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks. In Proceedings of the Sixth Workshop on Structured Prediction for NLP (SPNLP), pages 11–21, Dublin, Ireland, May 2022. (doi: 10.18653/v1/2022.spnlp-1.2)

URL Code DOI
Hiroki Iida and Naoaki Okazaki. Incorporating Semantic Textual Similarity and Lexical Matching for Information Retrieval. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 582–591, Shanghai, China, November 2021.

URL
Shota Koyama, Hiroya Takamura, and Naoaki Okazaki. Various Errors Improve Neural Grammatical Error Correction. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 251–261, Shanghai, China, November 2021.

URL
Kosuke Yamada, Yuta Hitomi, Hideaki Tamori, Ryohei Sasano, Naoaki Okazaki, Kentaro Inui, and Koichi Takeda. Transformer-based Lexically Constrained Headline Generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4085–4090, Online and Punta Cana, Dominican Republic, November 2021. (doi: 10.18653/v1/2021.emnlp-main.335)

URL Code DOI
Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, and Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. In Proceedings of the 3rd Conference on Automated Knowledge Base Construction (AKBC), pages (19 pages), October 2021.

URL Slides
Ayana Niwa, Keisuke Nishiguchi, and Naoaki Okazaki. Predicting Antonyms in Context using BERT. In Proceedings of the 14th International Conference on Natural Language Generation (INLG), pages 48–54, Aberdeen, Scotland, UK, August 2021.

URL
Keiji Yasuda, Ichiro Yamada, Naoaki Okazaki, Hideki Tanaka, Hidehiro Asaka, Takeshi Anzai, and Fumiaki Sugaya. Field Experiments of Real Time Foreign News Distribution Powered by MT. In Proceedings of Machine Translation Summit XVIII: Users and Providers Track (MT Summit), pages 227–232, Virtual, August 2021.

URL
Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Joint Optimization of Tokenization and Downstream Model. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (Findings of ACL), pages 244–255, Online, August 2021. (doi: 10.18653/v1/2021.findings-acl.21)

URL Code DOI
Zhishen Yang and Naoaki Okazaki. Image Caption Generation for News Articles. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), pages 1941–1951, Barcelona, Spain (Online), December 2020. (doi: 10.18653/v1/2020.coling-main.176)

URL Code DOI
Shin Kanouchi, Masato Neishi, Yuta Hayashibe, Hiroki Ouchi, and Naoaki Okazaki. You May Like This Hotel Because ...: Identifying Evidence for Explainable Recommendations. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP), pages 890–899, Suzhou, China, December 2020.

URL
Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, and Naoaki Okazaki. Optimizing Word Segmentation for Downstream Task. In Findings of the Association for Computational Linguistics: EMNLP 2020 (Findings of EMNLP), pages 1341–1351, Online, November 2020. (doi: 10.18653/v1/2020.findings-emnlp.120)

URL DOI
Sangwhan Moon and Naoaki Okazaki. PatchBERT: Just-in-Time, Out-of-Vocabulary Patching. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7846–7852, Online, November 2020. (doi: 10.18653/v1/2020.emnlp-main.631)

URL DOI
Wiem Ben Rim and Naoaki Okazaki. SWAGex at SemEval-2020 Task 4: Commonsense Explanation as Next Event Prediction. In Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval), pages 422–429, Barcelona (online), December 2020.

URL
Zhishen Yang, Lars Wolfsteller, and Naoaki Okazaki. TextLearner at SemEval-2020 Task 10: A Contextualized Ranking System in Solving Emphasis Selection in Text. In Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval), pages 1691–1697, Barcelona (online), December 2020.

URL
Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, and Naoaki Okazaki. It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1640–1649, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.149)

URL DOI
Emanuele Bugliarello and Naoaki Okazaki. Enhancing Machine Translation with Dependency-Aware Self-Attention. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1618–1627, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.147)

URL DOI
Kazuki Matsumaru, Sho Takase, and Naoaki Okazaki. Improving Truthfulness of Headline Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1335–1346, Online, July 2020. (doi: 10.18653/v1/2020.acl-main.123)

URL DOI
Sangwhan Moon and Naoaki Okazaki. Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword Tokenization. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), pages 3490–3497, Marseille, France, May 2020.

URL
Sho Shimazu, Sho Takase, Toshiaki Nakazawa, and Naoaki Okazaki. Evaluation Dataset for Zero Pronoun in Japanese to English Translation. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), pages 3630–3634, Marseille, France, May 2020.

URL
Sakae Mizuki and Naoaki Okazaki. Analyzing the Variation Property of Contextualized Word Representations. In AI 2019: Advances in Artificial Intelligence, pages 393–405, December 2019. (doi: 10.1007/978-3-030-35288-2_32)

URL DOI
Yuichi Sasazawa, Sho Takase, and Naoaki Okazaki. Neural Question Generation using Interrogative Phrases. In Proceedings of the 12th International Conference on Natural Language Generation (INLG), pages 106–111, Tokyo, Japan, October 2019. (doi: 10.18653/v1/W19-8613)

URL DOI
Hayate Iso, Yui Uehara, Tatsuya Ishigaki, Hiroshi Noji, Eiji Aramaki, Ichiro Kobayashi, Yusuke Miyao, Naoaki Okazaki, and Hiroya Takamura. Learning to Select, Track, and Generate for Data-to-Text. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 2102–2113, Florence, Italy, July 2019. (doi: 10.18653/v1/P19-1202)

URL DOI
Sho Takase and Naoaki Okazaki. Positional Encoding to Control Output Sequence Length. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (NAACL), pages 3999–4004, Minneapolis, Minnesota, June 2019. (doi: 10.18653/v1/N19-1401)

URL DOI
Zhishen Yang, Sam Vijlbrief, and Naoaki Okazaki. TokyoTech_NLP at SemEval-2019 Task 3: Emotion-related Symbols in Emotion Detection. In Proceedings of the 13th International Workshop on Semantic Evaluation (SemEval), pages 350–354, Minneapolis, Minnesota, USA, June 2019. (doi: 10.18653/v1/S19-2061)

URL DOI
Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Reducing Odd Generation from Neural Headline Generation. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC), Hong Kong, December 2018.

URL
Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, and Kentaro Inui. Multi-dialect Neural Machine Translation and Dialectometry. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC), Hong Kong, December 2018.

URL
Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, and Masaaki Nagata. Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models. In Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pages 74–81, Brussels, Belgium, November 2018. (doi: 10.18653/v1/W18-5410)

URL DOI
Diana Galvan, Naoaki Okazaki, Koji Matsuda, and Kentaro Inui. Investigating the Challenges of Temporal Relation Extraction from Clinical Text. In Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis (Louhi), pages 55–64, Brussels, Belgium, October 2018. (doi: 10.18653/v1/W18-5607)

URL DOI
Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Predicting Stances from Social Media Posts using Factorization Machines. In Proceedings of the 27th International Conference on Computational Linguistics (COLING), pages 3381–3390, August 2018.

URL
Yuta Hitomi, Hideaki Tamori, Naoaki Okazaki, and Kentaro Inui. Proofread Sentence Generation as Multi-Task Learning with Editing Operation Prediction. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP), pages 436–441, November 2017.

URL
Sosuke Kobayashi, Naoaki Okazaki, and Kentaro Inui. A Neural Language Model for Dynamically Representing the Meanings of Unknown Words and Entities in a Discourse. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP), pages 473–483, November 2017.

URL
Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. A Crowdsourcing Approach for Annotating Causal Relation Instances in Wikipedia. In Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation (PACLIC), pages 336–345, November 2017.

URL
Shota Sasaki, Sho Takase, Naoya Inoue, Naoaki Okazaki, and Kentaro Inui. Handling Multiword Expressions in Causality Estimation. In IWCS 2017 — 12th International Conference on Computational Semantics — Short papers, pages (6 pages), 2017.

URL
Hideaki Tamori, Yuta Hitomi, Naoaki Okazaki, and Kentaro Inui. Analyzing the Revision Logs of a Japanese Newspaper for Article Quality Assessment. In Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, pages 46–50, Copenhagen, Denmark, September 2017. (doi: 10.18653/v1/W17-4208)

URL DOI
Sho Yokoi, Daichi Mochihashi, Ryo Takahashi, Naoaki Okazaki, and Kentaro Inui. Learning Co-Substructures by Kernel Dependence Maximization. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), pages 3329–3335, August 2017.

URL
Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, and Kentaro Inui. Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL), pages 398–408, Vancouver, Canada, July 2017. (doi: 10.18653/v1/P17-1037)

URL DOI

Domestic conferences

服部翔, 水木栄, 藤井一喜, 中村泰士, 塩谷泰平, 植田快, 新妻巧朗, 川畑輝, 田森秀明, Youmi Ma, 前田航希, 大井聖也, 齋藤幸史郎, 岡本拓己, 石田茂樹, 横田理央, 高村大也, 岡崎直観. 新聞記事からつくる時事と社会に強い日本語LLM. 言語処理学会第31回年次大会 (NLP2025), C10-1, pp. 3948–3953, 2025年3月.

URL
Youmi Ma, 水木栄, 藤井一喜, 中村泰士, 大井聖也, 島田比奈理, 塩谷泰平, 齋藤幸史郎, 前田航希, 服部翔, 岡本拓己, 石田茂樹, 横田理央, 高村大也, 岡崎直観. 模倣学習による大規模言語モデルの指示チューニング. 言語処理学会第31回年次大会 (NLP2025), Q8-21, pp. 3446–3451, 2025年3月.

URL
島田比奈理, 金子正弘, 岡崎直観. Jailbreakにより生成したフェイクニュースの危険度評価. 言語処理学会第31回年次大会 (NLP2025), P7-18, pp. 2867–2872, 2025年3月.

URL
大葉大輔, 金子正弘, Danushka Bollegala, 岡崎直観. 大規模言語モデルの多言語社会的バイアス抑制における単言語ラベル付きデータの役割. 言語処理学会第31回年次大会 (NLP2025), P7-11, pp. 2826–2831, 2025年3月.

URL
齋藤幸史郎, 小池隆斗, 金子正弘, 岡崎直観. PUPPET：タスク性能を維持しながらLLMとして検出されやすくする学習フレームワーク. 言語処理学会第31回年次大会 (NLP2025), P7-5, pp. 2791–2796, 2025年3月.

URL
杉野かおり, 山野陽祐, 河崎真琴, 田森秀明, 岡崎直観, 乾健太郎. SOMO: 音声認識出力の可読性向上を目的とした整文手法の提案. 言語処理学会第31回年次大会 (NLP2025), C7-1, pp. 2686–2691, 2025年3月.

URL
植木快, 川畑輝, 田口雄哉, 新妻巧朗, 浦川通, 田森秀明, 岡崎直観, 乾健太郎. 時事情報に関する日本語QAベンチマーク『ニュースQ』. 言語処理学会第31回年次大会 (NLP2025), Q6-24, pp. 2606–2611, 2025年3月.

URL
杉浦一瑳, 栗田修平, 小田悠介, 河原大輔, 岡崎直観. オープンLLMによる翻訳を活用した日本語CLIPの開発. 言語処理学会第31回年次大会 (NLP2025), C4-6, pp. 1421–1426, 2025年3月.

URL
前田航希, 杉浦一瑳, 小田悠介, 栗田修平, 岡崎直観. llm-jp-eval-mm: 日本語視覚言語モデルの自動評価基盤. 言語処理学会第31回年次大会 (NLP2025), Q3-23, pp. 1303–1308, 2025年3月.

URL
遠藤洸亮, 脇本宏平, 宮西洋輔, 岡崎直観. バナー広告における画像と広告コピーの評価ベンチマーク構築. 言語処理学会第31回年次大会 (NLP2025), Q3-9, pp. 1225–1230, 2025年3月.

URL
村岡雅康, 岡崎直観. 視覚言語モデルの識別性能に関する評価用ベンチマークの構築. 言語処理学会第31回年次大会 (NLP2025), Q3-4, pp. 1196–1201, 2025年3月.

URL
笹川慶人, 前田航希, 杉浦一瑳, 栗田修平, 岡崎直観, 河原大輔. LLM-jp-3 VILA: 日本語マルチモーダルデータセット及び強力な日本語マルチモーダルモデルの構築. 言語処理学会第31回年次大会 (NLP2025), Q3-2, pp. 1185–1190, 2025年3月.

URL
大井聖也, 金子正弘, 岡崎直観, 井上中順. 複数タスク・複数項目に跨ったマルチモーダル自動評価手法. 言語処理学会第31回年次大会 (NLP2025), C3-4, pp. 970–975, 2025年3月.

URL
高橋侑成, Youmi Ma, 金子正弘, 岡崎直観. 大規模言語モデルはデータ漏洩を隠蔽できるのか. 言語処理学会第31回年次大会 (NLP2025), A3-1, pp. 887–892, 2025年3月.

URL
Panatchakorn Anantaprayoon, 金子正弘, 岡崎直観. Mitigating Social Bias in Large Language Models by Self-Correction. 言語処理学会第31回年次大会 (NLP2025), Q2-22, pp. 863–868, 2025年3月.

URL
服部翔, 岡崎直観, 水木栄, 藤井一喜, 中村泰士, 大井聖也, 塩谷泰平, 齋藤幸史郎, Youmi Ma, 前田航希, 岡本拓己, 石田茂樹, 横田理央, 高村大也. Swallowコーパスv2: 教育的な日本語ウェブコーパスの構築. 言語処理学会第31回年次大会 (NLP2025), C1-5, pp. 94–99, 2025年3月.

URL
前田航希, 長谷川騎平, 栗田修平, 小田悠介, 徳久良子, 岡崎直観. 日本の文化常識・日常生活知識理解のための視覚言語ベンチマーク MECHA-Ja の構築. 情報処理学会第263回自然言語処理研究会研究報告 (2024-NL-263), 28, pp. 1–7, 2025年3月.

URL
高橋侑成, 馬尤咪, 金子正弘, 岡崎直観. 大規模言語モデルに対する漏洩検出への敵対的なデータ隠蔽. 第19回YANSシンポジウム (YANS2024), S4-P22, 2024年9月.

URL
服部翔, 水木栄, 藤井一喜, 中村泰士, 大井聖也, 馬尤咪, 前田航希, 塩谷泰平, 齋藤幸史郎, 岡本拓己, 石田茂樹, 横田理央, 高村大也, 岡崎直観. 小規模で高性能なLLMのための高品質事前学習Webコーパスの構築. 第19回YANSシンポジウム (YANS2024), S3-P33, 2024年9月.

URL
大井聖也, 金子正弘, 岡崎直観, 井上中順. マルチモーダルモデル自動評価のための複数タスク・複数基準評価データセット. 第19回YANSシンポジウム (YANS2024), S1-P28, 2024年9月.

URL
塩谷泰平, 金子正弘, 岡崎直観. 大規模言語モデルによる日本文化に沿った指示データ生成. 第19回YANSシンポジウム (YANS2024), S1-P25, 2024年9月.

URL
齋藤幸史郎, 小池隆斗, 金子正弘, 岡崎直観. 強化学習を用いた、言語理解能力を維持したLLM検出器の性能向上. 第19回YANSシンポジウム (YANS2024), S1-P23, 2024年9月.

URL
齋藤幸史郎, 水木栄, 大井聖也, 中村泰士, 塩谷泰平, 前田航希, 馬尤咪, 服部翔, 藤井一喜, 岡本拓己, 石田茂樹, 高村大也, 横田理央, 岡崎直観. LLMに日本語テキストを学習させる意義. 情報処理学会第261回自然言語処理研究会研究報告 (2024-NL-261), 12, pp. 1–15, 2024年9月.

URL
加藤靖久, 金子正弘, 岡崎直観. 多言語プロンプト: 低資源言語での多言語例を用いたfew-shot推論. 2024年度人工知能学会全国大会 (JSAI2024), pp. 4Xin2110-4Xin2110, 2024年5月. (doi: 10.11517/pjsai.JSAI2024.0_4Xin2110)

DOI
栗原健太郎, 三田雅人, 張培楠, 佐々木翔大, 石上亮介, 岡崎直観. LCTG Bench: 日本語LLMの制御性ベンチマークの構築. 言語処理学会第30回年次大会 (NLP2024), D11-2, pp. 3113–3118, 2024年3月.

URL
大井聖也, 金子正弘, 小池隆斗, Mengsay Loem, 岡崎直観. 大規模言語モデルにおける評価バイアスの尤度に基づく緩和. 言語処理学会第30回年次大会 (NLP2024), A11-4, pp. 3021–3026, 2024年3月.

URL
綿祐貴, 金子正弘, Youmi Ma, 岡崎直観. 大規模言語モデルに対するサンプリングを活用したメンバーシップ推論攻撃. 言語処理学会第30回年次大会 (NLP2024), A11-3, pp. 3015–3020, 2024年3月.

URL
平岡達也, 岡崎直観. 事前学習済みの分散表現は表層的な知識を獲得しているか. 言語処理学会第30回年次大会 (NLP2024), P10-6, pp. 2880–2885, 2024年3月.

URL
Mengsay Loem, 金子正弘, 岡崎直観. 敵対的発言を取り入れた議論による言語モデルの学習強化と推論力の向上. 言語処理学会第30回年次大会 (NLP2024), B10-6, pp. 2750–2755, 2024年3月.

URL
藤井一喜, 中村泰士, Mengsay Loem, 飯田大貴, 大井聖也, 服部翔, 平井翔太, 水木栄, 横田理央, 岡崎直観. 継続事前学習による日本語に強い大規模言語モデルの構築. 言語処理学会第30回年次大会 (NLP2024), A8-5, pp. 2102–2107, 2024年3月.

URL
前田航希, 栗田修平, 宮西大樹, 岡崎直観. 視覚的文脈を利用した視覚言語モデルによる画像キャプション生成自動評価手法. 言語処理学会第30回年次大会 (NLP2024), P7-10, pp. 1996–2001, 2024年3月.

URL
水木栄, 飯田大貴, 藤井一喜, 中村泰士, Mengsay Loem, 大井聖也, 服部翔, 平井翔太, 横田理央, 岡崎直観. 大規模言語モデルの日本語能力の効率的な強化: 継続事前学習における語彙拡張と対訳コーパスの活用. 言語処理学会第30回年次大会 (NLP2024), A6-4, pp. 1514–1519, 2024年3月.

URL
岡崎直観, 服部翔, 平井翔太, 飯田大貴, 大井聖也, 藤井一喜, 中村泰士, Mengsay Loem, 横田理央, 水木栄. Swallowコーパス: 日本語大規模ウェブコーパス. 言語処理学会第30回年次大会 (NLP2024), A6-1, pp. 1498–1503, 2024年3月.

URL
古山翔太, 永田亮, 高村大也, 岡崎直観. 文法誤り訂正の自動評価のための原文・参照文・訂正文間のN-gram F-score. 言語処理学会第30回年次大会 (NLP2024), P4-25, pp. 1198–1203, 2024年3月.

URL
小池隆斗, 金子正弘, 岡崎直観. 制約が異なる指示で生成された文章に対するLLM生成検出の頑健性. 言語処理学会第30回年次大会 (NLP2024), A4-4, pp. 943–948, 2024年3月.

URL
Youmi Ma, An Wang, 岡崎直観. 言語横断ラベル射影を用いた日本語文書レベル関係抽出データセットの構築. 言語処理学会第30回年次大会 (NLP2024), P3-4, pp. 783–788, 2024年3月.

URL
小池隆斗, 金子正弘, 岡崎直観. 敵対的事例を用いたIn-context learningによるLLM生成エッセイの検出. 第18回NLP若手の会シンポジウム, S3-P13, 2023年8月.
Youmi Ma, An Wang, 岡崎直観. 日本語文書レベル関係抽出コーパスの構築. 第18回NLP若手の会シンポジウム, S5-P19, 2023年8月.
平井翔太, 村岡雅康, 岡崎直観. 割り当て画像の多様性を考慮したVokenizationによるマスク言語モデルの改善. 2023年度人工知能学会全国大会 (JSAI2023), 4Xin1-38, pp. (4 pages), 2023年6月. (doi: 10.11517/pjsai.JSAI2023.0_4Xin138)

DOI
丹羽彩奈, 岡崎直観. 事前学習済みモデルT5における近傍分布の有効性の調査. 言語処理学会第29回年次大会 (NLP2023), P12-6, pp. 3048–3053, 2023年3月.

URL
浦川通, 新妻巧朗, 田口雄哉, 田森秀明, 岡崎直観, 乾健太郎. 短歌における言語モデルの実応用–歌人の視点を通した生成と作歌支援の実践から–. 言語処理学会第29回年次大会 (NLP2023), P11-6, pp. 2779–2784, 2023年3月.

URL
谷口大輔, 脇本宏平, 丹羽彩奈, 岡崎直観. 大規模言語モデルにおける文生成方向に関する依存性の検証. 言語処理学会第29回年次大会 (NLP2023), H9-1, pp. 2200–2205, 2023年3月.

URL
中本裕大, 瀬在恭介, 元川凱喜, 麻生英樹, 岡崎直観. 日本語大規模言語モデルにおける知識グラフを活用した意味理解性能の向上. 言語処理学会第29回年次大会 (NLP2023), B9-4, pp. 2140–2145, 2023年3月.

URL
Mengsay Loem, 高瀬翔, 金子正弘, 岡崎直観. マルチヘッドニューラルN-gramによる自己注意機構の代替. 言語処理学会第29回年次大会 (NLP2023), A9-1, pp. 2094–2099, 2023年3月.

URL
Panatchakorn Anantaprayoon, 金子正弘, 岡崎直観. 下流タスクでの日本語事前学習モデルの性別バイアスの評価. 言語処理学会第29回年次大会 (NLP2023), A7-3, pp. 1563–1568, 2023年3月.

URL
服部翔, Youmi Ma, 岡崎直観. クエリ指向要約におけるクエリと要約の統合的な生成. 言語処理学会第29回年次大会 (NLP2023), H5-2, pp. 1244–1249, 2023年3月.

URL
金子正弘, Graham Neubig, 岡崎直観. 人間とシステムの議論に基づくNLPタスクの問題に対する予測. 言語処理学会第29回年次大会 (NLP2023), H4-5, pp. 979–983, 2023年3月.

URL
水木栄, 岡崎直観. 埋め込み表現の意味適応による知識ベース語義曖昧性解消. 言語処理学会第29回年次大会 (NLP2023), C3-1, pp. 622–627, 2023年3月.

URL
Youmi Ma, An Wang, 岡崎直観. 文書レベル関係抽出における根拠認識の統合. 言語処理学会第29回年次大会 (NLP2023), B3-3, pp. 605–610, 2023年3月.

URL
遠藤洸亮, Zhishen Yang, 岡崎直観. 画像キャプション生成におけるJPEG圧縮への頑健性の改善. 言語処理学会第29回年次大会 (NLP2023), P2-2, pp. 419–424, 2023年3月.

URL
飯田大貴, 岡崎直観. 事前学習済みモデルに基づく検索モデルにおけるドメイン適応手法の比較と相乗効果の検証. 言語処理学会第29回年次大会 (NLP2023), P1-9, pp. 176–181, 2023年3月.

URL
飯田大貴, 岡崎直観. 疎ベクトル検索における語彙と単語頻度のギャップ解消を通じた教師なしドメイン適合. 第17回NLP若手の会シンポジウム, P4-08, 2022年8月.
馬尤咪, 王安, 岡崎直観. 文書レベル関係抽出における人間と注意機構の根拠文の対応付け. 第17回NLP若手の会シンポジウム, P2-03, 2022年8月.
古山翔太, 永田亮, 高村大也, 岡崎直観. 日本語誤り訂正のための誤り区間と誤り種類の自動アノテーションに向けて. 第17回NLP若手の会シンポジウム, P4-09, 2022年8月.
Mengsay Loem, 高瀬翔, 岡崎直観. Are Neighbors Enough? Multi-Head Neural n-gram can be Alternative to Self-attention. 第17回NLP若手の会シンポジウム, P5-07, 2022年8月.
谷口大輔, 脇本宏平, 黒田和矢, 川本峻頌, 西口佳佑, 丹羽彩奈, 岡崎直観. 商品レビューと商品特徴を用いた広告文制作支援. 2022年度人工知能学会全国大会 (JSAI2022), 3Yin2-07, pp. (4 pages), 2022年6月. (doi: 10.11517/pjsai.JSAI2022.0_3Yin207)

DOI
植木滉一郎, 平岡達也, 岡崎直観. 記事に忠実ではない訓練事例も活用した見出し生成モデルの忠実性の改善法. 言語処理学会第28回年次大会 (NLP2022), pp. 2002–2007, 2022年3月.

URL
平岡達也, 高瀬翔, 内海慶, 欅惇志, 岡崎直観. 単語の長さと構成要素を考慮した単語レベルの摂動. 言語処理学会第28回年次大会 (NLP2022), pp. 1455–1460, 2022年3月.

URL
前田航希, 金子正弘, 岡崎直観. IMPARA: パラレルデータにおける修正の影響度に基づいた文法誤り訂正の自動評価法. 言語処理学会第28回年次大会 (NLP2022), pp. 328–333, 2022年3月.

URL
吉川和, 岡崎直観. 確信度を考慮した言語モデルの関係知識評価. 言語処理学会第28回年次大会 (NLP2022), pp. 532–537, 2022年3月.

URL
Loem Mengsay, 高瀬翔, 金子正弘, 岡崎直観. 抽出型要約と言い換えによる生成型要約の訓練データ拡張. 言語処理学会第28回年次大会 (NLP2022), pp. 1996–2001, 2022年3月.

URL
Youmi Ma, 平岡達也, 岡崎直観. 畳み込みニューラルネットワークを用いた表ラベリングによる固有表現認識と関係抽出 . 言語処理学会第28回年次大会 (NLP2022), pp. 1197–1202, 2022年3月.

URL
石川遼伍, 丹羽彩奈, 水木栄, 岡崎直観. 疑似訓練データによる格助詞の省略に頑健な係り受け解析. 言語処理学会第28回年次大会 (NLP2022), pp. 1808–1813, 2022年3月.

URL
丹羽彩奈, 高瀬翔, 岡崎直観. 近傍の事例を用いた非自己回帰生成. 言語処理学会第28回年次大会 (NLP2022), pp. 1306–1311, 2022年3月.

URL
嘉田紗世, 山野陽祐, 新美茜, 田森秀明, 小海則人, 岡崎直観, 乾健太郎. 動画タイトルを用いたサムネイル画像の自動選択手法の提案. 言語処理学会第28回年次大会 (NLP2022), pp. 1366–1370, 2022年3月.

URL
浦川通, 新妻巧朗, 田口雄哉, 田森秀明, 岡崎直観, 乾健太郎. モーラを考慮したFine-tuningによる口語短歌生成. 言語処理学会第28回年次大会 (NLP2022), pp. 1328–1332, 2022年3月.

URL
水木栄, 岡崎直観. 階層コード表現を用いた上位下位関係の識別. 言語処理学会第27回年次大会 (NLP2021), pp. 1236–1241, 2021年3月.

URL
平岡達也, 高瀬翔, 内海慶, 欅惇志, 岡崎直観. 後段モデルの損失値を用いた単語分割のタスクへの最適化. 言語処理学会第27回年次大会 (NLP2021), pp. 486–491, 2021年3月.

URL
丹羽彩奈, 西口佳佑, 岡崎直観. 文脈を考慮した対義語穴埋め. 言語処理学会第27回年次大会 (NLP2021), pp. 1702–1707, 2021年3月.

URL
笹沢裕一, 岡崎直観. 属性情報を追加した事前学習済みモデルのファインチューニング. 言語処理学会第27回年次大会 (NLP2021), pp. 765–770, 2021年3月.

URL
昇夏海, 平岡達也, 丹羽彩奈, 西口佳佑, 岡崎直観. 企業情報を考慮したキャッチコピーの自動生成. 言語処理学会第27回年次大会 (NLP2021), pp. 450–454, 2021年3月.

URL
Youmi Ma, 平岡達也, 岡崎直観. BERTを用いたTable-Fillingによる固有表現抽出と関係抽出. 言語処理学会第27回年次大会 (NLP2021), pp. 1274–1279, 2021年3月.

URL
古山翔太, 高村大也, 岡崎直観. ニューラル文法誤り訂正のための多様な規則を用いる人工誤り生成. 言語処理学会第27回年次大会 (NLP2021), pp. 1017–1022, 2021年3月.

URL
山田康輔, 人見雄太, 田森秀明, 岡崎直観, 乾健太郎. 指定語句を確実に含む見出し生成. 言語処理学会第27回年次大会 (NLP2021), pp. 1070–1074, 2021年3月.

URL
叶内晨, 根石将人, 林部祐太, 大内啓樹, 岡崎直観. 宿の推薦根拠説明システムにおける魅力度の考慮と実用を見据えた評価. 言語処理学会第27回年次大会 (NLP2021), pp. 461–465, 2021年3月.

URL
丹羽彩奈, 脇本宏平, 西口佳佑, 毛利真崇, 岡崎直観. 単語の対応関係を利用したスパン候補の絞り込みによるキャッチコピーの対句構造解析. 第34回人工知能学会全国大会 (JSAI2020), pp. (4 pages), 2020年6月. (doi: 10.11517/pjsai.JSAI2020.0_1E5GS901)

DOI
人見雄太, 田口雄哉, 田森秀明, 岡崎直観, 乾健太郎. 小規模リソースにおける生成型要約のためのスタイル転移. 言語処理学会第26回年次大会 (NLP2020), pp. 929–932, 2020年3月.

URL
叶内晨, 根石将人, 林部祐太, 岡崎直観. 旅行情報サイトのレビューを用いた抽象的な要求に対する根拠付き推薦文の生成. 言語処理学会第26回年次大会 (NLP2020), pp. 29–32, 2020年3月.

URL
陳宏, 西田典起, 朱中元, 岡崎直観, 中山英樹. RST Discourse Structure Improves Story Ending Generation. 言語処理学会第26回年次大会 (NLP2020), pp. 21–24, 2020年3月.

URL
高瀬翔, 岡崎直観. 翻訳と見出し文生成の同時学習による言語横断見出し文生成モデル. 言語処理学会第26回年次大会 (NLP2020), pp. 1471–1474, 2020年3月.

URL
平岡達也, 高瀬翔, 内海慶, 欅惇志, 岡崎直観. RNNにより高次の依存を考慮したニューラル隠れマルコフモデル. 言語処理学会第26回年次大会 (NLP2020), pp. 1332–1335, 2020年3月.

URL
松丸和樹, 高瀬翔, 岡崎直観. 見出し生成の忠実性の改善. 言語処理学会第26回年次大会 (NLP2020), pp. 933–936, 2020年3月.

URL
丹羽彩奈, 脇本宏平, 西口佳佑, 毛利真崇, 岡崎直観. キャッチコピーにおける対句構造の解析. 言語処理学会第26回年次大会 (NLP2020), pp. 601–604, 2020年3月.

URL
平岡達也, 高瀬翔, 内海慶, 欅惇志, 岡崎直観. RNNによる遷移確率計算を用いた隠れマルコフモデル. 第242回自然言語処理研究会, 2019-NL-242(2), pp. 1–6, 2019年10月.

URL
丹羽彩奈, 岡崎直観, 西口佳佑, 亀山千尋, 毛利真崇. 修辞技法を考慮したキャッチコピー自動生成に向けた研究. 第14回NLP若手の会シンポジウム, 63, 2019年8月.

Poster
平岡達也, 高瀬翔, 岡崎直観. RNNによる遷移確率計算を用いた隠れマルコフモデル. 第14回NLP若手の会シンポジウム, 79, 2019年8月.
高瀬翔, 岡崎直観. 機械翻訳と要約生成の統一モデルによる言語横断見出し文生成. 第14回NLP若手の会シンポジウム, 85, 2019年8月.
松丸和樹, 高瀬翔, 岡崎直観. 含意関係に基づく見出し生成タスクの見直し. 第240回自然言語処理研究会, 2019-NL-240(1), pp. 1–8, 2019年6月.

URL
島津翔, 高瀬翔, 中澤敏明, 岡崎直観. 文脈を考慮した日英機械翻訳に向けた評価データの構築. 言語処理学会第25回年次大会 (NLP2019), pp. 5–8, 2019年3月.

URL
笹沢裕一, 高瀬翔, 岡崎直観. 対話型質問応答の省略補完. 言語処理学会第25回年次大会 (NLP2019), pp. 163–166, 2019年3月.

URL
晩鴻翔, 岡崎直観. 語りに基づく認知症傾向判別. 言語処理学会第25回年次大会 (NLP2019), pp. 501–504, 2019年3月.

URL
丹羽彩奈, 岡崎直観, 西口佳佑, 亀山千尋, 毛利真崇. キャッチコピーの自動生成に向けた分析. 言語処理学会第25回年次大会 (NLP2019), pp. 558–561, 2019年3月.

URL
高瀬翔, 岡崎直観. 位置エンコーディングを用いた出力長制御. 言語処理学会第25回年次大会 (NLP2019), pp. 687–690, 2019年3月.

URL
磯颯, 上原由衣, 石垣達也, 能地宏, 荒牧英治, 小林一郎, 宮尾祐介, 岡崎直観, 高村大也. Data-to-Textにおける主題遷移のモデル化. 言語処理学会第25回年次大会 (NLP2019), pp. 727–730, 2019年3月.

URL
人見雄太, 田口雄哉, 田森秀明, 菊田洸, 西鳥羽二郎, 岡崎直観, 乾健太郎, 奥村学. 出力長制御を考慮した見出し生成モデルのための大規模コーパス. 言語処理学会第25回年次大会 (NLP2019), pp. 1225–1228, 2019年3月.

URL
塙一晃, 佐々木彬, 岡崎直観, 乾健太郎. Wikipediaから獲得した外部知識を用いた賛否分類. 第237回自然言語処理研究会, 2018-NL-237(6), pp. 1–8, 2018年9月.

URL
鈴木正敏, 松田耕史, 岡崎直観, 乾健太郎. 読解による解答可能性を付与した質問応答データセットの構築. 言語処理学会第24回年次大会 (NLP2018), pp. 702–705, 2018年3月.

URL
伊藤拓海, 山口健史, 田然, 松田耕史, 岡崎直観, 乾健太郎. 自治体FAQの比較マイニング. 言語処理学会第24回年次大会 (NLP2018), pp. 536–539, 2018年3月.

URL
阿部香央莉, 松林優一郎, 岡崎直観, 乾健太郎. ニューラルネットを用いた多方言の翻訳と類型分析. 言語処理学会第24回年次大会 (NLP2018), pp. 304–307, 2018年3月.

URL
清野舜, 高瀬翔, 鈴木潤, 岡崎直観, 乾健太郎, 永田昌明. ニューラルヘッドライン生成における誤生成問題の改善. 言語処理学会第24回年次大会 (NLP2018), pp. 1–4, 2018年3月.

URL
松田耕史, 岡崎直観, 乾健太郎. クラウドソーシングを系に組み込んだテキストからの関係知識抽出. 第12回NLP若手の会シンポジウム, P17, 2017年9月.
伊藤拓海, 鈴木正敏, 田然, 山口健史, 岡崎直観, 乾健太郎. 自治体QAサービスのためのFAQの自治体間の横断的解析. 第12回NLP若手の会シンポジウム, P19, 2017年9月.
塙一晃, 佐々木彬, 岡崎直観, 乾健太郎. トピックに関する因果関係知識を利用した賛否分類. 第12回NLP若手の会シンポジウム, P28, 2017年9月.
鈴木正敏, 松田耕史, 岡崎直観, 乾健太郎. Wikipediaを知識源に用いた文書検索と読解によるクイズ解答システム. 第12回NLP若手の会シンポジウム, P46, 2017年9月.

Non-refereed papers

Keito Sasagawa, Koki Maeda, Issa Sugiura, Shuhei Kurita, Naoaki Okazaki, Daisuke Kawahara. Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model, 2024年.

URL arXiv
Masahiro Kaneko, Youmi Ma, Yuki Wata, Naoaki Okazaki. Sampling-based Pseudo-Likelihood for Membership Inference Attacks, 2024年.

arXiv
Wiem Ben Rim, Carolin Lawrence, Kiril Gashteovski, Mathias Niepert, Naoaki Okazaki. Behavioral Testing of Knowledge Graph Embedding Models for Link Prediction. Proceedings of the Fifth Widening Natural Language Processing Workshop (WiNLP2021), 2021年11月.
Zhishen Yang, Tosho Hirasawa, Mamoru Komachi, Naoaki Okazaki. Do Videos Guide Translations? Evaluation on Video-guided Machine Translation dataset. Visually Grounded Interaction and Language (ViGIL), 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2021) workshop, 2021年6月.

URL
Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, Naoaki Okazaki. Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020. First Workshop on Advances in Language and Vision Research (ALVR 2020), ACL 2020, 2020年7月.

arXiv
Youmi Ma, Tatsuya Hiraoka, Naoaki Okazaki. Named Entity Recognition and Relation Extraction using Enhanced Table Filling by Contextualized Representations, 2020年.

arXiv