Publications - ELION Lab

2026

Knowledge-Based Systems

🌟 SERA: Self-referential Assessment Framework for Bidirectional Generative Commonsense Reasoning

Jaehyung Seo, Hyeonseok Moon, Yoonna Jang, Heuiseok Lim*

Knowledge-Based Systems, Volume 345, 116152, 2026

Paper

ACL 2026 (Oral + Best Paper Award Nomination)

🌟 No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand

Jimin Jung, MyoungJin Kim, Jaehyung Seo*, Heuiseok Lim*

Annual Meeting of the Association for Computational Linguistics (ACL), 2026

Paper (TBD)

ACL 2026 (Oral)

🌟 HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering

Joongmin Shin, Gyuho Shim, Jeongbae Park, Jaehyung Seo*, Heuiseok Lim*

Annual Meeting of the Association for Computational Linguistics (ACL), 2026

Paper (TBD)

ACL 2026

MMAC: A Multilingual, Multimodal Alignment Framework for Cultural Grounding Evaluation

Weihua Zheng, Zhengyuan Liu, Tanmoy Chakraborty, Weiwen Xu, Xiaoxue Gao, Bryan Chen Zhengyu Tan, Bowei Zou, Chang Liu, Yujia Hu, Xing Xie, Xiaoyuan Yi, Jing Yao, Chaojun Wang, Long Li, Rui Liu, Huiyao Liu, Koji Inoue, Ryuichi Sumida, Tatsuya Kawahara, Fan Xu, Lingyu Ye, Wei Tian, Dongjun Kim, Jimin Jung, Jaehyung Seo, Nadya Yuki Wangsajaya, Pham Minh Duc, Ojasva Saxena, Palash Nandi, Xiyan Tao, Wiwik Karlina, Tuan Luong, Keertana Arun Vasan, Roy Ka-Wei Lee, Nancy F. Chen

Annual Meeting of the Association for Computational Linguistics (ACL), 2026

Paper (TBD)

CVPR 2026 (Highlight)

🌟 Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Uncertainty Estimation

Yongchan Chun, Chanhee Park, Jeongho Yoon, Jaehyung Seo*, Heuiseok Lim*

Conference on Computer Vision and Pattern Recognition (CVPR), 2026

Paper (TBD)

Evidential Transformation Network Figure

CVPR 2026

🌟 M3DocDep: Multi-modal, Multi-page, Multi-document Dependency Chunking with Large Vision-Language Models

Joongmin Shin, Jeongbae Park, Jaehyung Seo*, Heuiseok Lim*

Conference on Computer Vision and Pattern Recognition (CVPR), 2026

Paper (TBD)

2025

WBL 2025

VAETKI Technical Report

NC-AI Consortium

Soverign AI Foundation Model Project, 2025

Paper Model

EMNLP 2025

🌟 The Impact of Negated Text on Hallucination with Large Language Models

Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP), 2025

Paper

EMNLP 2025 Findings

🌟 KoLEG: On-the-Fly Korean Legal Knowledge Editing with Continuous Retrieval

Jaehyung Seo, Dahyun Jung, Jaewook Lee, Yongchan Chun, Dongjun Kim, Hwijung Ryu, Donghoon Shin, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP) Findings, 2025

Paper

EMNLP 2025

🌟 MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents

Joong Min Shin, Chanjun Park, Jeongbae Park, Jaehyung Seo*, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP), 2025

Paper

EMNLP 2025

🌟 Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models

Hyeonseok Moon, Seongtae Hong, Jaehyung Seo*, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP), 2025

Paper

EMNLP 2025 Findings

LimaCost: Data Valuation for Instruction Tuning of Large Language Models

Hyeonseok Moon, Jaehyung Seo, Seonmin Koo, Jinsung Kim, Youngkyoung Ham, Jiwon Moon, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP) Findings, 2025

Paper

ICLR 2025

🌟 K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models

Jaehyung Seo, Heuiseok Lim*

International Conference on Learning Representations (ICLR), 2025

Paper

NAACL 2025 Findings

Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models

Hyeonseok Moon, Jaehyung Seo, Seungyoon Lee, Chanjun Park*, Heuiseok Lim*

North American Chapter of the ACL (NAACL) Findings, 2025

Paper

NAACL 2025

CoME: An Unlearning-based Approach to Conflict-free Model Editing

Dahyun Jung, Jaehyung Seo, Jaewook Lee, Chanjun Park*, Heuiseok Lim*

North American Chapter of the ACL (NAACL), 2025

Paper

Expert Systems with Applications

An analysis on language transfer of pre-trained language model with cross-lingual post-training

Suhyune Son, Chanjun Park, Jungseob Lee, Midan Shim, Chanhee Lee, Yoonna Jang, Jaehyung Seo, Heuiseok Lim*

Expert Systems with Applications, 2025

Paper

2024

HCLT 2024 🏆 Best Paper

🌟 Post-negation Text Induce New Hallucinations in Large Language Models

Jaehyung Seo, Aram So, Heuiseok Lim*

Annual Conference on Human and Cognitive Language Technology (HCLT), 2024

Paper

ACL 2024 Findings

🌟 KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models

Jaehyung Seo, Jaewook Lee, Chanjun Park, SeongTae Hong, Seungjun Lee, Heuiseok Lim*

Annual Meeting of the Association for Computational Linguistics (ACL) Findings, 2024

Paper

EACL 2024 Findings

Hyper-BTS Dataset: Scalability and Enhanced Analysis of Back TranScription (BTS) for ASR Post-Processing

Chanjun Park, Jaehyung Seo, Seolhwa Lee, Junyoung Son, Hyeonseok Moon, Sugyeong Eo, Chanhee Lee, Heuiseok Lim*

European Chapter of the ACL (EACL) Findings, 2024

Paper

EACL 2024 Findings

Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair Generation

Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Jaehyung Seo, Heuiseok Lim*

European Chapter of the ACL (EACL) Findings, 2024

Paper

EMNLP 2024 Industry

Intelligent Predictive Maintenance RAG framework for Power Plants: Enhancing QA with StyleDFS and Domain Specific Instruction Tuning

Seongtae Hong, Joong Min Shin, Jaehyung Seo, Taemin Lee, Jeongbae Park, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP) Industry Track, 2024

Paper

ACL 2024 Findings

Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation

Jungseob Lee, Hyeonseok Moon, Seungjun Lee, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Seonmin Koo, Heuiseok Lim*

Annual Meeting of the Association for Computational Linguistics (ACL) Findings, 2024

Paper

LREC-COLING 2024

Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean

Seungyoon Lee, Chanjun Park, Dahyun Jung, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo, Heuiseok Lim*

Language Resources and Evaluation Conference (LREC-COLING), 2024

Paper

LREC-COLING 2024

Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation

Sugyeong Eo, Jungwoo Lim, Chanjun Park, Dahyun Jung, Seonmin Koo, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim*

Language Resources and Evaluation Conference (LREC-COLING), 2024

Paper

IEEE Access

Exploiting Hanja-Based Resources in Processing Korean Historic Documents Written by Common Literati

Hyeonseok Moon, Myunghoon Kang, Yoonseok Choi, Hyunjoong Kim, Jaehyung Seo, Heuiseok Lim*

IEEE Access, 2024

Paper

2023

EMNLP 2023

🌟 CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients

Jaehyung Seo, Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP), 2023

Paper

EMNLP 2023

KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing

Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP), 2023

Paper

EMNLP 2023 Findings

CReTIHC: Designing Causal Reasoning Tasks about Temporal Interventions and Hallucinated Confoundings

Changwoo Chun, SongEun Lee, Jaehyung Seo, Heuiseok Lim*

Empirical Methods in Natural Language Processing (EMNLP) Findings, 2023

Paper

Expert Systems with Applications

Doubts on the reliability of parallel corpus filtering

Hyeonseok Moon, Chanjun Park, Seonmin Koo, Jungseob Lee, Seungjun Lee, Jaehyung Seo, Sugyeong Eo, Yoonna Jang, Heuiseok Lim*

Expert Systems with Applications, 2023

Paper

IJCNLP-AACL 2023

Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error Detection

Dahyun Jung, Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim*

International Joint Conference on Natural Language Processing and Conference of the Asia-Pacific Chapter of the ACL (IJCNLP-AACL), 2023

Paper

IEEE Access

Uncovering the Risks and Drawbacks Associated with the Use of Synthetic Data for Grammatical Error Correction

Seonmin Koo, Chanjun Park, Seolhwa Lee, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim*

IEEE Access, 2023

Paper

ACL 2023 Demo

PEEP-Talk: A Situational Dialogue-based Chatbot for English Education

Seungjun Lee, Yoonna Jang, Chanjun Park, Jungseob Lee, Jaehyung Seo, Hyeonseok Moon, Sugyeong Eo, Seounghoon Lee, Bernardo Yahya, Heuiseok Lim*

Annual Meeting of the Association for Computational Linguistics (ACL) Demo, 2023

Paper

2022

Knowledge-Based Systems

🌟 PU-GEN: Enhancing generative commonsense reasoning for language models with human-centered knowledge

Jaehyung Seo, Dongsuk Oh, Sugyeong Eo, Chanjun Park, Kisu Yang, Hyeonseok Moon, Kinam Park, Heuiseok Lim*

Knowledge-Based Systems, 2022

Paper

IEEE Access

🌟 Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot Learners

Jaehyung Seo, Hyeonseok Moon, Chanhee Lee, Sugyeong Eo, Chanjun Park, Jihoon Kim, Changwoo Chun, Heuiseok Lim*

IEEE Access, 2022

Paper

NAACL 2022 Findings

🌟 A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation

Jaehyung Seo*, Seounghoon Lee*, Chanjun Park, Yoonna Jang, Hyeonseok Moon, Sugyeong Eo, Seonmin Koo, Heuiseok Lim*

North American Chapter of the ACL (NAACL) Findings, 2022

Paper

Mathematics

🌟 Dense-to-Question and Sparse-to-Answer: Hybrid Retriever System for Industrial Frequently Asked Questions

Jaehyung Seo, Taemin Lee, Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Imatitikua D Aiyanyo, Kinam Park, Aram So, Sungmin Ahn, Jeongbae Park*

Mathematics, 2022

Paper

COLING 2022

QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation

Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Gyeongmin Kim, Jungseob Lee, Heuiseok Lim*

International Conference on Computational Linguistics (COLING), 2022

Paper

AACL 2022 Demo

PicTalky: Augmentative and Alternative Communication for Language Developmental Disabilities

Chanjun Park, Yoonna Jang, Seolhwa Lee, Jaehyung Seo, Kisu Yang, Heuiseok Lim*

Asia-Pacific Chapter of the ACL (AACL) Demo, 2022

Paper

LREC 2022

Priming Ancient Korean Neural Machine Translation

Chanjun Park, Seolhwa Lee, Jaehyung Seo, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim*

Language Resources and Evaluation Conference (LREC), 2022

Paper

LREC 2022

Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editing

Hyeonseok Moon, Chanjun Park, Seolhwa Lee, Jaehyung Seo, Jungseob Lee, Sugyeong Eo, Heuiseok Lim*

Language Resources and Evaluation Conference (LREC), 2022

Paper

IEEE Access

Word-Level Quality Estimation for Korean-English Neural Machine Translation

Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim*

IEEE Access, 2022

Paper

IEEE Access

An Automatic Post Editing with Efficient and Simple Data Generation Method

Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim*

IEEE Access, 2022

Paper

IEEE Access

Utilization Strategy of User Engagements in Korean Fake News Detection

Myunghoon Kang, Jaehyung Seo, Chanjun Park, Heuiseok Lim*

IEEE Access, 2022

Paper

2021

HCLT 2021 🏆 Outstanding Paper

🌟 KommonGen: A Dataset for Korean Generative Commonsense Reasoning Evaluation

Jaehyung Seo, Chanjun Park, Hyeonseok Moon, Sugyeong Eo, Myunghoon Kang, Seounghoon Lee, Heuiseok Lim*

Annual Conference on Human and Cognitive Language Technology (HCLT), 2021

Paper

WAT 2021

BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text

Chanjun Park, Jaehyung Seo, Seolhwa Lee, Chanhee Lee, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim*

Workshop on Asian Translation (WAT), 2021

Paper

IEEE Access

Grounded Vocabulary for Image Retrieval Using a Modified Multi-Generator Generative Adversarial Network

Kuekyeng Kim, Jaehyung Seo, Heuiseok Lim*

IEEE Access, 2021

Paper

IEEE Access

An Empirical Study on Automatic Post Editing for Neural Machine Translation

Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim*

IEEE Access, 2021

Paper

NeurIPS DCAI Workshop

Automatic Knowledge Augmentation for Generative Commonsense Reasoning

Jaehyung Seo, Chanjun Park, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim*

NeurIPS Data-Centric AI Workshop, 2021

Paper

Citing Our Work

If you use our research in your work, please cite the relevant papers. Click the "Paper" or "PDF" link to access the publication.

* denotes corresponding author

논문

2026

🌟 SERA: Self-referential Assessment Framework for Bidirectional Generative Commonsense Reasoning

🌟 No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand

🌟 HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering

MMAC: A Multilingual, Multimodal Alignment Framework for Cultural Grounding Evaluation

🌟 Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Uncertainty Estimation

🌟 M3DocDep: Multi-modal, Multi-page, Multi-document Dependency Chunking with Large Vision-Language Models

2025

VAETKI Technical Report

🌟 The Impact of Negated Text on Hallucination with Large Language Models

🌟 KoLEG: On-the-Fly Korean Legal Knowledge Editing with Continuous Retrieval

🌟 MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents

🌟 Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models

LimaCost: Data Valuation for Instruction Tuning of Large Language Models

🌟 K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models

Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models

CoME: An Unlearning-based Approach to Conflict-free Model Editing

An analysis on language transfer of pre-trained language model with cross-lingual post-training

2024

🌟 Post-negation Text Induce New Hallucinations in Large Language Models

🌟 KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models

Hyper-BTS Dataset: Scalability and Enhanced Analysis of Back TranScription (BTS) for ASR Post-Processing

Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair Generation

Intelligent Predictive Maintenance RAG framework for Power Plants: Enhancing QA with StyleDFS and Domain Specific Instruction Tuning

Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation

Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean

Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation

Exploiting Hanja-Based Resources in Processing Korean Historic Documents Written by Common Literati

2023

🌟 CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients

KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing

CReTIHC: Designing Causal Reasoning Tasks about Temporal Interventions and Hallucinated Confoundings

Doubts on the reliability of parallel corpus filtering

Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error Detection

Uncovering the Risks and Drawbacks Associated with the Use of Synthetic Data for Grammatical Error Correction

PEEP-Talk: A Situational Dialogue-based Chatbot for English Education

2022

🌟 PU-GEN: Enhancing generative commonsense reasoning for language models with human-centered knowledge

🌟 Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot Learners

🌟 A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation

🌟 Dense-to-Question and Sparse-to-Answer: Hybrid Retriever System for Industrial Frequently Asked Questions

QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation

PicTalky: Augmentative and Alternative Communication for Language Developmental Disabilities

Priming Ancient Korean Neural Machine Translation

Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editing

Word-Level Quality Estimation for Korean-English Neural Machine Translation

An Automatic Post Editing with Efficient and Simple Data Generation Method

Utilization Strategy of User Engagements in Korean Fake News Detection

2021

🌟 KommonGen: A Dataset for Korean Generative Commonsense Reasoning Evaluation

BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text

Grounded Vocabulary for Image Retrieval Using a Modified Multi-Generator Generative Adversarial Network

An Empirical Study on Automatic Post Editing for Neural Machine Translation

Automatic Knowledge Augmentation for Generative Commonsense Reasoning

Citing Our Work