About Me

Hello, I am Sugyeong Eo!
I am a Ph.D candidate in computer science and engineering at Korea university. I belong to NLP & AI Lab (Advisor: Prof. Heuiseok Lim). I am the founder and CSO of KU-NMT Group. Feel free to contact me!

Research Interest

Natural Language Processing, Mixture-of-Experts Modeling, Neural Machine Translation, Quality Estimation, Question Generation

Education

2020.09 - 2026.02: Graduate, Major in Computer Science and Engineering at Korea University
2016.02 - 2020.08: Undergraduate, Received B.A. degree, Major in Linguistics and Cognitive Science(1st), Language and Technology(2nd) at Hankuk University of Foreign Studies (HUFS)

Academic Services

Program committee: NAACL 2022-Industry Track
Program committee: ACL 2023
Program committee: EMNLP 2023
Program committee: NAACL 2024
Program committee: ACL/EMNLP 2025
Program committee: AAAI 2026

Publications

Top Conference

  1. [Mixture-of-Clustered-Experts: Advancing Expert Specialization and Generalization in Instruction Tuning]
    Sugyeong Eo, Jungjun Lee, Chanjun Park, Heuiseok Lim
    EMNLP 2025 (Oral)

  2. Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation
    Sugyeong Eo, Jungwoo Lim, Chanjun Park, Dahyun Jung, Seonmin Koo, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim
    LREC-COLING 2024 (Oral)

  3. Towards Diverse and Effective Question-Answer Pair Generation from Children Storybooks
    Sugyeong Eo, Hyeonseok Moon, Jinsung Kim, Yuna Hur, Jeongwook Kim, Songeun Lee, Changwoo Chun, Sungsoo Park, Heuiseok Lim
    ACL 2023 - Findings

  4. KU X Upstage’s submission for the WMT22 Quality Estimation: Critical Error Detection Shared Task
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim
    WMT 2022

  5. QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Gyeongmin Kim, Jungseob Lee, Heuiseok Lim
    COLING 2022

  6. Should we find another model?: Improving Neural Machine Translation Performance with ONE-Piece Tokenization Method without Model Modification
    Chanjun Park, Sugyeong Eo (Co-author), Hyeonseok Moon, Heuiseok Lim
    NAACL-HLT 2021 Industry Track

  7. Towards precise localization of critical errors in machine translation
    Dahyun Jung, Sugyeong Eo, Chanjun Park, Heuiseok Lim
    ACL 2024-Findings

  8. Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation
    Jungseob Lee, Hyeonseok Moon, Seungyoon Lee, Chanjun Park, Sugyeong Eo, Hyunwoong Ko, Jaehyung Seo, Seungyoon Lee, Heuiseok Lim
    ACL 2024-Findings

  9. Hyper-BTS Dataset: Scalability and Enhanced Analysis of Back TranScription (BTS) for ASR Post-Processing
    Chanjun Park, Jaehyung Seo, Seolhwa Lee, Junyoung Son, Hyeonseok Moon, Sugyeong Eo, Chanhee Lee, Heuiseok Lim
    EACL 2024-Findings

  10. Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair Generation
    Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Jaehyung Seo, Heuiseok Lim
    EACL 2024-Findings

  11. Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean
    Seungyoon Lee, Chanjun Park, DaHyun Jung, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo, Heuseok Lim
    LREC-COLING 2024

  12. KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing
    Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
    EMNLP 2023

  13. CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients
    Jaehyung Seo, Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Heuiseok Lim
    EMNLP 2023

  14. Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error Detection
    DaHyun Jung, Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim
    IJCNLP-AACL 2023

  15. PEEP-Talk: A Situational Dialogue-based Chatbot for English Education
    Seungjun Lee, Yoonna Jang, Chanjun Park, Jungseob Lee, Jaehyung Seo, Hyeonseok Moon, Sugyeong Eo, Seounghoon Lee, Bernardo Yahya, Heuiseok Lim
    ACL 2023 - Demo

  16. A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
    Jaehyung Seo, Seounghoon Lee, Chanjun Park, Yoonna Jang, Hyeonseok Moon, Sugyeong Eo, Seonmin Koo, Heuiseok Lim
    NAACL 2022 - Findings

  17. Priming Ancient Korean Neural Machine Translation
    Chanjun Park, Seolhwa Lee, Hyeonseok Moon, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim
    LREC 2022

  18. Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editing
    Hyeonseok Moon, Chanjun Park, Seolhwa Lee, Jaehyung Seo, Jeongsub Lee, Sugyeong Eo, Heuiseok Lim
    LREC 2022

Top Conference (Workshop)

  1. A New Tool for Efficiently Generating Quality Estimation Datasets
    Sugyeong Eo, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim
    NeurIPS 2021 - Data-centric AI (DCAI) workshop

  2. Dealing with the Paradox of Quality Estimation
    Sugyeong Eo, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim
    MT Summit 2021 - LoResMT

  3. EXPLAINABLE CED: A Dataset for Explainable Critical Error Detection in Machine Translation
    Dahyun Jung, Sugyeong Eo, Chanjun Park, Heuiseok Lim
    NAACL 2024 Student Research Workshop

  4. Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline
    Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
    ICML 2023 - DataPerf workshop

  5. Synthetic Alone: Exploring the Dark Side of Synthetic Data for Grammatical Error Correction
    Chanjun Park, Seonmin Koo, Seolhwa Lee, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
    ICML 2023 - DataPerf workshop

  6. Focus on FoCus: Is FoCus focused on Context, Knowledge and Persona?
    SeungYoon Lee, Jungseob Lee, Chanjun Park, Sugyeong Eo, Hyeonseok Moon, Jaehyung Seo, Jeongbae Park, Heuiseok Lim
    COLING 2022 - The 1st Workshop on Customized Chat Grounding Persona and Knowledge

  7. A Self-Supervised Automatic Post-Editing Data Generation Tool
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Seungjun Lee, Heuiseok Lim
    ICML 2022 – DataPerf workshop

  8. How should human translation coexist with NMT? Efficient tool for building high quality parallel corpus
    Chanjun Park, Seolhwa Lee, Hyeonseok Moon, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim
    NeurIPS 2021 - Data-centric AI (DCAI) workshop

  9. Automatic Knowledge Augmentation for Generative Commonsense Reasoning
    Jaehyung Seo, Chanjun Park, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
    NeurIPS 2021 - Data-centric AI (DCAI) workshop

  10. BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text
    Chanjun Park, Jaehyung Seo, Seolhwa Lee, Chanhee Lee, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim
    ACL 2021 -WAT(Workshop on Asian Translation) 2021 Workshop

International Journal (SCI/SCIE)

  1. Word-level Quality Estimation for Korean-English Neural Machine Translation
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim
    IEEE Access, 2022

  2. Comparative Analysis of Current Approaches to Quality Estimation for Neural Machine Translation
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim
    Applied Sciences, 2021

  3. Empirical Analysis of Parallel Corpora and In-Depth Analysis Using LIWC
    Chanjun Park, Midan Shim, Sugyeong Eo (Co-author), Seolhwa Lee, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim
    Applied Sciences, 2021

  4. Exploiting Hanja-based Resources in Processing Korean Historic Documents Written by Common Literati
    Hyeonseok Moon, Myunghoon Kang, Jaehyung Seo, Sugyeong Eo, Chanjun Park, Yeongwook Yang, Heuiseok Lim IEEE Access, 2024

  5. Uncovering the Risks and Drawbacks Associated with the Use of Synthetic Data for Grammatical Error Correction
    Seonmin Koo, Chanjun Park, Seolhwa Lee, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
    IEEE Access, 2023

  6. Doubts on the reliability of parallel corpus filtering
    Hyeonseok Moon, Chanjun Park, Seonmin Koo, Jungseob Lee, Seungjun Lee, Jaehyung Seo, Sugyeong Eo, Yoonna Jang, Hyunjoong Kim, Hyoung-gyu Lee, Heuiseok Lim
    ESWA, 2023

  7. A Survey on Evaluation Metrics for Machine Translation
    Seungjun Lee, Jungseob Lee, Hyeonseok Moon, Chanjun Park, Jaehyung Seo, Sugyeong Eo,Seonmin Koo, Heuiseok Lim
    Mathematics, 2023

  8. Enhancing Machine Translation Quality Estimation via Fine-grained Error Analysis and Large Language Model
    Dahyun Jung, Chanjun Park, Sugyeong Eo, Heuiseok Lim
    Mathematics, 2023

  9. Plain Template Insertion: Korean-Prompt-based Engineering for Few-shot Learners
    Jaehyung Seo, Hyeonseok Moon, Chanhee Lee, Sugyeong Eo, Chanjun Park, Jihoon Kim, Changwoo Chun, Heuiseok Lim
    IEEE Access, 2022

  10. PU-GEN: Enhancing Generative Commonsense Reasoning for Language Models with Human-Centered Knowledge
    Jaehyung Seo, Dongsuk Oh, Sugyeong Eo, Chanjun Park, Kisu Yang, Hyeonseok Moon, Kinam Park, Heuiseok Lim
    Knowledge-Based Systems, 2022

  11. BERTOEIC: Solving TOEIC Problems Using Simple and Efficient Data Augmentation Techniques with Pretrained Transformer Encoders
    Jeongwoo Lee, Hyeonseok Moon, Chanjun Park, Jaehyung Seo, Sugyeong Eo, Heuiseok Lim
    Applied Sciences, 2022

  12. Return on Advertising Spend Prediction with Task Decomposition-based LSTM Model
    Hyeonseok Moon, Taemin Lee, Jaehyung Seo, Chanjun Park, Sugyeong Eo, Imatitikua D. AIyanyo, Jeongbae Park, Aram So, Kyoungwha Ok, Kinam Park
    Mathematics, 2022

  13. Dense-to-Question and Sparse-to-Answer: Hybrid Retriever System for Industrial Frequently Asked Questions
    Jaehyung Seo, Taemin Lee, Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Imatitikua D AIyanyo, Kinam Park, Aram So, Sungmin Ahn, Jeongbae Park
    Mathematics, 2022

  14. Mimicking Infants’ Bilingual Language Acquisition for Domain Specialized Neural Machine Translation
    Chanjun Park, Woo-Young Go, Sugyeong Eo, Hyeonseok Moon, Seolhwa Lee, Heuiseok Lim
    IEEE Access, 2022

  15. An Automatic Post Editing with Efficient and Simple Data Generation Method
    Hyeonseok Moon, Chanjun Park, Jaehyung Seo, Sugyeong Eo, Heuiseok Lim
    IEEE Access, 2022

  16. An Empirical Study on Automatic Post Editing for Neural Machine Translation
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim
    IEEE Access, 2021

Domestic Conference

  1. A Method for Efficient Ensemble of Large Language Model (효율적인 거대 언어모델 앙상블을 위한 점진적 기법)
    Sugyeong Eo, Chanjun Park, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2024

  2. A Study on Proficiency in Solving Riddles of Large Language Models (초거대 언어모델의 재치에 관한 고찰: 수수께끼 해결 능력을 중심으로)
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2023

  3. KoCED: English-Korean Critical Error Detection Dataset (KoCED: 윤리 및 사회적 문제를 초래하는 기계번역 오류 탐지를 위한 학습 데이터셋)
    Sugyeong Eo, Suwon Choi, Seonmin Koo, Dahyun Jung, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Jeongbae Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  4. Word-level Korean-English Quality Estimation (단어 수준 한국어-영어 기계번역 품질 예측)
    Sugyeong Eo, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2021

  5. Design Neural Machine Translation Model Combining External Symbolic Knowledge (심볼릭 지식 정보를 결합한 뉴럴기계번역 모델 설계)
    Sugyeong Eo, Chanjun Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2020

  6. Mixture of Models: Towards Effective Domain Expert Ensemble of Large Language Models (Mixture of Models: 거대 언어모델 기반 효과적 도메인 전문가 앙상블 기법 연구)
    Gyuho Shim, Sugyeong Eo, Seongtae Hong, Jinsung Kim, Sangyoon Jun, Hoondong Kim, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2024

  7. Empirical Study on the Hallucination of Large Language Models Derived by the Sentence-Closing Ending (어체에 따른 초거대언어모델의 한국어 환각 현상 분석)
    Hyeonseok Moon, Sugyeong Eo, Jaehyung Seo, Chanjun Park, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2023

  8. Critical Error Span Detection Model of Korean Machine Translation (한국어 기계 번역에서의 품질 검증을 위한 치명적인 오류 범위 탐지 모델)
    Dahyun Jung, Seungyoon Lee, Sugyeong Eo, Chanjun Park, Jaewook Lee, Kinam Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2023

  9. Korean Commonsense Reasoning Evaluation for Large Language Models (거대언어모델을 위한 한국어 상식추론 기반 평가)
    Jaehyung Seo, Chanjun Park, Hyeonseok Moon, Sugyeong Eo, Aram So, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2023

  10. SaJuTeller: Conditional Generation Deep-Learning based Fortune Telling Model (SaJuTeller: 조건부 생성 모델을 기반으로 한 인공지능 사주 풀이 모델)
    Hyeonseok Moon, Jungseob Lee, Jaehyung Seo, Sugyeong Eo, Chanjun Park, Woohyeon Kim, Jeongbae Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  11. Automatic Generation of Training Data for Korean Speech Recognition Post-Processor (한국어 음성인식 후처리기를 위한 학습 데이터 자동 생성 방안)
    Seonmin Koo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  12. SRLev-BIH: An Evaluation Metric for Korean Generative Commonsense Reasoning (SRLev-BIH: 한국어 일반 상식 추론 및 생성 능력 평가 지표)
    Jaehyung Seo, Yoonna Jang, Jaewook Lee, Hyeonseok Moon, Sugyeong Eo, Chanjun Park, Aram So, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  13. A Synthetic Dataset for Korean Knowledge Graph-to-Text Generation (한국어 지식 그래프-투-텍스트 생성을 위한 데이터셋 자동 구축)
    Dahyun Jung, Seungyoon Lee, Seungjun Lee, Jaehyung Seo, Sugyeong Eo, Chanjun Park, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  14. Verification of the Domain Specialized Automatic Post Editing Model (도메인 특화 기계번역 사후교정 모델 검증 연구)
    Hyeonseok Moon, Chanjun Park, Jaehyeong Seo, Sugyeong Eo, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2021

  15. BackTranScription (BTS)-based Jeju Automatic Speech Recognition Post-processor Research (BackTranScription (BTS)기반 제주어 음성인식 후처리기 연구)
    Chanjun Park, Jaehyung Seo, Seolhwa Lee, Heonseok Moon, Sugyeong Eo, Yoonna Jang, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2021

  16. Kommongen: A Dataset for Korean Generative Commonsense Reasoning Evaluation (KommonGen: 한국어 생성 모델의 상식 추론 평가 데이터셋)
    Jaehyung Seo, Chanjun Park, Hyeonseok Moon, Sugyeong Eo, Myunghoon Kang, Seounghoon Lee, Heuiseok Lim Annual Conference on Human and Language Technology, 2021

  17. Semi-supervised GPT2 for News Article Recommendation with Curriculum Learning (준 지도 학습과 커리큘럼 학습을 이용한 유사 기사 추천 모델)
    Jaehyung Seo, Dongsuk Oh, Sugyeong Eo, Sungjin Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2020

Domestic Journal

  1. Study on Zero-shot based Quality Estimation (Zero-Shot 기반 기계번역 품질 예측 연구)
    Sugyeong Eo, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  2. Research on Subword Tokenization of Korean Neural Machine Translation and Proposal for Tokenization Method to Separate Jongsung from Syllables (한국어 인공신경망 기계번역의 서브 워드 분절 연구 및 음절 기반 종성 분리 토큰화 제안)
    Sugyeong Eo, Park Chanjun, Hyeonseok Moon, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  3. Research on Recent Quality Estimation (최신 기계번역 품질 예측 연구)
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim Journal of the Korea Convergence Society, 2021

  4. Policy-based performance comparison study of Real-time Simultaneous Translation (실시간 동시통번역의 정책기반 성능 비교 연구)
    Jungseob Lee, Hyeonseok Moon, Chanjun Park, Jaehyung Seo, Sugyeong Eo, Seungjun Lee, Seonmin Koo, Heuiseok Lim
    Journal of the Korea Convergence Society, 2022

  5. Study on Decoding Strategies in Neural Machine Translation (인공신경망 기계번역에서 디코딩 전략에 대한 연구)
    Jaehyung Seo, Chanjun Park, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  6. A Study on Verification of Back TranScription (BTS)-based Data Construction (Back TranScription(BTS)기반 데이터 구축 검증 연구)
    Chanjun Park, Jaehyung Seo, Seolhwa Lee, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  7. The Verification of the Transfer Learning-based Automatic Post-Editing Model (전이학습 기반 기계번역 사후교정 모델 검증)
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  8. Recent Automatic Post Editing Research (최신 기계번역 사후 교정 연구)
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim
    Journal of Digital Convergence, 2021

  9. Filter-mBART based Neural Machine Translation Using Parallel Corpus Filtering (병렬 말뭉치 필터링을 적용한 Filter-mBART기반 기계번역 연구)
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, JeongBae Park, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  10. A Study on Performance Improvement Considering the Balance Between Corpus in Neural Machine Translation (인공신경망 기계번역에서 말뭉치 간의 균형성을 고려한 성능 향상 연구)
    Chanjun Park, Kinam Park, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

Patents

1. 신뢰도 기반 효율적인 다중 에이전트 협력을 위한 장치 및 방법
Heuiseok Lim, Sugyeong Eo
국내특허출원완료 (10-2025-0087619 ), 2025

2. DEVICE AND METHOD FOR GENERATION OF DIVERSE QUESTION-ANSWER PAIR
Heuiseok Lim, Sugyeong Eo
해외특허출원완료 (18/585,166)

3. 클러스터 정보 기반 전문가 혼합 학습 시스템
Heuiseok Lim, Sugyeong Eo
국내특허출원완료 (10-2024-0194707)

4. 유형 다양성을 고려한 교육용 질의응답쌍 생성 시스템
Heuiseok Lim, Sugyeong Eo, Hyeonseok Moon, Jinsung Kim, Yuna Hur, Jeongwook Kim
국내특허출원완료 (10-2024-0009742)

5. 프롬프트를 활용한 기계번역 결과 치명적 오류 감지 방법 및 장치
Heuiseok Lim, Sugyeong Eo
국내특허출원완료 (10-2022-0161686)

6. 기계 번역 품질 예측을 위한 학습 데이터 생성 장치 및 방법
Heuiseok Lim, Sugyeong Eo, Hyeonseok Moon, Chanjun Park
국내특허등록완료 (10-2021-0156657(출원)/10-2593447(등록))

Book Chapters

Natural Language Processing Bible
HeuiSeok Lim, Korea University NLP&AI Lab
Human Science

Honors & Awards

  • Received Korea University Best Paper Award 2023
  • Received Naver Ph.D. Fellowship 2022
  • 1st place in Quality Estimation Shared Task 2022 - Sentence-level “Critical Error Detection”, WMT 2022 (EMNLP 2022)
  • Best Paper Award, The 34th Annual Conference on Human & Cognitive Language Technology (HCLT2022)
    ▶️ Paper: KoCED: 윤리 및 사회적 문제를 초래하는 기계번역 오류 탐지를 위한 학습 데이터셋 (KoCED: English-Korean Critical Error Detection Dataset)
  • Best Paper Award, The 33rd Annual Conference on Human & Cognitive Language Technology (HCLT2021) - NLP Application 2 Section
    ▶️ Paper: KommonGen: 한국어 생성 모델의 상식 추론 평가 데이터셋 (KommonGen: A Dataset for Korean Generative Commonsense Reasoning Evaluation)
  • Ranked 4th on the CommonGen 1.1 Leaderboard (Nov. 2022 Ranked 7th, CommonGen 1.1)

Invited Talk

  • Basic practice of natural language processing for everyone
    PLACE: Hankuk University of Foreign Studies (2022.07)