About Me

Hello, I’m Sugyeong Eo, a Ph.D. candidate in Computer Science and Engineering at Korea University.
I am a member of the NLP & AI Lab under the supervision of Prof. Heuiseok Lim.
Feel free to reach out if you’d like to connect or collaborate!

Research Interest

Natural Language Processing, Language Modeling, Neural Machine Translation, Quality Estimation, Question Generation

Education

2020.09 - 2026.02: Graduate, Major in Computer Science and Engineering at Korea University
2016.02 - 2020.08: Undergraduate, Received B.A. degree, Major in Linguistics and Cognitive Science(1st), Language and Technology(2nd) at Hankuk University of Foreign Studies (HUFS)

Academic Services

Program committee: NAACL 2022-Industry Track
Program committee: ACL 2023
Program committee: EMNLP 2023
Program committee: NAACL 2024
Program committee: ACL/EMNLP 2025
Program committee: AAAI/ACL 2026

Publications

Preprints

Top Conference

Top Conference (Workshop)

International Journal (SCI/SCIE)

Domestic Conference

  • A Method for Efficient Ensemble of Large Language Model (효율적인 거대 언어모델 앙상블을 위한 점진적 기법)
    Sugyeong Eo, Chanjun Park, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2024

  • A Study on Proficiency in Solving Riddles of Large Language Models (초거대 언어모델의 재치에 관한 고찰: 수수께끼 해결 능력을 중심으로)
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2023

  • KoCED: English-Korean Critical Error Detection Dataset (KoCED: 윤리 및 사회적 문제를 초래하는 기계번역 오류 탐지를 위한 학습 데이터셋)
    Sugyeong Eo, Suwon Choi, Seonmin Koo, Dahyun Jung, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Jeongbae Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  • Word-level Korean-English Quality Estimation (단어 수준 한국어-영어 기계번역 품질 예측)
    Sugyeong Eo, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2021

  • Design Neural Machine Translation Model Combining External Symbolic Knowledge (심볼릭 지식 정보를 결합한 뉴럴기계번역 모델 설계)
    Sugyeong Eo, Chanjun Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2020

  • Mixture of Models: Towards Effective Domain Expert Ensemble of Large Language Models (Mixture of Models: 거대 언어모델 기반 효과적 도메인 전문가 앙상블 기법 연구)
    Gyuho Shim, Sugyeong Eo, Seongtae Hong, Jinsung Kim, Sangyoon Jun, Hoondong Kim, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2024

  • Empirical Study on the Hallucination of Large Language Models Derived by the Sentence-Closing Ending (어체에 따른 초거대언어모델의 한국어 환각 현상 분석)
    Hyeonseok Moon, Sugyeong Eo, Jaehyung Seo, Chanjun Park, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2023

  • Critical Error Span Detection Model of Korean Machine Translation (한국어 기계 번역에서의 품질 검증을 위한 치명적인 오류 범위 탐지 모델)
    Dahyun Jung, Seungyoon Lee, Sugyeong Eo, Chanjun Park, Jaewook Lee, Kinam Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2023

  • Korean Commonsense Reasoning Evaluation for Large Language Models (거대언어모델을 위한 한국어 상식추론 기반 평가)
    Jaehyung Seo, Chanjun Park, Hyeonseok Moon, Sugyeong Eo, Aram So, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2023

  • SaJuTeller: Conditional Generation Deep-Learning based Fortune Telling Model (SaJuTeller: 조건부 생성 모델을 기반으로 한 인공지능 사주 풀이 모델)
    Hyeonseok Moon, Jungseob Lee, Jaehyung Seo, Sugyeong Eo, Chanjun Park, Woohyeon Kim, Jeongbae Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  • Automatic Generation of Training Data for Korean Speech Recognition Post-Processor (한국어 음성인식 후처리기를 위한 학습 데이터 자동 생성 방안)
    Seonmin Koo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  • SRLev-BIH: An Evaluation Metric for Korean Generative Commonsense Reasoning (SRLev-BIH: 한국어 일반 상식 추론 및 생성 능력 평가 지표)
    Jaehyung Seo, Yoonna Jang, Jaewook Lee, Hyeonseok Moon, Sugyeong Eo, Chanjun Park, Aram So, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  • A Synthetic Dataset for Korean Knowledge Graph-to-Text Generation (한국어 지식 그래프-투-텍스트 생성을 위한 데이터셋 자동 구축)
    Dahyun Jung, Seungyoon Lee, Seungjun Lee, Jaehyung Seo, Sugyeong Eo, Chanjun Park, Yuna Hur, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2022

  • Verification of the Domain Specialized Automatic Post Editing Model (도메인 특화 기계번역 사후교정 모델 검증 연구)
    Hyeonseok Moon, Chanjun Park, Jaehyeong Seo, Sugyeong Eo, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2021

  • BackTranScription (BTS)-based Jeju Automatic Speech Recognition Post-processor Research (BackTranScription (BTS)기반 제주어 음성인식 후처리기 연구)
    Chanjun Park, Jaehyung Seo, Seolhwa Lee, Heonseok Moon, Sugyeong Eo, Yoonna Jang, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2021

  • Kommongen: A Dataset for Korean Generative Commonsense Reasoning Evaluation (KommonGen: 한국어 생성 모델의 상식 추론 평가 데이터셋)
    Jaehyung Seo, Chanjun Park, Hyeonseok Moon, Sugyeong Eo, Myunghoon Kang, Seounghoon Lee, Heuiseok Lim Annual Conference on Human and Language Technology, 2021

  • Semi-supervised GPT2 for News Article Recommendation with Curriculum Learning (준 지도 학습과 커리큘럼 학습을 이용한 유사 기사 추천 모델)
    Jaehyung Seo, Dongsuk Oh, Sugyeong Eo, Sungjin Park, Heuiseok Lim
    Annual Conference on Human and Language Technology, 2020

Domestic Journal

  • Study on Zero-shot based Quality Estimation (Zero-Shot 기반 기계번역 품질 예측 연구)
    Sugyeong Eo, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  • Research on Subword Tokenization of Korean Neural Machine Translation and Proposal for Tokenization Method to Separate Jongsung from Syllables (한국어 인공신경망 기계번역의 서브 워드 분절 연구 및 음절 기반 종성 분리 토큰화 제안)
    Sugyeong Eo, Park Chanjun, Hyeonseok Moon, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  • Research on Recent Quality Estimation (최신 기계번역 품질 예측 연구)
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim Journal of the Korea Convergence Society, 2021

  • Policy-based performance comparison study of Real-time Simultaneous Translation (실시간 동시통번역의 정책기반 성능 비교 연구)
    Jungseob Lee, Hyeonseok Moon, Chanjun Park, Jaehyung Seo, Sugyeong Eo, Seungjun Lee, Seonmin Koo, Heuiseok Lim
    Journal of the Korea Convergence Society, 2022

  • Study on Decoding Strategies in Neural Machine Translation (인공신경망 기계번역에서 디코딩 전략에 대한 연구)
    Jaehyung Seo, Chanjun Park, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  • A Study on Verification of Back TranScription (BTS)-based Data Construction (Back TranScription(BTS)기반 데이터 구축 검증 연구)
    Chanjun Park, Jaehyung Seo, Seolhwa Lee, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  • The Verification of the Transfer Learning-based Automatic Post-Editing Model (전이학습 기반 기계번역 사후교정 모델 검증)
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  • Recent Automatic Post Editing Research (최신 기계번역 사후 교정 연구)
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim
    Journal of Digital Convergence, 2021

  • Filter-mBART based Neural Machine Translation Using Parallel Corpus Filtering (병렬 말뭉치 필터링을 적용한 Filter-mBART기반 기계번역 연구)
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, JeongBae Park, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

  • A Study on Performance Improvement Considering the Balance Between Corpus in Neural Machine Translation (인공신경망 기계번역에서 말뭉치 간의 균형성을 고려한 성능 향상 연구)
    Chanjun Park, Kinam Park, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim
    Journal of the Korea Convergence Society, 2021

Patents

신뢰도 기반 효율적인 다중 에이전트 협력을 위한 장치 및 방법
Heuiseok Lim, Sugyeong Eo
국내특허출원완료 (10-2025-0087619 ), 2025

DEVICE AND METHOD FOR GENERATION OF DIVERSE QUESTION-ANSWER PAIR
Heuiseok Lim, Sugyeong Eo
해외특허출원완료 (18/585,166)

클러스터 정보 기반 전문가 혼합 학습 시스템
Heuiseok Lim, Sugyeong Eo
국내특허출원완료 (10-2024-0194707)

유형 다양성을 고려한 교육용 질의응답쌍 생성 시스템
Heuiseok Lim, Sugyeong Eo, Hyeonseok Moon, Jinsung Kim, Yuna Hur, Jeongwook Kim
국내특허출원완료 (10-2024-0009742)

프롬프트를 활용한 기계번역 결과 치명적 오류 감지 방법 및 장치
Heuiseok Lim, Sugyeong Eo
국내특허출원완료 (10-2022-0161686)

기계 번역 품질 예측을 위한 학습 데이터 생성 장치 및 방법
Heuiseok Lim, Sugyeong Eo, Hyeonseok Moon, Chanjun Park
국내특허등록완료 (10-2021-0156657(출원)/10-2593447(등록))

Book Chapters

Natural Language Processing Bible
HeuiSeok Lim, Korea University NLP&AI Lab
Human Science

Honors & Awards

  • Received Korea University Best Paper Award 2023
  • Received Naver Ph.D. Fellowship 2022
  • 1st place in Quality Estimation Shared Task 2022 - Sentence-level “Critical Error Detection”, WMT 2022 (EMNLP 2022)
  • Best Paper Award, The 34th Annual Conference on Human & Cognitive Language Technology (HCLT2022)
    ▶️ Paper: KoCED: 윤리 및 사회적 문제를 초래하는 기계번역 오류 탐지를 위한 학습 데이터셋 (KoCED: English-Korean Critical Error Detection Dataset)
  • Best Paper Award, The 33rd Annual Conference on Human & Cognitive Language Technology (HCLT2021) - NLP Application 2 Section
    ▶️ Paper: KommonGen: 한국어 생성 모델의 상식 추론 평가 데이터셋 (KommonGen: A Dataset for Korean Generative Commonsense Reasoning Evaluation)
  • Ranked 4th on the CommonGen 1.1 Leaderboard (Nov. 2022 Ranked 7th, CommonGen 1.1)

Invited Talk

  • Basic practice of natural language processing for everyone
    PLACE: Hankuk University of Foreign Studies (2022.07)