Seokhwan Kim
Staff Software Engineer, Google Cloud AI

Contact

1175 Borregas Ave, Sunnyvale, CA 94089, USA

+1 (650) 426-9701

http://seokhwankim.com/

Click HERE to download my curriculum vitae.

Research Interests

Natural Language Processing

Spoken Dialogue Systems

Natural Language Understanding

Large Language Model

LLM Agents

Work Experiences

Staff Software Engineer

Jun 2024 -

Google

Cloud AI

Principal Applied Scientist

Oct 2021 - May 2024

Amazon

Alexa AI

Senior Applied Scientist

Jun 2019 - Oct 2021

Amazon

Alexa AI

NLP Research Scientist

Jul 2017 - May 2019

Adobe Research

Creative Intelligence Lab.

Research Scientist

Jan 2012 - Jul 2017

Institute for Infocomm Research (I2R)

Dialogue Technology Lab.,
Human Language Technology Department

Research Intern

Apr 2011 - Aug 2011

Max Planck Institute for Informatics (MPI-INF)

Databases and Information Systems
Advisor: Prof. Gerhard Weikum

Education

Ph.D.

Sep 2005 - Feb 2012

Pohang University of Science and Technology (POSTECH)

Computer Science and Engineering

Dissertation: Cross-Lingual Weakly-Supervised Learning of Semantic Relations
Advisor: Gary Geunbae Lee

B.S.

Mar 2001 - Aug 2005

Pohang University of Science and Technology (POSTECH)

Computer Science and Engineering

Professional Experiences

Program Chair

2025

SIGDIAL 2025

Virtual Infrastructure Co-Chair

2022

COLING 2022

Challenge & Demonstration Co-Chair

2022

IEEE SLT 2022

Panelist

2022

NSF Proposal Review Panel

Speech and Language Processing Technical Committee (SLTC)

2021-2023

IEEE Signal Processing Society

Board Member

2021-2023

Special Interest Group on Discourse and Dialogue (SIGdial)

Challenge Chair

2019

The Eighth Dialog System Technology Challenge (DSTC8)

Organizing Committee

2017-2020

Dialog System Technology Challenges (DSTC6-9)

Organizing Committee

2015-2016

Dialog State Tracking Challenges (DSTC4-5)

Local Organizing Committee Co-chair

2013

The Ninth Asia Information Retrieval Societies Conference (AIRS 2013)

Publicity Committee Member

2012

ACL Special Workshop: Rediscovering 50 Years of Discoveries

Area Chair

EMNLP 2020, IEEE SLT 2021

Reviewer

AAAI, ACL, ACM-SIGIR, AI Magazine, CSL, CoNLL, EACL, EMNLP, ICLR, IEEE ASRU, IEEE ICASSP, IEEE SLT, IEEE/ACM TASLP, IJCAI, IJCNLP, IWSDS, Interspeech, NAACL-HLT, NeurIPS

Tutorials and Invited Talks

2023

"How Robust R U?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations.

Seokhwan Kim

Invited Talk @ Hugging Face Audio Transformers Course Launch Event, 14 Jun 2023.

2020

Summary of the Eighth Dialog System Technology Challenge (DSTC8).

Seokhwan Kim

Invited Talk @ The Third Workshop on Reasoning and Learning for Human-Machine Dialogues (DEEP-DIAL 2020), 08 Feb 2020.

Just ask: An Interactive Learning Framework for Vision and Language Navigation.

Seokhwan Kim

Invited Talk @ AI Assistant Summit (RE•WORK), 30 Jan 2020.

2016

Natural Language in Human-Robot Interaction.

[slides]

Rafael E. Banchs, Seokhwan Kim, Luis Fernando D'Haro, Andreea I. Niculescu

Tutorial @ The 4th International Conference on Human-Agent Interaction (HAI 2016), 04 Oct 2016.

Publications

2024

[C71]

Redefining Proactivity for Information Seeking Dialogue.

[PDF]

[bib]

Jing Yang Lee, Seokhwan Kim, Kartik Mehta, Jiun-Yu Kao, Yu-Hsiang Lin, Arpit Gupta

Proceedings of the Second Workshop on Social Influence in Conversations (SICon 2024), Miami, Nov 2024.

2023

[C70]

CESAR: Automatic Induction of Compositional Instructions for Multi-turn Dialogs.

Taha Aksu, Devamanyu Hazarika, Shikib Mehri, Seokhwan Kim, Dilek Hakkani-Tur, Yang Liu, Mahdi Namazifar

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore, Dec 2023.

[C69]

"What do others think?": Task-Oriented Conversational Modeling with Subjective Knowledge.

[preprint]

Chao Zhao, Spandana Gella, Seokhwan Kim, Di Jin, Devamanyu Hazarika, Alexandros Papangelis, Behnam Hedayatnia, Mahdi Namazifar, Yang Liu, Dilek Hakkani-Tur

Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2023), Prague, Sep 2023.

[C68]

Investigating the Representation of Open Domain Dialogue Context for Transformer Models.

Vishakh Padmakumar, Behnam Hedayatnia, Di Jin, Patrick Lange, Seokhwan Kim, Nanyun Peng, Yang Liu, Dilek Hakkani-Tur

Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2023), Prague, Sep 2023.

[J10]

Overview of the Tenth Dialog System Technology Challenge: DSTC10.

[link]

Koichiro Yoshino, Yun-Nung Chen, Paul Crook, Satwik Kottur, Jinchao Li, Behnam Hedayatnia, Seungwhan Moon, Zhengcong Fei, Zekang Li, Jinchao Zhang, Yang Feng, Jie Zhou, Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Dilek Hakkani-Tur, Babak Damavandi, Alborz Geramifard, Chiori Hori, Ankit Shah, Chen Zhang, Haizhou Li, João Sedoc, Luis F. D'Haro, Rafael Banchs, Alexander Rudnicky

IEEE/ACM Transactions on Audio, Speech, and Language Processing, Jul 2023.

[C67]

Identifying Entrainment in Task-oriented Conversations.

Run Chen, Yang Liu, Alexandros Papangelis, Seokhwan Kim, Julia Hirschberg, Dilek Hakkani-Tur

Proceedings of the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023), Rhodes Island, Jun 2023.

[C66]

PLACES: Prompting Language Models for Social Conversation Synthesis.

[preprint]

Maximillian Chen, Alexandros Papangelis, Chenyang Tao, Seokhwan Kim, Andy Rosenbaum, Yang Liu, Zhou Yu, Dilek Hakkani-Tur

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), Dubrovnik, May 2023. (Findings)

[C65]

Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information.

[preprint]

Yen Ting Lin, Alexandros Papangelis, Seokhwan Kim, Sungjin Lee, Devamanyu Hazarika, Mahdi Namazifar, Di Jin, Yang Liu, Dilek Hakkani-Tur

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), Dubrovnik, May 2023.

2022

[C64]

Knowledge-Grounded Conversational Data Augmentation with Generative Conversational Networks.

[PDF]

[bib]

Yen-Ting Lin, Alexandros Papangelis, Seokhwan Kim, Dilek Hakkani-Tur

Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), Edinburgh, Sep 2022.

[C63]

Think Before You Speak: Explicitly Generating Implicit Commonsense Knowledge for Response Generation.

[PDF]

[bib]

Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), vol. 1, pp. 1237-1252, Dublin, May 2022.

[J9]

Towards Textual Out-of-Domain Detection Without In-Domain Labels.

[link]

Di Jin, Shuyang Gao, Seokhwan Kim, Yang Liu, Dilek Hakkani-Tur

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30 (1386-1395), Mar 2022.

[C62]

Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations Track at DSTC10.

[PDF]

[bib]

Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Behnam Hedayatnia, Karthik Gopalakrishnan, Dilek Hakkani-Tur

Proceedings of the AAAI-22 Workshop on Dialog System Technology Challenges (DSTC10), Virtual, Feb 2022.

2021

[C61]

Towards Zero and Few-shot Knowledge-seeking Turn Detection in Task-orientated Dialogue Systems. (Best Paper)

Di Jin, Shuyang Gao, Seokhwan Kim, Yang Liu, Dilek Hakkani-Tur

Proceedings of the Efficient Natural Language and Speech Processing NeurIPS Workshop (ENLSP 2021), Virtual, Dec 2021.

[C60]

"How robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations.

[preprint]

Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Behnam Hedayatnia, Karthik Gopalakrishnan, Dilek Hakkani-Tur

Proceedings of the 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2021), Virtual, Dec 2021.

[C59]

Rome was built in 1776: A case study on factual correctness in knowledge-grounded response generation.

[preprint]

Sashank Santhanam, Behnam Hedayatnia, Spandana Gella, Aishwarya Padmakumar, Seokhwan Kim, Yang Liu, Dilek Hakkani-Tur

Proceedings of the 3rd Workshop on NLP for Conversational AI (NLP4ConvAI), Punta Cana, Nov 2021.

[C58]

Can I Be of Further Assistance? Using Unstructured Knowledge Access to Improve Task-oriented Conversational Modeling.

[PDF]

[bib]

Di Jin, Seokhwan Kim, Dilek Hakkani-Tur

Proceedings of the 1st DialDoc Workshop at ACL-IJCNLP (DialDoc), Virtual, Aug 2021.

[J8]

Editorial: Special Issue on the Eighth Dialog System Technology Challenge.

[link]

Seokhwan Kim, Hannes Schulz, Chulaka Gunasekara, Chiori Hori, Abhinav Rastogi, Luis Fernando D’Haro

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29 (2434-2436), Aug 2021.

[C57]

Commonsense-Focused Dialogues for Response Generation: An Empirical Study.

[PDF]

[bib]

Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2021), Singapore, Jul 2021.

[C56]

Generative Conversational Networks.

[PDF]

[bib]

Alexandros Papangelis, Karthik Gopalakrishnan, Aishwarya Padmakumar, Seokhwan Kim, Gokhan Tur, Dilek Hakkani-Tur

Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2021), Singapore, Jul 2021.

[J7]

Overview of the Eighth Dialog System Technology Challenge: DSTC8.

[link]

Seokhwan Kim, Michel Galley, Chulaka Gunasekara, Sungjin Lee, Adam Atkinson, Baolin Peng, Hannes Schulz, Jianfeng Gao, Jinchao Li, Mahmoud Adada, Minlie Huang, Luis Lastras, Jonathan K. Kummerfeld, Walter S. Lasecki, Chiori Hori, Anoop Cherian, Tim K. Marks, Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29 (2529-2540), May 2021.

[C55]

Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access Track in DSTC9.

[preprint]

Seokhwan Kim, Mihail Eric, Behnam Hedayatnia, Karthik Gopalakrishnan, Yang Liu, Chao-Wei Huang, Dilek Hakkani-Tur

Proceedings of the AAAI-21 Workshop on Dialog System Technology Challenges (DSTC9), Virtual, Feb 2021.

2020

[C54]

Policy-Driven Neural Response Generation for Knowledge-Grounded Dialog Systems.

[preprint]

Behnam Hedayatnia, Karthik Gopalakrishnan, Seokhwan Kim, Yang Liu, Mihail Eric, Dilek Hakkani-Tur

Proceedings of the 13th International Conference on Natural Language Generation (INLG 2020), Dublin, Dec 2020.

[C53]

Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access.

[PDF]

[bib]

Seokhwan Kim, Mihail Eric, Karthik Gopalakrishnan, Behnam Hedayatnia, Yang Liu, Dilek Hakkani-Tur

Proceedings of the 21st Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2020), pp. 278-289, Boise, Jul 2020.

[C52]

Video Question Answering on Screencast Tutorials.

[PDF]

[bib]

Wentian Zhao, Seokhwan Kim, Ning Xu, Hailin Jin

Proceedings of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI 2020), pp. 1061-1068, Yokohama, Jul 2020.

(12.6% acceptance)

[C51]

Screencast Tutorial Video Understanding.

[PDF]

Kunpeng Li, Chen Fang, Zhaowen Wang, Seokhwan Kim, Hailin Jin, Yun Fu

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020), pp. 12526-12535, Seattle, Jun 2020.

(22.1% acceptance)

[C50]

TutorialVQA: Question Answering Dataset for Tutorial Videos.

[PDF]

[bib]

Anthony Colas, Seokhwan Kim, Franck Dernoncourt, Siddhesh Gupte, Zhe Wang, Doo Soon Kim

Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 5450–5455, Marseille, May 2020.

[C49]

Just ask: An Interactive Learning Framework for Vision and Language Navigation.

[preprint]

Ta-Chung Chi, Mihail Eric, Seokhwan Kim, Minmin Shen, Dilek Hakkani-tur

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), New York, Feb 2020.

(20.6% acceptance)

2019

[C48]

The Eighth Dialog System Technology Challenge.

[preprint]

[poster]

Proceedings of the 3rd Conversational AI workshop: Today's Practice and Tomorrow's Potential (ConvAI 2019), Vancouver, Dec 2019.

[C47]

Analyzing Sentence Fusion in Abstractive Summarization.

[preprint]

Logan Lebanoff, John Muchovej, Franck Dernoncourt, Doo Soon Kim, Seokhwan Kim, Walter Chang, Fei Liu

Proceedings of the EMNLP 2019 Workshop on New Frontiers in Summarization (NewSum), Hong Kong, Nov 2019.

[C46]

Scoring Sentence Singletons and Pairs for Abstractive Summarization.

[PDF]

[bib]

Logan Lebanoff, Kaiqiang Song, Franck Dernoncourt, Doo Soon Kim, Seokhwan Kim, Walter Chang, Fei Liu

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), pp. 2175-2189, Florence, Jul 2019.

(25.7% acceptance)

[C45]

Learning Emphasis Selection for Written Text in Visual Media from Crowd-Sourced Label Distributions.

[PDF]

[bib]

Amirreza Shirani, Franck Dernoncourt, Paul Asente, Nedim Lipka, Seokhwan Kim, Jose Echevarria, Thamar Solorio

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), pp. 1167-1172, Florence, Jul 2019.

(18.2% acceptance)

[C44]

Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punctuation Restoration.

[PDF]

[poster]

Seokhwan Kim

Proceedings of the 44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019), pp. 7280-7284, Brighton, May 2019.

[J6]

Overview of the Sixth Dialog System Technology Challenge: DSTC6.

[link]

Chiori Hori, Julien Perez, Ryuichi Higasinaka, Takaaki Hori, Y-Lan Boureau, Michimasa Inaba, Yuiko Tsunomori, Tetsuro Takahashi, Koichiro Yoshino, Seokhwan Kim

Computer Speech and Language, 55 (1-25), May 2019.

[C43]

Dynamic Memory Networks for Dialogue Topic Tracking.

[PDF]

[poster]

Seokhwan Kim

Proceedings of the AAAI-19 Workshop on Dialog System Technology Challenges (DSTC7), Honolulu, Jan 2019.

2018

[C42]

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents.

[PDF]

[bib]

Arman Cohan, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Seokhwan Kim, Walter Chang, Nazli Goharian

Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2018), pp. 615-621, New Orleans, Jun 2018.

(29.4% acceptance)

[C41]

PhotoshopQuiA: A Corpus of Non-Factoid Questions and Answers for Why-Question Answering.

[PDF]

Andrei Dulceanu, Thang Le Dinh, Walter Chang, Trung Bui, Doo Soon Kim, Manh Chien Vu, Seokhwan Kim

Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 2763-2770, Miyazaki, May 2018.

2017

[C40]

Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text (Team DL2.0, ranked at 22/650).

[PDF]

Zhe Wang, Kingsley Kuan, Mathieu Ravaut, Gaurav Manek, Sibo Song, Yuan Fang, Seokhwan Kim, Nancy F. Chen, Luis Fernando D'Haro, Anh Tuan Luu, Hongyuan Zhu, Zeng Zeng, Ngai Man Cheung, Georgios Piliouras, Jie Lin, Vijay Chandrasekhar

Proceedings of the CVPR 2017 Workshop on YouTube-8M Large-Scale Video Understanding (YouTube-8M), Honolulu, Jul 2017.

[J5]

Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems.

[PDF]

Seonghan Ryu, Seokhwan Kim, Junhwi Choi, Hwanjo Yu, Gary Geunbae Lee

Pattern Recognition Letters, 88 (26-32), Mar 2017.

2016

[C39]

The Fifth Dialog State Tracking Challenge.

[PDF]

[poster]

Seokhwan Kim, Luis Fernando D'Haro, Rafael E. Banchs, Jason D. Williams, Matthew Henderson, Koichiro Yoshino

Proceedings of the 2016 IEEE Spoken Language Technology Workshop (SLT 2016), pp. 511-517, San Diego, Dec 2016.

[C38]

Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling for Dialogue Topic Tracking.

[PDF]

[bib]

[poster]

Seokhwan Kim, Rafael E. Banchs, Haizhou Li

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), pp. 963-973, Berlin, Aug 2016.

(28.0% acceptance)

[C37]

The Fourth Dialog State Tracking Challenge.

[PDF]

[slides]

Seokhwan Kim, Luis Fernando D'Haro, Rafael E. Banchs, Jason D. Williams, Matthew Henderson

Proceedings of the 7th International Workshop on Spoken Dialogue Systems (IWSDS 2016), Saariselkä, Jan 2016.

2015

[C36]

A Robust Spoken Q&A System with Scarce In-Domain Resources.

Luis Fernando D'Haro, Seokhwan Kim, Rafael E. Banchs

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2015 (APSIPA ASC 2015), Hong Kong, Dec 2015.

[C35]

Configuration of Dialogue Agent with Multiple Knowledge Sources.

Ridong Jiang, Rafael Banchs, Seokhwan Kim, Luis Fernando D'Haro, Andreea Niculescu, Kheng Hui Yeo

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2015 (APSIPA ASC 2015), Hong Kong, Dec 2015.

[C34]

Towards Improving the Performance of Vector Space Model for Chinese Frequently Asked Question.

Ridong Jiang, Seokhwan Kim, Rafael E. Banchs, Haizhou Li

Proceedings of the 19th International Conference on Asian Language Processing (IALP 2015), Suzhou, Oct 2015.

[C33]

Conversational Agent and Management Tools for Conference and Tourism Domain.

[PDF]

Luis Fernando D'Haro, Seokhwan Kim, Rafael E. Banchs

Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Dresden, Sep 2015. (Show and Tell)

[C32]

Towards Improving Dialogue Topic Tracking Performances with Wikification of Concept Mentions.

[PDF]

[bib]

[poster]

Seokhwan Kim, Rafael E. Banchs, Haizhou Li

Proceedings of the 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), pp. 124-128, Prague, Sep 2015.

[C31]

Wikification of Concept Mentions within Spoken Dialogues Using Domain Constraints from Wikipedia.

[PDF]

[bib]

[poster]

Seokhwan Kim, Rafael E. Banchs, Haizhou Li

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015), pp. 2225-2229, Lisbon, Sep 2015.

(24.0% acceptance)

[C30]

CLARA: a multifunctional virtual agent for conference support and touristic information.

[PDF]

Luis Fernando D'Haro, Seokhwan Kim, Kheng Hui Yeo, Ridong Jiang, Andreea I. Niculescu, Rafael E. Banchs, Haizhou Li

Proceedings of the 6th International Workshop on Spoken Dialog System (IWSDS 2015), Busan, Jan 2015. (Demo)

2014

[C29]

Design and Evaluation of a Conversational Agent for the Touristic Domain.

Andreea Niculescu, Kheng Hui Yeo, Luis Fernando D'haro, Seokhwan Kim, Ridong Jiang, Rafael Banchs

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2014 (APSIPA ASC 2014), Siem Reap, Dec 2014.

[C28]

An empirical evaluation of an IR-based strategy for chat-oriented dialogue systems.

Rafael E. Banchs, Seokhwan Kim

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2014 (APSIPA ASC 2014), Siem Reap, Dec 2014.

[C27]

R-cube: a dialogue agent for Restaurant Recommendation and Reservation.

Seokhwan Kim, Rafael E. Banchs

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2014 (APSIPA ASC 2014), Siem Reap, Dec 2014.

[C26]

Grammatical Error Correction Based on Learner Comprehension Model in Oral Conversation.

[link]

Kyusong Lee, Seonghan Ryu, Paul Hongsuck Seo, Seokhwan Kim, Gary Geunbae Lee

Proceedings of the 2014 IEEE Spoken Language Technology Workshop (SLT 2014), South Lake Tahoe, Dec 2014.

[C25]

SARA - Singapore's Automated Responsive Assistant for the Touristic Domain.

[PDF]

Andreea Niculescu, Seokhwan Kim, Kheng Hui Yeo, Arthur Niswar, Ridong Jiang, Rafael E. Banchs

Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), pp. 2138-2139, Singapore, Sep 2014. (Show and Tell)

[C24]

Spoken Dialogue System for Restaurant Recommendation and Reservation.

[PDF]

Rafael E. Banchs, Seokhwan Kim

Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), pp. 1488-1489, Singapore, Sep 2014. (Show and Tell)

[C23]

SARA - Singapore's Automated Responsive Assistant - A Multimodal Dialogue System for Touristic Information.

Andreea Niculescu, Rafael E. Banchs, Seokhwan Kim, Kheng Hui Yeo, Arthur Niswar, Ridong Jiang

Proceedings of the 11th International Conference on Mobile Web Information Systems (MobiWIS 2014), pp. 153-164, Barcelona, Aug 2014.

[C22]

Sequential Labeling for Tracking Dynamic Dialog States.

[PDF]

[bib]

[poster]

Seokhwan Kim, Rafael E. Banchs

Proceedings of the 15th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2014), pp. 332-336, Philadelphia, Jun 2014.

[C21]

A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia.

[PDF]

[bib]

[poster]

Seokhwan Kim, Rafael E. Banchs, Haizhou Li

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), vol. 2, pp. 19-23, Baltimore, Jun 2014.

(25.0% acceptance)

[C20]

Wikipedia-based Kernels for Dialogue Topic Tracking.

[link]

[slides]

Seokhwan Kim, Rafael E. Banchs, Haizhou Li

Proceedings of the 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), pp. 131-135, Florence, May 2014.

[C19]

AIDA: An Avatar-Supported Multimodal Dialogue System.

Arthur Niswar, Rafael E. Banchs, Ridong Jiang, Seokhwan Kim

Proceedings of the 27th International Conference on Computer Animation and Social Agents (CASA 2014), Houston, May 2014.

[J4]

Cross-Lingual Annotation Projection for Weakly-Supervised Relation Extraction.

[link]

Seokhwan Kim, Minwoo Jeong, Jonghoon Lee, Gary Geunbae Lee

ACM Transactions on Asian Language Information Processing, 13:1 (3), Feb 2014.

[C18]

Web-based Multimodal Multi-domain Spoken Dialogue System.

Ridong Jiang, Rafael E. Banchs, Seokhwan Kim, Kheng Hui Yeo, Arthur Niswar, Haizhou Li

Proceedings of the 5th International Workshop on Spoken Dialog Systems (IWSDS 2014), Napa, Jan 2014.

2013

[C17]

AIDA: Artificial Intelligent Dialogue Agent.

[PDF]

[bib]

Rafael E. Banchs, Ridong Jiang, Seokhwan Kim, Arthur Niswar, Kheng Hui Yeo

Proceedings of the 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2013), pp. 145-147, Metz, Aug 2013. (Demo)

[C16]

A Graph-based Cross-lingual Projection Approach for Spoken Language Understanding Portability to a New Language.

[link]

[poster]

Seokhwan Kim

Proceedings of the 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), pp. 8332-8336, Vancouver, May 2013.

2012

[C15]

A Two-Step Approach for Efficient Domain Selection in Multi-Domain Dialog Systems.

[slides]

Injae Lee, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee, Junhwi Choi, Seonghan Ryu, Geunbae Lee

Proceedings of the 4th International Workshop on Spoken Dialog System (IWSDS 2012), pp. 105-111, Ermenonville, Nov 2012.

[C14]

A Meta Learning Approach to Grammatical Error Correction.

[PDF]

[bib]

Hongsuck Seo, Jonghoon Lee, Seokhwan Kim, Kyusong Lee, Sechun Kang and Gary Geunbae Lee

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), vol. 2, pp. 328-332, Jeju, Jul 2012.

(20.0% acceptance)

[C13]

A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction.

[PDF]

[bib]

[slides]

Seokhwan Kim, Gary Geunbae Lee

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), vol. 2, pp. 48-53, Jeju, Jul 2012.

(20.0% acceptance)

[C12]

Seamless error correction interface for voice word processor.

[link]

Junhwi Choi, Kyungduk Kim, Sungjin Lee, Seokhwan Kim, Donghyeon Lee, Injae Lee, Gary Geunbae Lee

Proceedings of the 37th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), pp. 4973-4976, Kyoto, Mar 2012.

2011

[C11]

Multi-domain spoken dialog system for information access in mobile environment.

Junhwi Choi, Kyungduk Kim, Seokhwan Kim, Donghyeon Lee, Injae Lee, Gary Geunbae Lee

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (IEEE ASRU 2011), Hawaii, Dec 2011. (Demo)

[C10]

A Cross-lingual Annotation Projection-based Self-supervision Approach for Open Information Extraction.

[PDF]

[bib]

Seokhwan Kim, Minwoo Jeong, Jonghoon Lee, Gary Geunbae Lee

Proceedings of the 5th international joint conference on natural language processing (IJCNLP 2011), pp. 741-748, Chiang Mai, Nov 2011.

(27.0% acceptance)

[C9]

Web-search enhanced contents retrieval for information access dialog system.

[PDF]

Donghyeon Lee, Cheongjae Lee, Minwoo Jeong, Kyungduk Kim, Seokhwan Kim, Junhwi Choi, Gary Geunbae Lee

Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), pp. 1297-1300, Florence, Aug 2011.

[J3]

A Local Tree Alignment Approach to Relation Extraction of Multiple Arguments.

[link]

Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee

Information Processing and Management, 47:4 (593-605), Jul 2011.

2010

[C8]

A Cross-lingual Annotation Projection Approach for Relation Detection.

[PDF]

[bib]

[slides]

Seokhwan Kim, Minwoo Jeong, Jonghoon Lee, Gary Geunbae Lee

Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), pp. 564-571, Beijing, Aug 2010.

(19.0% acceptance)

2009

[C7]

A Local Tree Alignment-based Soft Pattern Matching Approach for Information Extraction.

[PDF]

[bib]

[poster]

Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee

Proceedings of the North American Chapter of the Association for Computational Linguistics/Human Language Technology (NAACL HLT 2009), pp. 169-172, Colorado, May 2009.

[J2]

Example-based dialog modeling for practical multi-domain dialog system.

[link]

Cheongjae Lee, Sangkeun Jung, Seokhwan Kim, Gary Geunbae Lee

Speech Communications, 51:5 (466-484), May 2009.

2008

[J1]

DialogStudio: A workbench for data-driven spoken dialog system development and management.

[link]

Sangkeun Jung, Cheongjae Lee, Seokhwan Kim, Gary Geunbae Lee

Speech Communications, 50:8-9 (683-697), , Aug-Sep 2008.

[C6]

An Alignment-based Pattern Representation Model for Information Extraction.

[link]

[poster]

Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee

Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR '08), pp. 875-876, Singapore, Jul 2008.

[C5]

An alignment-based approach to semi-supervised relation extraction including multiple arguments.

[poster]

Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee. Kwangil Ko, Zino Lee

Proceedings of the fourth Asian Information Retrieval Symposium (AIRS 2008), Harbin, Jan 2008.

2007

[C4]

Example-based spoken dialog processing for guidance robots.

Cheongjae Lee, Seokhwan Kim, Minwoo Jeong, Sangkeun Jung, Donghyeon Lee, Gary Geunbae Lee

Proceedings of the 4th international conference on ubiquitous robots and ambient intelligence (URAI 2007), Pohang, Nov 2007.

[C3]

A spoken dialog system for electronic program guide information access.

[link]

[poster]

Seokhwan Kim, Cheongjae Lee, Sangkeun Jung, Gary Geunbae Lee

Proceedings of The 16th IEEE International Symposium on Robot and Human Interactive Communication (IEEE RO-MAN 2007), pp. 178-181, Jeju, Aug 2007.

[C2]

A semi-supervised method for efficient construction of statistical spoken language understanding resources.

[link]

[bib]

[poster]

Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee

Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), pp. 2797-2800, Antwerp, Aug 2007.

2006

[C1]

MMR-based active machine learning for Bio named entity recognition.

[PDF]

[bib]

[poster]

Seokhwan Kim, Yu Song, Kyungduk Kim, Jeong-Won Cha, Gary Geunbae Lee

Proceedings of the Human Language Technology/North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), pp. 69-72, New York, Jun 2006.

Patents

[P20]

Utilizing deep recurrent neural networks with layer-wise attention for punctuation restoration.

Seokhwan Kim

Registration #US11521071B2, USA, 06 Dec 2022.

[P19]

Classifying terms from source texts using implicit and explicit class-recognition-machine-learning models.

Sean MacAvaney, Franck Dernoncourt, Walter Chang, Seokhwan Kim, Doo Soon Kim, Chen Fang

Registration #US11630952B2, USA, 18 Apr 2023.

[P18]

Answering questions during video playback.

Seokhwan Kim

Registration #US11544590B2, USA, 03 Jan 2023.

[P17]

Generating a response to a user query utilizing visual features of a video segment and a query-response-neural network.

Wentian Zhao, Seokhwan Kim, Ning Xu, Hailin Jin

Registration #US11244167B2, USA, 08 Feb 2022.

[P16]

Automatic text segmentation based on relevant context.

Chan Young Park, Seokhwan Kim, Franck Dernoncourt, Nedim Lipka, Walter W. Chang

Registration #US11210470B2, USA, 28 Dec 2021.

[P15]

Generating ground truth annotations corresponding to digital image editing dialogues for training state tracking models.

Trung Bui, Zahra Rahimi, Yinglan Ma, Seokhwan Kim, Franck Dernoncourt

Registration #US11100917B2, USA, 24 Aug 2021.

[P14]

Utilizing a dynamic memory network to track digital dialog states and generate responses.

Seokhwan Kim, Walter Chang

Registration #US10909970B2, USA, 02 Feb 2021.

[P13]

Emphasizing key points in a speech file and structuring an associated transcription.

Franck Dernoncourt, Walter Wei-Tuh Chang, Seokhwan Kim, Sean Fitzgerald, Ragunandan Rao Malangully, Laurie Marie Byrum, Frederic Thevenet, Carl Iwan Dockhorn

Registration #US10783314B2, USA, 22 Sep 2020.

[P12]

Entity Name Tagging Method Capable of Applying an Unsupervised Learning Method Including a Restriction Condition and a Device Thereof.

Gary Geunbae Lee, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee, Junhwi Choi

Registration #10-1255957, South Korea, 11 Apr 2013.

[P11]

Conversation Classification Method Capable of Classifying Conversation Intention by Using a Hidden Markov Model.

Donghyeon Lee, Kyungduk Kim, Junhwi Choi, Seokhwan Kim, Gary Geunbae Lee

Registration #10-1255468, South Korea, 10 Apr 2013.

[P10]

Information Search Method by Using the Web Capable of Using a Vector Space Database and a Voice Converstaion Method Using the Method.

Donghyeon Lee, Kyungduk Kim, Junhwi Choi, Seokhwan Kim, Gary Geunbae Lee

Registration #10-1252397, South Korea, 02 Apr 2013.

[P9]

Speech Processing Device Capable of Effective Input Modification and a Method Thereof.

Gary Geunbae Lee, Junhwi Choi, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee

Registration #10-1197010, South Korea, 26 Oct 2012.

[P8]

Korean Open Type Information Extracting Method and Program Readable Recording Medium Capable of Increasing Korean Open Type Information Extracting Performances.

Gary Geunbae Lee, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee, Junhwi Choi

Registration #10-1180589, South Korea, 31 Aug 2012.

[P7]

Method for searching for information using the web and method for voice conversation using same.

Gary Geunbae Lee, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee, Junhwi Choi

Registration #PCT/KR2012/004405, 28 Aug 2012.

[P6]

User profile automatic creation apparatus through voice dialog meaning process, and contents recommendation apparatus using the same.

Hyungjong Noh, Seokhwan Kim, Donghyeon Lee, Gary Geunbae Lee

Registration #10-1072176, South Korea, 04 Oct 2011.

[P5]

Method and apparatus for searching voice data from audio and video data under the circumstances including unregistered words.

Donghyeon Lee, Hyungjong Noh, Seokhwan Kim, Gary Geunbae Lee

Registration #10-1069534, South Korea, 26 Sep 2011.

[P4]

Method of automatically detecting dangerous situation using natural language extraction and apparatus thereof.

Seokhwan Kim, Donghyeon Lee, Hyungjong Noh, Gary Geunbae Lee

Registration #10-1023031, South Korea, 10 Mar 2011.

[P3]

Method and apparatus for automatically constructing ontology from non-structure web documents.

Seokhwan Kim, Hyungjong Noh, Gary Geunbae Lee

Registration #10-0917176, South Korea, 07 Sep 2009.

[P2]

Method and system for recognizing biological named entity based on workbench.

Seokhwan Kim, Yu Song, Kyungduk Kim, Gary Geunbae Lee

Registration #10-0825687, South Korea, 22 Apr 2008.

[P1]

Dialog management apparatus and method for chatting agent.

Cheongjae Lee, Seokhwan Kim, Gary Geunbae Lee

Registration #10-0818979, South Korea, 27 Mar 2008.

Seokhwan KimStaff Software Engineer, Google Cloud AI

Contact

Research Interests

Work Experiences

Education

Professional Experiences

Tutorials and Invited Talks

Publications

Patents

Seokhwan Kim
Staff Software Engineer, Google Cloud AI