photo

Seokhwan Kim Ph.D.
Research Scientist, Adobe Research

Contact

345 Park Avenue, E7-421, San Jose, CA 95110, USA
+1 (408) 536-6852

Research Interests

Natural Language Processing
Spoken Dialogue Systems
Natural Language Understanding
Text Mining

Education

Ph.D.
Sep 2005 - Feb 2012
Computer Science and Engineering
Dissertation: Cross-Lingual Weakly-Supervised Learning of Semantic Relations
Advisor: Gary Geunbae Lee
B.S.
Mar 2001 - Aug 2005

Research Experiences

Research Scientist
Jul 2017 -
Creative Intelligence Lab.
Research Scientist
Jan 2012 - Jul 2017
Dialogue Technology Lab.,
Human Language Technology Department
Research Intern
Apr 2011 - Aug 2011
Databases and Information Systems
Advisor: Prof. Gerhard Weikum

Teaching Experiences

Teaching Assistant
Spring 2010
CS101, Introduction to Computing
Professor: Gary Geunbae Lee
Teaching Assistant
Spring 2006
CS421, Database System
Professor: Byoung-kee Yi

Professional Experiences

Reviewer
ACL, ACM-SIGIR, CSL, EMNLP, IEEE ASRU, IEEE ICASSP, IEEE SLT, IEEE/ACM TASLP, IJCAI, IJCNLP, IWSDS, Interspeech, NAACL-HLT

Tutorials and Invited Talks

2016
Natural Language in Human-Robot Interaction.
Rafael E. Banchs, Seokhwan Kim, Luis Fernando D'Haro, Andreea I. Niculescu
Tutorial @ The 4th International Conference on Human-Agent Interaction (HAI 2016), 04 Oct 2016.

Publications

2019
[J6]
Overview of the Sixth Dialog System Technology Challenge: DSTC6.
Chiori Hori, Julien Perez, Ryuichi Higasinaka, Takaaki Hori, Y-Lan Boureau, Michimasa Inaba, Yuiko Tsunomori, Tetsuro Takahashi, Koichiro Yoshino, Seokhwan Kim
Computer Speech & Language, 55 (1-25), May 2019.
[C43]
Dynamic Memory Networks for Dialogue Topic Tracking.
Seokhwan Kim
To appear in Proceedings of the AAAI-19 Workshop on Dialog System Technology Challenges (DSTC7), Honolulu, Jan 2019.
2018
[C42]
A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents.
Arman Cohan, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Seokhwan Kim, Walter Chang, Nazli Goharian
Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2018), pp. 615-621, New Orleans, Jun 2018.
[C41]
PhotoshopQuiA: A Corpus of Non-Factoid Questions and Answers for Why-Question Answering.
Andrei Dulceanu, Thang Le Dinh, Walter Chang, Trung Bui, Doo Soon Kim, Manh Chien Vu, Seokhwan Kim
Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 2763-2770, Miyazaki, May 2018.
2017
[C40]
Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text (Team DL2.0, ranked at 22/650).
Zhe Wang, Kingsley Kuan, Mathieu Ravaut, Gaurav Manek, Sibo Song, Yuan Fang, Seokhwan Kim, Nancy F. Chen, Luis Fernando D'Haro, Anh Tuan Luu, Hongyuan Zhu, Zeng Zeng, Ngai Man Cheung, Georgios Piliouras, Jie Lin, Vijay Chandrasekhar
Proceedings of the CVPR 2017 Workshop on YouTube-8M Large-Scale Video Understanding (YouTube-8M), Honolulu, Jul 2017.
[J5]
Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems.
Seonghan Ryu, Seokhwan Kim, Junhwi Choi, Hwanjo Yu, Gary Geunbae Lee
Pattern Recognition Letters, 88 (26-32), Mar 2017.
2016
[C39]
The Fifth Dialog State Tracking Challenge.
Seokhwan Kim, Luis Fernando D'Haro, Rafael E. Banchs, Jason D. Williams, Matthew Henderson, Koichiro Yoshino
Proceedings of the 2016 IEEE Spoken Language Technology Workshop (SLT 2016), pp. 511-517, San Diego, Dec 2016.
[C38]
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling for Dialogue Topic Tracking.
Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), pp. 963-973, Berlin, Aug 2016.
(28.0% acceptance)
[C37]
The Fourth Dialog State Tracking Challenge.
Seokhwan Kim, Luis Fernando D'Haro, Rafael E. Banchs, Jason D. Williams, Matthew Henderson
Proceedings of the 7th International Workshop on Spoken Dialogue Systems (IWSDS 2016), Saariselkä, Jan 2016.
2015
[C36]
A Robust Spoken Q&A System with Scarce In-Domain Resources.
Luis Fernando D'Haro, Seokhwan Kim, Rafael E. Banchs
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2015 (APSIPA ASC 2015), Hong Kong, Dec 2015.
[C35]
Configuration of Dialogue Agent with Multiple Knowledge Sources.
Ridong Jiang, Rafael Banchs, Seokhwan Kim, Luis Fernando D'Haro, Andreea Niculescu, Kheng Hui Yeo
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2015 (APSIPA ASC 2015), Hong Kong, Dec 2015.
[C34]
Towards Improving the Performance of Vector Space Model for Chinese Frequently Asked Question.
Ridong Jiang, Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Proceedings of the 19th International Conference on Asian Language Processing (IALP 2015), Suzhou, Oct 2015.
[C33]
Conversational Agent and Management Tools for Conference and Tourism Domain.
Luis Fernando D'Haro, Seokhwan Kim, Rafael E. Banchs
Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Dresden, Sep 2015. (Show and Tell)
[C32]
Towards Improving Dialogue Topic Tracking Performances with Wikification of Concept Mentions.
Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Proceedings of the 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), pp. 124-128, Prague, Sep 2015.
[C31]
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constraints from Wikipedia.
Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015), pp. 2225-2229, Lisbon, Sep 2015.
(24.0% acceptance)
[C30]
CLARA: a multifunctional virtual agent for conference support and touristic information.
Luis Fernando D'Haro, Seokhwan Kim, Kheng Hui Yeo, Ridong Jiang, Andreea I. Niculescu, Rafael E. Banchs, Haizhou Li
Proceedings of the 6th International Workshop on Spoken Dialog System (IWSDS 2015), Busan, Jan 2015. (Demo)
2014
[C29]
Design and Evaluation of a Conversational Agent for the Touristic Domain.
Andreea Niculescu, Kheng Hui Yeo, Luis Fernando D'haro, Seokhwan Kim, Ridong Jiang, Rafael Banchs
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2014 (APSIPA ASC 2014), Siem Reap, Dec 2014.
[C28]
An empirical evaluation of an IR-based strategy for chat-oriented dialogue systems.
Rafael E. Banchs, Seokhwan Kim
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2014 (APSIPA ASC 2014), Siem Reap, Dec 2014.
[C27]
R-cube: a dialogue agent for Restaurant Recommendation and Reservation.
Seokhwan Kim, Rafael E. Banchs
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2014 (APSIPA ASC 2014), Siem Reap, Dec 2014.
[C26]
Grammatical Error Correction Based on Learner Comprehension Model in Oral Conversation.
Kyusong Lee, Seonghan Ryu, Paul Hongsuck Seo, Seokhwan Kim, Gary Geunbae Lee
Proceedings of the 2014 IEEE Spoken Language Technology Workshop (SLT 2014), South Lake Tahoe, Dec 2014.
[C25]
SARA - Singapore's Automated Responsive Assistant for the Touristic Domain.
Andreea Niculescu, Seokhwan Kim, Kheng Hui Yeo, Arthur Niswar, Ridong Jiang, Rafael E. Banchs
Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), pp. 2138-2139, Singapore, Sep 2014. (Show and Tell)
[C24]
Spoken Dialogue System for Restaurant Recommendation and Reservation.
Rafael E. Banchs, Seokhwan Kim
Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), pp. 1488-1489, Singapore, Sep 2014. (Show and Tell)
[C23]
SARA - Singapore's Automated Responsive Assistant - A Multimodal Dialogue System for Touristic Information.
Andreea Niculescu, Rafael E. Banchs, Seokhwan Kim, Kheng Hui Yeo, Arthur Niswar, Ridong Jiang
Proceedings of the 11th International Conference on Mobile Web Information Systems (MobiWIS 2014), pp. 153-164, Barcelona, Aug 2014.
[C22]
Sequential Labeling for Tracking Dynamic Dialog States.
Seokhwan Kim, Rafael E. Banchs
Proceedings of the 15th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2014), pp. 332-336, Philadelphia, Jun 2014.
[C21]
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia.
Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), vol. 2, pp. 19-23, Baltimore, Jun 2014.
(25.0% acceptance)
[C20]
Wikipedia-based Kernels for Dialogue Topic Tracking.
Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Proceedings of the 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), pp. 131-135, Florence, May 2014.
[C19]
AIDA: An Avatar-Supported Multimodal Dialogue System.
Arthur Niswar, Rafael E. Banchs, Ridong Jiang, Seokhwan Kim
Proceedings of the 27th International Conference on Computer Animation and Social Agents (CASA 2014), Houston, May 2014.
[J4]
Cross-Lingual Annotation Projection for Weakly-Supervised Relation Extraction.
Seokhwan Kim, Minwoo Jeong, Jonghoon Lee, Gary Geunbae Lee
ACM Transactions on Asian Language Information Processing, 13:1 (3), Feb 2014.
[C18]
Web-based Multimodal Multi-domain Spoken Dialogue System.
Ridong Jiang, Rafael E. Banchs, Seokhwan Kim, Kheng Hui Yeo, Arthur Niswar, Haizhou Li
Proceedings of the 5th International Workshop on Spoken Dialog Systems (IWSDS 2014), Napa, Jan 2014.
2013
[C17]
AIDA: Artificial Intelligent Dialogue Agent.
Rafael E. Banchs, Ridong Jiang, Seokhwan Kim, Arthur Niswar, Kheng Hui Yeo
Proceedings of the 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2013), pp. 145-147, Metz, Aug 2013. (Demo)
[C16]
A Graph-based Cross-lingual Projection Approach for Spoken Language Understanding Portability to a New Language.
Seokhwan Kim
Proceedings of the 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), pp. 8332-8336, Vancouver, May 2013.
2012
[C15]
A Two-Step Approach for Efficient Domain Selection in Multi-Domain Dialog Systems.
Injae Lee, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee, Junhwi Choi, Seonghan Ryu, Geunbae Lee
Proceedings of the 4th International Workshop on Spoken Dialog System (IWSDS 2012), pp. 105-111, Ermenonville, Nov 2012.
[C14]
A Meta Learning Approach to Grammatical Error Correction.
Hongsuck Seo, Jonghoon Lee, Seokhwan Kim, Kyusong Lee, Sechun Kang and Gary Geunbae Lee
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), vol. 2, pp. 328-332, Jeju, Jul 2012.
(20.0% acceptance)
[C13]
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction.
Seokhwan Kim, Gary Geunbae Lee
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), vol. 2, pp. 48-53, Jeju, Jul 2012.
(20.0% acceptance)
[C12]
Seamless error correction interface for voice word processor.
Junhwi Choi, Kyungduk Kim, Sungjin Lee, Seokhwan Kim, Donghyeon Lee, Injae Lee, Gary Geunbae Lee
Proceedings of the 37th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), pp. 4973-4976, Kyoto, Mar 2012.
2011
[C11]
Multi-domain spoken dialog system for information access in mobile environment.
Junhwi Choi, Kyungduk Kim, Seokhwan Kim, Donghyeon Lee, Injae Lee, Gary Geunbae Lee
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (IEEE ASRU 2011), Hawaii, Dec 2011. (Demo)
[C10]
A Cross-lingual Annotation Projection-based Self-supervision Approach for Open Information Extraction.
Seokhwan Kim, Minwoo Jeong, Jonghoon Lee, Gary Geunbae Lee
Proceedings of the 5th international joint conference on natural language processing (IJCNLP 2011), pp. 741-748, Chiang Mai, Nov 2011.
(27.0% acceptance)
[C9]
Web-search enhanced contents retrieval for information access dialog system.
Donghyeon Lee, Cheongjae Lee, Minwoo Jeong, Kyungduk Kim, Seokhwan Kim, Junhwi Choi, Gary Geunbae Lee
Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), pp. 1297-1300, Florence, Aug 2011.
[J3]
A Local Tree Alignment Approach to Relation Extraction of Multiple Arguments.
Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee
Information Processing and Management, 47:4 (593-605), Jul 2011.
2010
[C8]
A Cross-lingual Annotation Projection Approach for Relation Detection.
Seokhwan Kim, Minwoo Jeong, Jonghoon Lee, Gary Geunbae Lee
Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), pp. 564-571, Beijing, Aug 2010.
(19.0% acceptance)
2009
[C7]
A Local Tree Alignment-based Soft Pattern Matching Approach for Information Extraction.
Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee
Proceedings of the North American Chapter of the Association for Computational Linguistics/Human Language Technology (NAACL HLT 2009), pp. 169-172, Colorado, May 2009.
[J2]
Example-based dialog modeling for practical multi-domain dialog system.
Cheongjae Lee, Sangkeun Jung, Seokhwan Kim, Gary Geunbae Lee
Speech Communications, 51:5 (466-484), May 2009.
2008
[J1]
DialogStudio: A workbench for data-driven spoken dialog system development and management.
Sangkeun Jung, Cheongjae Lee, Seokhwan Kim, Gary Geunbae Lee
Speech Communications, 50:8-9 (683-697), , Aug-Sep 2008.
[C6]
An Alignment-based Pattern Representation Model for Information Extraction.
Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee
Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR '08), pp. 875-876, Singapore, Jul 2008.
[C5]
An alignment-based approach to semi-supervised relation extraction including multiple arguments.
Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee. Kwangil Ko, Zino Lee
Proceedings of the fourth Asian Information Retrieval Symposium (AIRS 2008), Harbin, Jan 2008.
2007
[C4]
Example-based spoken dialog processing for guidance robots.
Cheongjae Lee, Seokhwan Kim, Minwoo Jeong, Sangkeun Jung, Donghyeon Lee, Gary Geunbae Lee
Proceedings of the 4th international conference on ubiquitous robots and ambient intelligence (URAI 2007), Pohang, Nov 2007.
[C3]
A spoken dialog system for electronic program guide information access.
Seokhwan Kim, Cheongjae Lee, Sangkeun Jung, Gary Geunbae Lee
Proceedings of The 16th IEEE International Symposium on Robot and Human Interactive Communication (IEEE RO-MAN 2007), pp. 178-181, Jeju, Aug 2007.
[C2]
A semi-supervised method for efficient construction of statistical spoken language understanding resources.
Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee
Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), pp. 2797-2800, Antwerp, Aug 2007.
2006
[C1]
MMR-based active machine learning for Bio named entity recognition.
Seokhwan Kim, Yu Song, Kyungduk Kim, Jeong-Won Cha, Gary Geunbae Lee
Proceedings of the Human Language Technology/North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), pp. 69-72, New York, Jun 2006.

Patents

[P12]
Entity Name Tagging Method Capable of Applying an Unsupervised Learning Method Including a Restriction Condition and a Device Thereof.
Gary Geunbae Lee, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee, Junhwi Choi
Registration #10-1255957, South Korea, 11 Apr 2013.
[P11]
Conversation Classification Method Capable of Classifying Conversation Intention by Using a Hidden Markov Model.
Donghyeon Lee, Kyungduk Kim, Junhwi Choi, Seokhwan Kim, Gary Geunbae Lee
Registration #10-1255468, South Korea, 10 Apr 2013.
[P10]
Information Search Method by Using the Web Capable of Using a Vector Space Database and a Voice Converstaion Method Using the Method.
Donghyeon Lee, Kyungduk Kim, Junhwi Choi, Seokhwan Kim, Gary Geunbae Lee
Registration #10-1252397, South Korea, 02 Apr 2013.
[P9]
Speech Processing Device Capable of Effective Input Modification and a Method Thereof.
Gary Geunbae Lee, Junhwi Choi, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee
Registration #10-1197010, South Korea, 26 Oct 2012.
[P8]
Korean Open Type Information Extracting Method and Program Readable Recording Medium Capable of Increasing Korean Open Type Information Extracting Performances.
Gary Geunbae Lee, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee, Junhwi Choi
Registration #10-1180589, South Korea, 31 Aug 2012.
[P7]
Method for searching for information using the web and method for voice conversation using same.
Gary Geunbae Lee, Seokhwan Kim, Kyungduk Kim, Donghyeon Lee, Junhwi Choi
Registration #PCT/KR2012/004405, 28 Aug 2012.
[P6]
User profile automatic creation apparatus through voice dialog meaning process, and contents recommendation apparatus using the same.
Hyungjong Noh, Seokhwan Kim, Donghyeon Lee, Gary Geunbae Lee
Registration #10-1072176, South Korea, 04 Oct 2011.
[P5]
Method and apparatus for searching voice data from audio and video data under the circumstances including unregistered words.
Donghyeon Lee, Hyungjong Noh, Seokhwan Kim, Gary Geunbae Lee
Registration #10-1069534, South Korea, 26 Sep 2011.
[P4]
Method of automatically detecting dangerous situation using natural language extraction and apparatus thereof.
Seokhwan Kim, Donghyeon Lee, Hyungjong Noh, Gary Geunbae Lee
Registration #10-1023031, South Korea, 10 Mar 2011.
[P3]
Method and apparatus for automatically constructing ontology from non-structure web documents.
Seokhwan Kim, Hyungjong Noh, Gary Geunbae Lee
Registration #10-0917176, South Korea, 07 Sep 2009.
[P2]
Method and system for recognizing biological named entity based on workbench.
Seokhwan Kim, Yu Song, Kyungduk Kim, Gary Geunbae Lee
Registration #10-0825687, South Korea, 22 Apr 2008.
[P1]
Dialog management apparatus and method for chatting agent.
Cheongjae Lee, Seokhwan Kim, Gary Geunbae Lee
Registration #10-0818979, South Korea, 27 Mar 2008.