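Research Topics
- Integration of speech processing and language processing in speech recognition systems
- Recognition and understanding of spontaneous speech
- Large-vocabulary continuous speech recognition systems
- SPOJUS-SYNO, a Japanese continuous speech recognition system
- Spoken language interface systems
List of Publications
Journal Papers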
- Jun-ichi Takami, Atsuhiko Kai, and Shigeki Sagayama:
``A pairwise discriminant approach using artificial neural
networks for continuous speech recognition'',
The Journal of the Acoustical Society of Japan (E),
Vol.13, No.6, pp.411--418, 1992.
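- Seiichi Nakagawa and Atsuhiko Kai:
``A Context-Free Grammar-Driven, Frame-Synchronous HMM-Based Continuous
Speech Recognition Method Using Word Spotting'' (in Japanese),
IEICE Transactions (Japanese Edition), Vol.J76-D-II, No.7, pp.1329--1336, 1993.
- Seiichi Nakagawa and Atsuhiko Kai:
``A Context-Free Grammar-Driven, One-Pass HMM-Based Continuous Speech Recognition Method'' (in Japanese),
IEICE Transactions (Japanese Edition), Vol.J76-D-II, No.7, pp.1337--1345, 1993.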
- Seiichi Nakagawa and Atsuhiko Kai:
``A Context-Free Grammar-Driven, One-Pass HMM-Based Continuous
Speech Recognition Method'',
Systems and Computers in Japan, Vol.25, No.4, pp.92--102, 1994.
(English translation of the paper above)
- Atsuhiko Kai and Seiichi Nakagawa:
``Relationship among Recognition Rate, Rejection Rate and False Alarm
Rate in a Spoken Word Recognition System'',
IEICE Trans. on Information and Systems, Vol.E78-D, No.6, pp.698--704,
1995.
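- Atsuhiko Kai and Seiichi Nakagawa:
``Comparison of Continuous Speech Recognition Systems with Unknown-Word Processing for Speech Disfluencies'' (in Japanese),
IEICE Transactions (Japanese Edition), Vol.J80-D-II, No.10, pp.2615--2625, 1997.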
- Atsuhiko Kai and Seiichi Nakagawa:
``Comparison of Continuous Speech Recognition Systems with Unknown-Word Processing for Speech Disfluencies'',
Systems and Computers in Japan, Vol.29, No.9, pp.43--53, 1998.
(English translation of the paper above)
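- Seiichi Nakagawa, Miwako Torii (鳥居美和子), Atsuhiko Kai, and Hirofumi Nakanishi (中西 宏文):
``A Spoken Word Recognition System Allowing Additional Registration of Arbitrary Vocabulary'' (in Japanese),
IEEJ Transactions on Electronics, Information and Systems (Section C), Vol.118-C, No.6, pp.865--872, 1998.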
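- Atsuhiko Kai, Yoshifumi Hirose, and Seiichi Nakagawa:
``Processing of Unknown Words and Filled Pauses in a Speech Recognition System Using a Word N-gram Language Model'' (in Japanese),
IPSJ Journal, Vol.40, No.4, pp.1383--1394, 1999.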
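- Toshihiko Itoh, Atsuhiko Kai, Yoshiyuki Iwamoto (岩本善行), Makoto Mizutani (水谷 誠), Hiroki Yuasa (由浅裕規), Tatsuhiro Konishi, and Yukihiro Itoh:
``Comparison of Linguistic and Acoustic Features across Different Dialogue Situations in a Destination-Setting Task'' (in Japanese),
IPSJ Journal, Vol.43, No.7, pp.2118--2129, 2002.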
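- Kawamoto, Shimodaira, Nitta, Nishimoto, Nakamura, Itou, Morishima, Yotsukura, Kai, Lee, Yamashita, Kobayashi, Tokuda, Hirose, Minematsu, Yamada, Den, Utsuro, and Sagayama:
``Design of a Software Toolkit for Anthropomorphic Spoken Dialogue Agents with Customizability in Mind'' (in Japanese),
IPSJ Journal, Vol.43, No.7, pp.2249--2263, 2002.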
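- Atsuhiko Kai, Hirokazu Mori (盛 浩和), Takahiro Nakano, and Seiichi Nakagawa:
``A Speech User Interface System for Form-Based Web Information Retrieval Services and Evaluation of Its Usability'' (in Japanese),
IPSJ Journal, Vol.46, No.5, pp.1318--1329, 2005.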
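- N. Fujiwara, T. Itoh, K. Araki, A. Kai, T. Konishi and Y. Itoh:
``Spoken Language Understanding Method Using Confidence Measure and Dialogue History'' (in Japanese),
IEICE Transactions (Japanese Edition), Vol.J89-D, No.7, pp.1493--1503, 2006.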
- N. Fujiwara, T. Itoh, K. Araki, A. Kai, T. Konishi and Y. Itoh:
``Spoken Language Understanding Method Using Confidence Measure and
Dialogue History'',
Systems and Computers in Japan, Vol.38, No.9, pp.21--31, 2007.
(English translation of the paper above)
International Conferences & Symposia
- Atsuhiko Kai and Seiichi Nakagawa:
``A Frame-Synchronous Continuous Speech Recognition Algorithm
Using a Top-Down Parsing of Context-Free Grammar'',
Proc. of International Conference on Spoken Language Processing (ICSLP 92),
Alberta, Canada, pp.257--260, 1992.
- Atsuhiko Kai and Seiichi Nakagawa:
``Evaluation of Unknown Word Processing in a Spoken Word Recognition System'',
Proc. of International Conference on Spoken Language Processing (ICSLP 94),
Yokohama, Japan, pp.2151--2154, 1994.
- Atsuhiko Kai and Seiichi Nakagawa:
``Investigation on Unknown Word Processing and Strategies for
Spontaneous Speech Understanding'', Proc. of EUROSPEECH'95, Madrid,
Spain, pp.2095--2098, 1995.
- Atsuhiko Kai and Seiichi Nakagawa:
``A Continuous Speech Recognition System Using Loosely Constrained
Linguistic Knowledge for Spontaneous Speech'', Proc. of the first
China-Japan Workshop on Spoken Language Processing
(CJSLP'97), Huang Shan, P.R. China, pp.240--245, 1997.
- Seiichi Nakagawa, Atsuhiko Kai, Toshihiko Itoh and Masaki Ida:
``An Isolated/Continuous Speech Recognition System on a Personal
Computer'', Proc. of the first China-Japan Workshop on Spoken Language
Processing (CJSLP'97), Huang Shan, P.R. China, pp.216--223, 1997.
- Atsuhiko Kai and Seiichi Nakagawa:
``An acoustic look-ahead method for efficient frame-synchronous search
in a large vocabulary speech recognition system'', Proc. of
International Conference on Speech Processing (ICSP'97), Seoul,
Korea, pp.513--518, 1997.
- Atsuhiko Kai, Yoshifumi Hirose and Seiichi Nakagawa:
``Dealing with out-of-vocabulary words and speech disfluencies in an
N-gram based speech understanding system'', Proc. of
International Conference on Spoken Language Processing (ICSLP 98),
Sydney, Australia, pp.2427--2430, 1998.
- Atsuhiko Kai, Takahiro Nakano and Seiichi Nakagawa:
``A speech interface system for information retrieval tasks on the WWW'', Proc. of International Workshop Speech and Computer (SPECOM'99),
Moscow, Russia, pp.141--144, 1999.
- Atsuhiko Kai, Takahiro Nakano and Seiichi Nakagawa:
``Usability of Browser-Based Pen-Touch/Speech User Interfaces for
Form-Based Applications in Mobile Environment'', Lecture Notes in Computer Science 1948: Advances in Multimodal Interfaces - ICMI2000, pp.549--556, 2000.
- Atsuhiko Kai and Seiichi Nakagawa:
``Analysis of prosodic features on key-phrases and corrections in spoken dialogue'', Proc. for 2001 2nd Plenary Meeting and Symposium on Prosody and Speech Processing (Organized by Scientific Research of Priority Areas(B), Ministry of Science, Culture, Sports, Education, Japan), pp.179--184, 2002.
- Atsuhiko Kai, Yukari Nonomura, Toshihiko Itoh, Tatsuhiro Konishi and Yukihiro Itoh:
``Influence of different dialogue situations on user's behavior in spoken corrections'',
Proc. of International Conference on Spoken Language Processing (ICSLP 2002),
Denver, Colorado USA, pp.1189--1192, 2002.
- Toshihiko Itoh, Atsuhiko Kai, Tatsuhiro Konishi and Yukihiro Itoh:
``Linguistic and acoustic changes of user's utterances caused by different dialogue situations'',
Proc. of International Conference on Spoken Language Processing (ICSLP 2002),
Denver, Colorado USA, pp.545--548, 2002.
- Shin-ichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, and Shigeki Sagayama:
``Open-source software for developing anthropomorphic spoken dialog agents'',
In Proceedings of the International Workshop on LIFELIKE ANIMATED AGENTS:
Tools, Affective Functions, and Applications, August 2002.
- Atsuhiko Kai and Toshihiko Itoh:
``Prosodic feature and its application for detecting spoken corrections'', Proc. for 2002 2nd Plenary Meeting and Symposium on Prosody and Speech Processing (Organized by Scientific Research of Priority Areas, Ministry of Education, Culture, Sports, Science and Technology, Japan), pp.115--120, 2003.
- Toshihiko Itoh, Atsuhiko Kai, Yukihiro Itoh and Tatsuhiro Konishi:
``An understanding strategy based on plausibility score in recognition history using CSR confidence measure'', Proc. of International Conference on Spoken Language Processing (INTERSPEECH 2004 - ICSLP), pp.2133--2136, 2004.
- Yonggee Jang, Atsuhiko Kai, Longbiao Wang:
``Speech Interface for Isolated Words Based on Combination of Search
Candidates from the Common Word Parts'',
Proc. of the 10th Western Pacific Acoustics Conference (WESPAC X 2009), pp.0261
(7 pages), 2009.
- Longbiao Wang, Yoshiki Kishi, Atsuhiko Kai:
``Distant Speaker Recognition Based on the Automatic Selection of Reverberant Environments Using GMMs'',
Proc. CJKPR2009, pp.954--958, 2009.
- Yonggee Jang, Atsuhiko Kai and Longbiao Wang:
``Multimodal Interface with N-best Display Including Candidates of Spoken Word Fragments'',
Proc. of the 2nd. APSIPA Annual Summit and Conference, pp.478--481, 2010.
- Junki Ema, Longbiao Wang, Atsuhiko Kai and Toshihiko Itoh:
``Investigation of Driving-Behavior Modeling for Recognition of a Driving Situation'',
Proc. of the 2nd. APSIPA Annual Summit and Conference, pp.161--164, 2010.
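Books
- Yutaka Matsushita (松下温) and Tomoyuki Yashiro (屋代智之) (Eds.), ``ITS and Information and Communication Technology'' (in Japanese), Section 7.2, pp.132-139, Shokabo, 2003.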
- Shin-ichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta,
Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima,
Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita,
Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu,
Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama,
``Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents,''
Life-Like Characters. Tools, Affective Functions, and Applications.
Helmut Prendinger et al. (Eds.) Springer, pp.187-212, Nov 2003.
- S. Nakagawa, A. Kai and T. Itoh, ``The Spoken Dialogue System of TUT'' in S. Nakagawa, M. Okada and T. Kawahara (Eds.), Spoken Language Systems,
pp.129-142, Ohmsha/IOS Press, 2005.
Doctoral Dissertation
Last updated: 2011/1