S. Sakti

   
 
Dr.-Ing. S. Sakti

 
Scientific Book
  • 中村 哲, S. Sakti, G. Neubig, 戸田 智基, 高道 慎之介, "音声言語の自動翻訳- コンピュータによる自動翻訳を目指して", 音響サイエンスシリーズ 18, 日本音響学会 (編集), 2018 [in Japanese]
  • S. Sakti, K. Markov, S. Nakamura, W. Minker, "Incorporating Knowledge Sources into Statistical Speech Recognition", Springer, Boston (USA), Series: Lecture Notes in Electrical Engineering, Vol.42, 2009.
 
 
Chapter in Scientific Book
  • L. Nio, S. Sakti, G. Neubig, T. Toda, M. Adriani, S. Nakamura, "Developing Non-Goal Dialog System based on Examples of Drama Television", in Natural Interaction with Robots Knowbots and Smartphones", Chapter 32, pp. 307-314, Springer, New York, 2014.
  • H. Hofmann, S. Sakti, R. Isotani, H.Kawai, S. Nakamura, W. Minker, "Sequence-based Pronunciation Modeling Using a Noisy-Channel Approach", in Spoken Language Dialogue Systems for Ambient Environments, Series: Lecture Notes in Computer Science / Artificual Intelligent (LNCS/LNAI), pp. 156-162, Springer, Berlin, 2010.
 
 
Paper Journal
  • S. Nakayama, A. Tjandra, S. Sakti, S. Nakamura, "Code-Switching ASR and TTS using Semisupervised Learning with Machine Speech Chain", IEICE Transactions on Information and Systems, Vol.E104-D,No.10, July. 7-8, 2021
  • J. Effendi, A. Tjandra, S. Sakti, S. Nakamura, "Multimodal Chain: Cross-Modal Collaboration Through Listening, Speaking, and Visualizing", IEEE Access, 9./2021, 70286-70299, May. 6, 2021
  • J. Effendi, S. Sakti, S. Nakamura, "End-to-End Image-to-Speech Generation for Untranscribed Unknown Languages", IEEE Access, 55144-55154, Apr. 7, 2021
  • F. Yang and X. Chang and S. Sakti and Y. Wu and S. Nakamura, "ReMOT: A Model-agnostic Refinement for Multiple Object Tracking", Image and Vision Computing, Dec. 13, 2020
  • B. Wu, S. Sakti, J. Zhang and S. Nakamura, "Tackling Perception Bias in Unsupervised Phoneme Discovery Using DPGMM-RNN Hybrid Model and Functional Load", IEEE/ACM Transactions on Audio, Speech, and Language Processing, Dec. 2, 2020
  • T.-T Nguyen, K. Yoshino, S. Sakti, and S. Nakamura, "Policy reuse for dialog management using action-relation probability", IEEE Access 2020
  • F. Yang, Y. Wu, Z. Wang, X. Li, S. Sakti, S. Nakamura, "Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval", IEEE Transaction on Multimedia 2020
  • T. Kano, S. Sakti, S. Nakamura, "End-to-end Speech Translation with Transcoding by Multi-task Learning for Distant Language Pairs", IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol: 28, No. 1, pp. 1342-1355, Den. 2020
  • S. Shinagawa, K. Yoshino, Seyed Hossein Alavi, Kallirroi Georgila, David Traum, S. Sakti, S. Nakamura, "An Interactive Image Editing System using an Uncertainty-based Confirmation Strategy", IEEE Access, May 2020
  • A. Tjandra, S. Sakti, S. Nakamura",Machine Speech Chain", IEEE Transactions of Audio Speech and Language Processing, March 2020
  • H.Watanabe, H. Tanaka, S. Sakti, S. Nakamura",Synchronization between overt speech envelope and EEG oscillations during imagined speech", Neuroscience Research, vol. 153, pp. 48-55, April 2020
  • J. Effendi, K. Sudoh, S. Sakti, S. Nakamura, "Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation", IEICE, Vol.E103-D,No.03, Mar. 2020
  • T.-T Nguyen, K. Yoshino, S. Sakti, S. Nakamura, "Dialog Management of Healthcare Consulting System by Utilizing Deceptive Information", Journal of JSAI, Vol.35, No.1, 35_DSI-C, 2020
  • S. Shinagawa, K. Yoshino, S. Sakti, Yu Suzuki, S. Nakamura, "Image Manipulation System with Natural Language Instruction", IEICE Transactions on Information and Systems, Vol.J102-D, No.8, pp.514?529, August. 2019
  • A. Tjandra, S. Sakti, S. Nakamura, "Recurrent Neural Network Compression based on Low-Rank Tensor Representation", IEICE, October 2019
  • F. Yang, S. Sakti, W. Yang, S. Nakamura, "A Framework for Knowing Who is Doing What in Aerial Surveillance Videos", Journal IEEE Access, July 2019
  • A. Tjandra, S. Sakti, S. Nakamura, "End-to-End Speech Recognition Sequence Training with Reinforcement Learning", Journal IEEE Access, June 2019
  • N. Lubis, S. Sakti, K. Yoshino, S. Nakamura, "Positive Emotion Elicitation in Chat-Based Dialogue Systems", IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume: 27, Issue: 4, pp. 866-877, April 2019
  • H. Tanaka, H.Watanabe, H. Maki, S. Sakti, S. Nakamura, "Electroencephalogram-Based Single-Trial Detection of Language Expectation Violations in Listening to Speech", Frontiers in Computational Neuroscience, vol. 13, pp 1-15, March 2019
  • H.Watanabe, H. Tanaka, S. Sakti, S. Nakamura, "Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception", IEICE TRANSACTIONS on Information and System, E102-D, 2, pp.383-391, 2019
  • M. Heck, S. Sakti, S. Nakamura, "Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling", IEEE/ACM Transactions on Audio, Speech, and Language Processing, Volume 26, No 11, pp. 2027-2042, November 2018
  • Q.-T. Do, S. Sakti, and S. Nakamura, "Sequence-to-Sequence Models for Emphasis Speech Translation", IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 26, No. 10, October 2018
  • N. Lubis, D. Lestari, S. Sakti, Ayu Purwarianti, and S. Nakamura, "Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition", IEICE TRANSACTIONS on Information and Systems, Vol.E101-D No.8, pp.2092-2100, August 2018
  • H. Maki, S. Sakti, H. Tanaka, S. Nakamura, "Quality Prediction of Synthesized Speech Based on Tensor Structured EEG Signals", PloS One, 2018, pp. to appear
  • Q.-T. Do, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Preserving Word-level Emphasis in Speech-to-speech Translation", IEEE Transactions on Audio, Speech and Language Processing, 2017
  • Y. Oshima, S. Takamichi, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Non-Native Text-To-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics", IEICE Transactions on Information and Systems, December 2016.
  • L. Nio, S. Sakti, G. Neubig, K. Yoshino and S. Nakamura, "Neural Network Approaches to Dialog Response Retrieval and Generation", IEICE Transactions of Information and Systems, vol.E99-D, no.10, pp2508-2517, Oct. 2016.
  • H. Maki, T. Toda, S. Sakti, G. Neubig, S. Nakamura, "Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior", IEICE Transactions on Information and Systems, E99-D-6, pp.1437-1446. June 2016.
  • S. Takamichi, T. Toda, A.W. Black, G. Neubig, S. Sakti, S. Nakamura, "Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis", IEEE/ACM Transactions on Audio, Speech, and Language Processing Vol.24, No.4, pp.755-767, April 2016.
  • H. Tanaka, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "NOCOA+: Multimodal Computer-Based Training for Social and Communication Skills", IEICE Transactions on Information and Systems, August 2015.
  • K. Kubo, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model", Trans. Inf. & Syst., June 2014.
  • L. Nio, S. Sakti, G. Neubig, T. Toda, S. Nakamura. "Utilizing Human-to-Human Conversation Examples for a Multi Domain Chat-oriented Dialog System", Trans. Inf. & Syst., June 2014.
  • K. Kobayashi, T. Toda, H. Doi, T. Nakano, M. Goto, G. Neubig, S. Sakti, S. Nakamura, "Voice Timbre Control Based on Perceived Age in Singing Voice Conversion"Trans. Inf. & Syst., June 2014.
  • K. Tanaka, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation", Trans. Inf. & Syst., June 2014.
  • S. Takamichi, T. Toda, Y. Shiga, S. Sakti, G. Neubig, S. Nakamura, "Parameter Generation Methods with Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis", IEEE Journal of Selected Topics in Signal Processing, April 2014.
  • H. Tanaka, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "NOCOA: A Computer-Based Training Tool for Social and Communication Skills That Exploits Non-verbal Behaviors"Journal of Information and Systems in Education, January 2014.
  • S. Sakti, M. Paul, A. Finch, S. Sakai, T.-T. Vu, N. Kimura, C. Hori, E. Sumita, S. Nakamura, J. Park, C. Wutiwiwatchai, B. Xu, H. Riza, K. Arora, C.-M. Luong, H. Li, "A-STAR: Toward Tranlating Asian Spoken Languages", Special issue on Speech-to-Speech Translation, Computer Speech and Language Journal (Elsevier), vol. 27, Issue 2, pp. 509-527, February 2013
  • H. Hofmann, S. Sakti, C. Hori, H. Kashioka, S. Nakamura, W. Minker, "Sequence-based Pronunciation Variation Modeling for Spontaneous ASR using a Noisy Channel Approach", IEICE Trans. Inf. & Syst., vol. E95-D, pp. 2084-2093, August 2012
  • S. Sakti, M. Paul, A. Finch, X. Hu, J. Ni, N. Kimura, S. Matsuda, C. Hori, Y. Ashikari, H. Kawai, H. Kashioka, E. Sumita, S. Nakamura, "Distributed Speech Translation Technologies for Multiparty Multilingual Communication", ACM Trans. Speech Lang. Process., vol. 9, Issue 2, Article 4, July 2012
  • S. Sakti, K. Markov, S. Nakamura, "Incorporating Knowledge Sources into a Statistical Acoustic Model for Spoken Language Communication Systems", IEEE Trans. on Computers, pp. 1199-1211, Oct 2007.
  • S. Sakti, S. Nakamura, K. Markov, "Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework", IEICE Trans. Inf. & Syst., vol. E89-D, no.3, pp. 946-953, 2006.
  • S. Sakti, K. Markov, S. Nakamura, "A Hybrid HMM/BN Acoustic Model Utilizing Pentaphone-Context Dependency", IEICE Trans. Inf. & Syst., vol. E89-D, no.3, pp. 954-961, 2006.
 
 
Paper Conference
    Year 2021
    • R. Fukuda, Y. Oka, Y. Kano, Y. Yano, Y. Ko, H. Tokuyama, K. Doi, S. Sakti, K. Sudoh, S. Nakamura, "NAIST English-to-Japanese Simultaneous Translation System for IWSLT 2021 Simultaneous Text-to-text Task", Proc. the 18th International Conference on Spoken Language Translation (IWSLT 2021), pp. 39-45, Aug. 5, 2021
    • S. Takahashi, S. Sakti, S. Nakamura, "Unsupervised Neural-Based Graph Clustering for Variable-Length Speech Representation Discovery of Zero-Resource Languages", Proc. Interspeech 2021, pp. 1559-1563, Sep. 1, 2021
    • S. Novitasari, S. Sakti, S. Nakamura, "Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder", Proc. Interspeech 2021, pp. 4124-4128, Aug. 30, 2021
    • J. Effendi, S. Sakti, S. Nakamura, "Weakly-supervised Speech-to-text Mapping with Visually Connected Non-parallel Speech-text Data using Cyclic Partially-aligned Transformer", Proc. Interspeech 2021, Sep. 1, 2021
    • Y. Ko, Katsuhito Sudoh, S. Sakti and S. Nakamura, "ASR Posterior-based Loss for Multi-task End-to-end Speech Translation", Proc. Interspeech 2021, pp. 2272-2276, September. 1, 2021
    • H. Tokuyama, S. Sakti, Katsuhito Sudoh, S. Nakamura, "Transcribing Paralinguistic Acoustic Cues to Target Language Text in Transformer-Based Speech-to-Text Translation", Proc. Interspeech 2021, pp. 2262-2266, September. 1, 2021
    • T. Kano, S. Sakti, S. Nakamura, "Transformer-based Direct Speech-to-speech Translation with Transcoder", IEEE Spoken Language Technology Workshop, Jan. 20, 2021
    • B. Wu, S. Sakti and S. Nakamura, "Incorporating Discriminative DPGMM Posteriorgrams for Low-resource ASR", IEEE Spoken Language Technology Workshop, Jan. 22, 2021
    Year 2020
    • F. Yang, F. Li, Y. Wu, S. Sakti, S. Nakamura, "Using Panoramic Videos for Multi-person Localization and Tracking in a 3D Panoramic Coordinate", Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May. 2020
    • A. Tjandra, S. Sakti and S. Nakamura, "Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge", Proc. INTERSPEECH, pp. to appear, 2020
    • E. Dunbar, J. karadayi, M. Bernard, X.-N. Cao, R. Algayres, L. Ondel, L. Besacier, S. Sakti and E. Dupoux, "The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units", Proc. INTERSPEECH, 2020
    • K. Tsunematsu, J. Effendi, S. Sakti and S. Nakamura, "Neural Speech Completion", Proc. INTERSPEECH, 2020
    • S. Novitasari, A. Tjandra, Tomoya Yanagita, S. Sakti and S. Nakamura, "Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time", Proc. INTERSPEECH, 2020
    • J. Effendi, A. Tjandra, S. Sakti and S. Nakamura, "Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework", Proc. INTERSPEECH, 2020
    • I. Parmonangan, H. Tanaka, S. Sakti and S. Nakamura, "Combining Audio and Brain Activity for Predicting Speech Quality," Proc. INTERSPEECH", 2020
    • F. Yang, X. Chang, C. Dang, Z. Zheng, Y. Wu, S. Sakti, and S. Nakamura, "ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation", Proc. BMTT MOTChallenge Workshop of CVPR, June 2020
    • S. Novitasari, A. Tjandra, S. Sakti, S. Nakamura, "Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis", Proc. the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL), pp 131-138, May 2020
    • S. Asai, K. Yoshino, Seitaro Shinagawa, S. Sakti, S. Nakamura, "Emotional Speech Corpus for Persuasive Dialogue System", Proc. the 12th Conference on Language Resources and Evaluation (LREC 2020), pp. 491-497, May 2020
    • F. Yang, F. Li, Y. Wu, S. Sakti, S. Nakamura, "Using Panoramic Videos for Multi-person Localization and Tracking in a 3D Panoramic Coordinate", Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May. 2020
    Year 2019
    • T. Kano, S. Sakti, S. Nakamura, "Neural Machine Translation with Acoustic Embedding", Proc. of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Dec 2019
    • S. Nakayama, A. Tjandra, S. Sakti, S. Nakamura, " Zero-shot Code-switching ASR and TTS with Multilingual Machine Speech Chain", Proc. of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Dec 2019
    • J. Effendi, A. Tjandra, S. Sakti, S. Nakamura, "Listening while Speaking: Improving ASR through Multimodal Chain", Proc. of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Dec 2019
    • A. Tjandra, S. Sakti, S. Nakamura, "Speech-to-speech Translation between Untranscribed Unknown Languages", Proc. of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Dec 2019
    • N. Lubis, S. Sakti, K. Yoshino, S. Nakamura, "Dialogue Model and Response Generation for Emotion Improvement Elicitation", Proc. of the 3rd Conversational AI workshop - NeurIPS 2019, Dec. 2019
    • N.-T. Tung, K. Yoshino, S. Sakti, S. Nakamura, "Hierarchical Tensor Fusion Network for Deception Handling Negotiation Dialog Model", Proc. of the 3rd Conversational AI workshop - NeurIPS 2019, Dec. 2019
    • F. Yang, S. Sakti, Y. Wu, and S. Nakamura, "Make Skeleton-based Action Recognition Model Smaller, Faster and Better", Proc. of ACM International Conference on Multimedia in Asia, Dec. 2019
    • S. Nakayama, T. Kano, A. Tjandra, S. Sakti, and S. Nakamura, "Recognition and Translation of Code-switching Speech Utterances", Proc. of Oriental COCOSDA 2019, No.71, Oct. 2019
    • M. Okamoto, S. Sakti, and S. Nakamura, "Phoneme Level Speaking Rate Variation on Waveform Generation using GAN-TTS", Proc. of Oriental COCOSDA 2019, No.55, Oct. 2019
    • S. Novitasari, A. Tjandra, S. Sakti, S. Nakamura, "Sequence-to-sequence Learning via Attention Transfer for Incremental Speech Recognition", Proc. of Interspeech 2019, Graz, Austria, Sep 2019
    • A. Tjandra, B. Sisman, M. Zhang, S. Sakti, H. Li, S. Nakamura, "VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019", Interspeech 2019, Graz, Austria, Sep. 2019
    • E. Dunbar, R. Algayres, J. Karadayi, M, Bernard, J. Benjumea, X.-N. Cao, L. Miskic, C. Dugrain, L. Ondel, A.-W. Black, L. Besacier, S. Sakti, E. Dupoux, "The Zero Resource Speech Challenge 2019: TTS Without T", Proc. of Interspeach2019, Sep. 2019
    • T. Yanagita, S. Sakti and S. Nakamura, "Neural iTTS: Toward Synthesizing Speech in Real-time with End-to-end Neural Text-to-Speech Framework", Proc. of SSW, Sep. 2019
    • I. Parmonangan, H. Tanaka, S. Sakti, Shinnosuke Takamichi, S. Nakamura, "Speech Quality Evaluation of Synthesized Japanese Speech Using EEG", Proc. of INTERSPEECH 2019, Graz, Austria, Sep. 2019
    • I. Parmonangan, H. Tanaka, Sakti Sakriani, Shinnosuke Takamichi, S. Nakamura, "EEG Analysis towards Evaluating Synthesized Speech Quality", IEEE Engineering in Medicine and Biology Society, Jul. 2019
    • M. Vetter, S. Sakti, S. Nakamura, "Cross-lingual speech-based ToBI label generation using bidirectional LSTM", Proc of International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2019
    • A. Tjandra, S. Sakti, S. Nakamura, "End-to-end feedback loss in speech chain framework via straight-through estimator", Proc. of International Conference on Acoustics, Speech and Signal Processing (ICASSP), May. 2019
    • H. Lovenia, H. Tanaka, S. Sakti, Ayu Purwarianti, S. Nakamura, "Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition", Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May. 2019
    • K. Yoshino, Y. Murase, N. Lubis, K. Sugiyama, H. Tanaka, S. Sakti, S. Takamichi and S. Nakamura, "Spoken Dialogue Robot for Watching Daily Life of Elderly People", Proc. of International Workshop on Spoken Dialogue Systems Technology (IWSDS) 2019, Apr. 2019
    Year 2018
    • B. Sisman, M. Zhang, S. Sakti, Haizhou Li, S. Nakamura, "Adaptive WaveNet Vocoder for Residual Compensation in GAN-based Voice Conversion", Proc. of IEEE Workshop on Spoken Language Technology 2018, Dec 2018
    • A. Tjandra, S. Sakti, S. Nakamura, "Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model", Proc. of IEEE Workshop on Spoken Language Technology 2018, Dec 2018
    • S. Nakayama, A. Tjandra, S. Sakti, S. Nakamura, "Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS", Proc. of IEEE Workshop on Spoken Language Technology 2018, Dec 2018
    • N. Hosomi, S. Sakti, K. Yoshino, S. Nakamura, "Deception Detection and Analysis in Spoken Dialogues based on fastText", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018, Nov. 2018
    • J. Effendi, S. Sakti, K. Sudoh, S. Nakamura, "Multi-paraphrase Augmentation to Leverage Neural Caption Translation", Proceedings of the 15th International Workshop on Spoken Language Translation, 181-188, Oct. 2018
    • K. Osamura, T. Kano, S. Sakti, Katsuhito Sudoh, S. Nakamura, "Using Spoken Word Posterior Features in Neural Machine Translation", Proceedings of the 15th International Workshop on Spoken Language Translation, 181-188, Oct. 2018
    • A. Tjandra, S. Sakti, and S. Nakamura, "Machine Speech Chain with One-shot Speaker Adaptation", Proc. of INTERSPEECH, Sep 2018
    • T. Mori, A. Tjandra, S. Sakti, and S. Nakamura, "Compressing End-to-end ASR Networks by Tensor-Train Decomposition", Proc. of INTERSPEECH, Sep 2018
    • Y. Tomoya, S. Sakti, and S. Nakamura, "Incremental TTS for Japanese Language", Proc. INTERSPEECH, Sep 2018
    • K. Nur'aini, J. Effendi, S. Sakti, Mirna Adriani and S. Nakamura, "Corpus Construction and Semantic Analysis of Indonesian Image Description", SLTU 2018, Aug. 2018
    • B. Wu, S. Sakti, J. Zhang and S. Nakamura, "Optimizing DPGMM Clustering InZero Resource Setting Based on Functional Load", SLTU 2018, Aug. 2018
    • A. Tjandra, S. Sakti and S. Nakamura, "Tensor Decomposition for Compressing Recurrent Neural Network", Proceedings of the international joint conference on neural networks 2018, pp. 4451-4458, 8 July 2018
    • N. Lubis, S. Sakti, K. Yoshino, S. Nakamura, "Unsupervised Counselor Dialogue Clustering for Positive Emotion Elicitation in Neural Dialogue System", Proceedings of the 19th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2018), 161-170, 12-14 July 2018
    • S. Novitasari, Q.-T. Do, S. Sakti, D.Lestari, and S. Nakamura, "Multi-Modal Multi-Task Deep Learning for Speaker and Emotion Recognition of TV-Series Data", OCOCOSDA, May 2018, Miyazaki, Japan
    • S. Nakayama, T. Kano, Q.-T. Do, S. Sakti, and S. Nakamura, "Japanese-English Code-Switching Speech Data Construction", OCOCOSDA, May 2018, Miyazaki, Japan
    • S. Novitasari, Q.-T. Do, S. Sakti, D. Lestari and S. Nakamura, "Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas", LREC, May 2018, Miyazaki, Japan
    • K. Yoshino, Y. Ishikawa, M. Mizukami, Y. Suzuki, S. Sakti and S. Nakamura, "Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing", LREC, May 2018, Miyazaki, Japan
    • M. Honda, H. Tanaka, S. Sakti, S. Nakamura, "Detecting Suppression of Negative Emotion by Time Series Change of Cerebral Blood Flow using fNIRS", IEEE International Conference on Biomedical and Health Informatics (BHI), pp. 398-401, Mar. 2018, Las Vegas, USA
    • H. Maki, H. Tanaka, S. Sakti, S. Nakamura, "Graph regularized tensor factorization for single-trial EEG analysis", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr.2018, Calgary, Canada
    • A. Tjandra, S. Sakti, S. Nakamura, "Sequence-to-sequence ASR Optimization via Reinforcement Learning", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr. 2018, Calgary, Canada
    • S. Novitasari, Q.-T. Do, S. Sakti, D. Lestari, and S. Nakamura, "Multi-Modal Multi-Task Deep Learning for Speaker and Emotion Recognition of TV-Series Data", OCOCOSDA, May 2018, Miyazaki, Japan
    • S. Nakayama, T. Kano, Q.-T. Do, S. Sakti, and S. Nakamura, "Japanese-English Code-Switching Speech Data Construction", OCOCOSDA, May 2018, Miyazaki, Japan
    • S. Novitasari, Q.-T. Do, S. Sakti, D. Lestari and S. Nakamura, "Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas", LREC, May 2018, Miyazaki, Japan
    • K. Yoshino, Y. Ishikawa, M. Mizukami, Y. Suzuki, S. Sakti and S. Nakamura, "Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing", LREC, May 2018, Miyazaki, Japan
    • N. Lubis, S. Sakti, K. Yoshino, S. Nakamura, "Eliciting Positive Emotion through Affect-Sensitive Dialogue Response Generation: A Neural Network Approach", Proceedings of the The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), New Orleans, USA, 2018
    Year 2017
    • M. Heck, S. Sakti, S. Nakamura, "Feature Optimized DPGMM Clustering for Unsupervised Subword Modeling: A Contribution to ZeroSpeech 2017", Proceedings of IEEE Automatic Speech Recognition and Understanding (ASRU) 2017, Dec. 2017
    • A. Tjandra, S. Sakti, S. Nakamura, "End-to-End Speech Recognition with Local Monotonic Attention", Proceedings of Machine Learning for Audio Signal Processing (ML4Audio), Dec. 2017
    • S. Shinagawa, K. Yoshino, S. Sakti, Y. Suzuki, S. Nakamura, 。ネInteractive Image Manipulation with Natural Language Instruction Commands", Proceedings of Visually-Grounded Interaction and Language (ViGIL) workshop, NIPS 2017 Workshop, Dec. 2017
    • A. Tjandra, S. Sakti, S. Nakamura, "Attention-based Wav2Text with Feature Transfer Learning", Proceedings of IEEE Automatic Speech Recognition and Understanding (ASRU) 2017, Dec. 2017
    • A. Tjandra, S. Sakti, S. Nakamura, "Listening while Speaking: Speech Chain by Deep Learning", Proceedings of IEEE Automatic Speech Recognition and Understanding (ASRU) 2017, Dec. 2017
    • A. Tjandra, S. Sakti, S. Nakamura, "End-to-End Speech Recognition with Local Monotonic Attention", Proceedings of Machine Learning for Audio Signal Processing (ML4Audio), Dec. 2017
    • N. Terasawa, H. Tanaka, S. Sakti, S. Nakamura, "Tracking Liking State in Brain Activity while Watching Multiple Movies", In Proceedings of the 19th ACM International Conference on Multimodal Interaction (ICMI'17), 321-325, Nov. 2017
    • A. Tjandra, S. Sakti, S. Nakamura, "Local Monotonic Attention Mechanism for End-to-End Speech and Language Processing", Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP 2017), 431-440, Nov. 2017
    • J. Effendi, S. Sakti, S. Nakamura, "Creation of a Multi-paraphrase Corpus based on Various Elementary Operations", Proc. of Oriental COCOSDA 2017, 177-182, Nov. 2017
    • N. Lubis, M. Heck, S. Sakti, K. Yoshino, S. Nakamura, "Processing Negative Emotions Through Social Communication: Multimodal Data Collection and Analysis" Proceedings of the seventh International Conference on Affective Computing and Intelligent Interaction (ACII 2017), 79-85, Oct. 2017
    • A. Tjandra, S. Sakti, S. Nakamura, "Speech Recognition Features Based On Deep Latent Gaussian Models", IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2017), Sep. 2017
    • K. Mukaihara, S. Sakti, S. Nakamura, "Recognizing Emotionally Coloured Dialogue Speech Using Speaker-Adapted DNN-CNN Bottleneck Features", Proc. SPECOM, pp. 632-641, Sep 2017
    • T. Kano, S. Sakti, S. Nakamura, "Structured-based Curriculum Learning for End-to-end English-Japanese Speech Translation", Proc. INTERSPEECH, Aug. 2017
    • Q.-T. Do, S. Sakti, S. Nakamura, "Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis", Proc. INTERSPEECH, Aug. 2017
    • H. Watanabe, H. Tanaka, S. Sakti, S. Nakamura, "Subject-independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response during Speech Perception", Proc. INTERSPEECH, 2431-2435, Aug. 2017
    • N. Terasawa, H. Tanaka, S. Sakti, S. Nakamura, "EEG-based Emotional State Tracking during Watching Movie considering Self-Assessment Manikin", Proceedings of the 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC2017), Jul. 2017
    • N. Lubis, S. Sakti, K. Yoshino, S. Nakamura, "Eliciting Positive Emotional Impact in Dialogue Response Selection", 8th International Workshop on Spoken Dialog Systems (IWSDS2017), Jun. 2017
    • A. Tjandra, S. Sakti, S. Nakamura, "Compressing Recurrent Neural Network with Tensor Train", Proc. IJCNN 2017, 4451-4458, May. 2017
    Year 2016
    • S. Sakti, S. Kawanishi, G. Neubig, K. Yoshino, S. Nakamura, "Deep Bottleneck Features And Sound-Dependent i-Vectors for Simultaneous Recognition of Speech and Environmental Sounds", Proc. IEEE SLT, pp . 35-42, Dec 2016
    • M. Heck, S. Sakti, S. Nakamura, "Iterative Training of A DPGMM-HMM Acoustic Unit Recognizer in A Zero Resource Scenario", Proc. IEEE SLT, pp . 57-63, Dec 2016
    • R. Hiraoka, H. Tanaka, S. Sakti, G. Neubig, S. Nakamura, "Personalized Unknown Word Detection in Non-native Language Reading using Eye Gaze", ACM International Conference on Multimodal Interaction (ICMI), pp.66-70, Nov. 2016
    • S. Tsujioka, S. Sakti, K. Yoshino, G. Neubig, S. Nakamura, "Unsupervised Joint Estimation of Grapheme-to-Phoneme Conversion Systems and Acoustic Model Adaptation for Non-Native Speech Recognition", InterSpeech, pp. 3091 - 3095, Sep. 2016
    • M. Heck, S. Sakti, S. Nakamura, "Supervised Learning of Acoustic Models in a Zero Resource Setting to Improve DPGMM Clustering", InterSpeech, pp. 1310 - 1314, Sep. 2016
    • Q.-T. Do, S. Sakti, G. Neubig, S. Nakamura, "Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models", InterSpeech, pp. 2533 - 2537, Sep. 2016
    • Q.-T. Do, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training", InterSpeech, pp. 3196 - 3200, Sep. 2016
    • H. Tanaka, S. Sakti, G. Neubig, H. Negoro, H. Iwasaka, S. Nakamura, "Automated Social Skills Training with Audiovisual Information", International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 2262-2265, Aug. 2016
    • H. Maki, T. Toda, S. Sakti, G. Neubig, and S. Nakamura, "Removing Noise from Event-Related Potentials using a Probabilistic Generative Model with Grouped Covariance Matrices", International Conference of the IEEE Engineering in Medicine and Biology Society, pp.3728-3731, Aug. 2016
    • A. Tjandra, S. Sakti, R. Manurung, M. Adriani and S. Nakamura, "Gated Recurrent Neural Tensor Network", Proceedings of The 2016 International Joint Conference on Neural Networks (IJCNN 2016), July 2016
    • N. Lubis, R. Gomez, S. Sakti, K. Nakamura, K. Yoshino, S. Nakamura and K. Nakadai, "Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition", Proceedings of The Tenth International Conference on Language Resources and Evaluation (LREC 2016), May 2016
    • M. Heck, S. Sakti, S. Nakamura, "Unsupervised Linear Discriminant Analysis for Supporting DPGMM Clustering in the Zero Resource Scenario", International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU), Procedia Computer Science, Elsevier, Volume 81, 2016, pp.73-79, May 2016
    Year 2015
    • N. Lubis, S. Sakti, G. Neubig, K. Yoshino, T. Toda, S. Nakamura, "A Study of Social-Affective Communication: Automatic Prediction of Emotion Triggers and Responses in Television Talk Shows", in Proc. ASRU, Scottsdale, Arizona, USA, December 2015
    • Q.-T. Do, M. Heck, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "The NAIST ASR System for the 2015 Multi-genre Broadcast Challenge: On Combination of Deep Learning Systems using a Rank-score Function", in Proc. ASRU, Scottsdale, Arizona, USA. December 2015
    • S. Sakti, F. Ilham, G. Neubig, T. Toda, A. Purwarianti, S. Nakamura, "Incremental Sentence Compression using LSTM Recurrent Neural Networks", in Proc. ASRU, Scottsdale, Arizona, USA, December 2015
    • M. Mizukami, H. Kizuki, T. Nomura, G. Neubig, K. Yoshino, S. Sakti, T. Toda, S. Nakamura, "Adaptive Selection from Multiple Reponse Candidates in Example-based Dialogue", in Proc. ASRU, Scottsdale, Arizona, USA, December 2015
    • S. Sakti, O. Shagdar, F. Nashashibi, S. Nakamura, "Context Awareness and Priority Control for ITS based on Automatic Speech Recognition", in Proc. 14th International Conference on ITS Telecommunications (ITST), Copenhagen, Denmark, December 2015
    • Y. Oda, H. Fudaba, G. Neubig, H. Hata, S. Sakti, T. Toda, S. Nakamura, "Learning to Generate Pseudo-code from Source Code using Statistical Machine Translation", in Proc. ASE, Lincoln, Nebraska, USA. November 2015
    • H. Fudaba, Y. Oda, K. Akabe, G. Neubig, H. Hata, S. Sakti, T. Toda, S. Nakamura, "Pseudogen: A Tool to Automatically Generate Pseudo-code from Source Code", in Proc. ASE Tool Demos, Lincoln, Nebraska, USA. November 2015
    • K. Tanaka, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction", in Proc. ASSETS (Poster/Demo Track), Lisbon, Portugal. October 2015.
    • T. Mieno, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Speed or Accuracy? A Study in Evaluation of Simultaneous Speech Translation", in Proc. INTERSPEECH, Dresden, Germany. September 2015, pp.2267-2271.
    • P. Tobing, K. Kobayashi, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Articulatory Controllable Speech Modification based on Gaussian Mixture Models with Direct Waveform Modification using Spectrum Differential", in Proc. INTERSPEECH, Dresden, Germany. September 2015.
    • Q.-T. Do, S. Takamichi, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Preserving Word-level Emphasis in Speech-to-speech Translation using Linear Regression HSMMs", in Proc. INTERSPEECH, Dresden, Germany. September 2015.
    • K. Kobayashi, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Statistical Singing Voice Conversion based on Direct Waveform Modification with Global Variance", in Proc. INTERSPEECH, Dresden, Germany. September 2015.
    • Y. Oshima, S. Takamichi, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Non-native Speech Synthesis Preserving Speaker Individuality based on Partial Correction of Prosodic and Phonetic Characteristics", in Proc. INTERSPEECH, Dresden, Germany. September 2015
    • T. Nguyen, G. Neubig, H. Shindo, S. Sakti, T. Toda, S. Nakamura, "A Latent Variable Model for Joint Pause Prediction and Dependency Parsing", in Proc. INTERSPEECH, Dresden, Germany. September 2015
    • Y. Tajiri, K. Tanaka, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Non-Audible Murmur Enhancement based on Statistical Conversion using Air- and Body-Conductive Microphones in Noisy Environments", in Proc. INTERSPEECH, Dresden, Germany. September 2015.
    • K. Sugiyama, M. Mizukami, G. Neubig, K. Yoshino, S. Sakti, T. Toda, S. Nakamura, "An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering", in Proc. WMT. Lisboa, Portugal. September 2015
    • Y. Nishigaki, S. Takamichi, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Prosody-Controllable HMM-based Speech Synthesis Using Speech Input", in Proc. MLSLP, Aizu, Japan. September 2015
    • H. Maki, T. Toda, S. Sakti, G. Neubig, S. Nakamura, "Evaluation of EEG Ocular Artifact Removal with a Multi-channel Wiener Filter Based on Probabilistic Generative Model", in Proc. EMBC, Milan, Italy, August 2015
    • Y. Oda, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Syntax-based Simultaneous Translation through Prediction of Unseen Syntactic Constituents", in Proc. ACL, Beijing, China, July 2015
    • Akiva Miura, Graham Neubig, S. Sakti, Tomoki Toda, S. Nakamura, 。ネImproving Pivot Translation by Remembering the Pivot", in Proc. ACL, Beijing, China, July 2015
    • Y. Oda, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Ckylark: A More Robust PCFG-LA Parser", in Proc. NAACL (Demo Track), Denver, USA, May 2015
    • A. Tjandra, S. Sakti, G. Neubig, T. Toda, M. Adriani, S. Nakamura, "Combination of Two-dimensional Cochleogram and Spectrogram Features for Deep Learning-based ASR", in Proc ICASSP, Brisbane, Australia. April 2015
    • H. Maki, T. Toda, S. Sakti, G. Neubig, S. Nakamura, "EEG Signal Enhancement using Multichannel Wiener Filter with a Spatial Correlation Prior", in Proc. ICASSP, Brisbane, Australia. April 2015
    • H. Tanaka, S. Sakti, G. Neubig, T. Toda, H. Negoro, H. Iwasaka, S. Nakamura, "Automated Social Skills Trainer", in Proc. IUI, Atlanta, USA. March 2015
    • M. Mizukami, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Linguistic Individuality Transformation for Spoken Language", in Proc. IWSDS, Busan, Korea, January 2015
    • T. Hiraoka, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Evaluation of a Fully Automatic Cooperative Persuasive Dialoguea System", in Proc. IWSDS, Busan, Korea, January 2015
    • F. Koto, S. Sakti, G. Neubig, T. Toda, M. Adriani, S. Nakamura, "A Study On Natural Expressive Speech: Automatic Memorable Spoken Quote Detection", in Proc. IWSDS, Busan, Korea, January 2015
    • Y. Tsunomori, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "An Analysis Towards Dialogue-based Deception Detection", in Proc. IWSDS, Busan, Korea, January 2015
    • T. Sasakura, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Unknown Word Detection based on Event-Related Brain Desynchronization Responses", in Proc. IWSDS, Busan, Korea. January 2015
    Year 2014
    • R. Yoshida, T. Hiraoka, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Unnecessary Utterance Detection for Avoiding Digressions in Discussion", in Proc. APSIPA, pp. to appear, Siem Reap, Cambodia, December 2014.
    • K. Kobayashi, T. Toda, Tomoyasu Nakano, Masataka Goto, G. Neubig, S. Sakti, S. Nakamura, "Gender-dependent Spectrum Differential Models for Perceived Age Control based on Direct Waveform Modification in Singing Voice Conversion", in Proc. APSIPA, pp. to appear, Siem Reap, Cambodia, December 2014.
    • K. Tanaka, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "An Inter-Speaker Evaluation through Simulation of Electrolarynx Control based on Statistical F0 Prediction", in Proc. APSIPA, pp. to appear, Siem Reap, Cambodia, December 2014.
    • S. Tsuruta, K. Tanaka, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "An Evaluation of Target Speech for a Nonaudible Murmur Enhancement System in Noisy Environments", in Proc. APSIPA, pp. to appear, Siem Reap, Cambodia, December 2014.
    • S. Sakti, Y. Odagaki, T. Sasakura, G. Neubig, T. Toda, S. Nakamura, "An Event-Related Brain Potential Study on the Impact of Speech Recognition Errors", in Proc. APSIPA, pp. to appear, Siem Reap, Cambodia, December 2014.
    • F. Koto, S. Sakti, G. Neubig, T. Toda, M. Adriani, S. Nakamura, "The Use of Semantic and Acoustic Features for Open-Domain TED Talk Summarization", in Proc. APSIPA, pp. to appear, Siem Reap, Cambodia, December 2014.
    • L. Nio, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Recursive Neural Network Paraphrase Identification for Example-based Dialog Retrieval", in Proc. APSIPA, pp. to appear, Siem Reap, Cambodia, December 2014.
    • N. Lubis, D. Lestari, A. Purwarianti, S. Sakti, S. Nakamura, "Emotion Recognition on Indonesian Television Talk Shows", in Proc. IEEE SLT, pp. to appear, Lake Tahoe, USA, December 2014.
    • L. Nio, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Improving the Robustness of Example-based Dialog Retrieval using Recursive Neural Network Paraphrase Identification", in Proc. IEEE SLT, pp. to appear, Lake Tahoe, USA, December 2014.
    • Y. Hatakoshi, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Rule-based Syntactic Preprocessing for Syntax-based Machine Translation", in Proc. SSST. Doha, Qatar, October 2014.
    • N. Jinbo, S. Takamichi, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "A Hearing Impairment Simulation Method Using Audiogram-based Approximation of Auditory Characteristics", in Proc. INTERSPEECH, Singapore, September 2014.
    • K. Kubo, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Structured Soft Margin Confidence Weighted Learning for Grapheme-to-Phoneme Conversion", in Proc. INTERSPEECH, Singapore, September 2014.
    • K. Tanaka, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Direct F0 Control of an Electrolarynx based on Statistical Excitation Feature Prediction and its Evaluation through Simulation", in Proc. INTERSPEECH, Singapore, September 2014.
    • K. Kobayashi, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Statistical Singing Voice Conversion with Direct Waveform Modification based on the Spectrum Differential", in Proc. INTERSPEECH, Singapore, September 2014.
    • S. Matsumiya, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Data-Driven Generation of Text Balloons based on Linguistic and Acoustic Features of a Comics-Anime Corpus", in Proc. INTERSPEECH, Singapore, September 2014.
    • P.-L. Tobing, T. Toda, G. Neubig, S. Sakti, S. Nakamura, A. Purwarianti, "Articulatory Controllable Speech Modification Based on Statistical Feature Mapping with Gaussian Mixture Models", in Proc. INTERSPEECH, Singapore, September 2014.
    • M. Mizukami, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Building a Free, General-Domain Paraphrase Database for Japanese", in Proc. Oriental COCOSDA, Phuket, Thailand, September 2014.
    • Lasguido Nio, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Conversation Dialog Corpora from Drama Television and Movie Scripts", in Proc. Oriental COCOSDA, Phuket, Thailand, September 2014.
    • F. Koto, S. Sakti, G. Neubig, T. Toda, G. Neubig, S. Nakamura, "Memorable Spoken Quote Corpora of TED Public Speaking", in Proc. Oriental COCOSDA, Phuket, Thailand, September 2014.
    • D.-Q. Truong, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Collection and Analysis of a Japanese-English Emphasized Speech Corpus", in Proc. Oriental COCOSDA, Phuket, Thailand, September 2014.
    • N. Lubis, D. Lestari, A. Purwarianti, S. Sakti, S. Nakamura, "Construction and Analysis of Indonesian Emotional Speech Corpus", in Proc. Oriental COCOSDA, Phuket, Thailand, September 2014.
    • K. Akabe, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Discriminative Language Models as a Tool for Machine Translation Error Analysis", in Proc. COLING, Dublin, Ireland, August 2014.
    • T. Hiraoka, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Reinforcement Learning of Cooperative Persuasive Dialogue Policies using Framing", in Proc. COLING, Dublin, Ireland, August 2014.
    • H. Maki, T. Toda, S. Sakti, G. Neubig, S. Nakamura, "Probabilistic Enhancement of EEG Component Using Prior Information of Component-Related Spatial Correlation", in Proc. IEEE EMBC, Chicago, USA, August 2014. Hiroki Tanaka, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Linguistic and Acoustic Features for Automatic Identification of Autism Spectrum Disorders in Children's Narrative" , in Proc. ACL Workshop on Computational Linguistics and Clinical Psychology. Baltimore, USA, June 2014.
    • Y. Oda, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Optimizing Segmentation Strategies for Simultaneous Speech Translation", in Proc. ACL, Baltimore, USA, June 2014.
    • S. Sakti, K. Kubo, S. Matsumiya, G. Neubig, T. Toda, S. Nakamura, F. Adachi, R. Isotani, "Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and a Network-based ASR System", in Proc. LREC, Reykjavik, Iceland, May 2014.
    • H. Shimizu, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Collection of a Simultaneous Translation Corpus for Comparative Analysis" in Proc. LREC. Reykjavik, Iceland, May 2014.
    • K. Tanaka, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "An Evaluation of Excitation Feature Prediction in A Hybrid Approach to Electrolaryngeal Speech Enhancement", in Proc. ICASSP, Florence, Italy, May 2014.
    • K. Kubo, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Narrow Adaptive Regularization of Weights for Grapheme-to-Phoneme Conversion", in Proc. ICASSP, Florence, Italy, May 2014.
    • K. Kobayashi, T. Toda, T. Nakano, Masataka Goto, G. Neubig, S. Sakti, S. Nakamura, "Regression Approaches to Perceptual Age Control in Singing Voice Conversion", in Proc. ICASSP, Florence, Italy, May 2014.
    • S. Takamichi, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "A Postfilter to Modify the Modulation Spectrum in HMM-based Speech Synthesis", in Proc. ICASSP, Florence, Italy, May 2014.
    • S. Sakti, S. Nakamura, "Recent Progress in Developing Grapheme-based Speech Recognition for Indonesian Ethnic Languages: Javanese, Sundanese, Balinese, and Bataks", in Proc. SLTU, St. Petersburg, Russia, May 2014.
    • H.-T. Vu, G. Neubig, S. Sakti, T. Toda, S. Nakamura. "Acquiring a Dictionary of Emotion-Provoking Events", in Proc. EACL, Gothenburg, Sweden, April 2014.
    • T. Hiraoka, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Construction and Analysis of a Persuasive Dialogue Corpus", in Proc. IWSDS, Napa, California, USA, January 2014.
    • N. Lubis, S. Sakti, G. Neubig, T. Toda, A. Purwarianti, S. Nakamura, "Emotion and Its Triggers in Human Spoken Dialogue: Recognition and Analysis", in Proc. IWSDS, Napa, California, USA, January 2014.
    Year 2013
    • T. Hiraoka, Y. Yamauchi, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Dialogue Management for Leading the Conversation in Persuasive Dialogue Systems", in Proc. ASRU, Olomouc, Czech Republic, December 2013.
    • H. Tanaka, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Modality and Contextual Differences in Computer Based Non-verbal Communication Training", in Proc. IEEE CogInfoCom, Budapest, Hungary, December 2013.
    • H. Shimizu, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Constructing a Speech Translation System using Simultaneous Interpretation Data", in Proc. IWSLT, Heidelberg, Germany, December 2013.
    • S. Sakti, K. Kubo, G. Neubig, T. Toda, S. Nakamura, "The NAIST English Speech Recognition System for IWSLT 2013", in Proc. IWSLT, Heidelberg, Germany, December 2013.
    • S. Sakti, S. Nakamura, "Towards Language Preservation: Design and Collection of Graphemically Balanced and Parallel Speech Corpora of Indonesian Ethnic Languages", in Proc. Oriental COCOSDA, Gurgaon, India, November 2013.
    • G. Neubig, S. Sakti, T. Toda, S. Nakamura, Y. Matsumoto, R. Isotani, Y. Ikeda, "Towards High-Reliability Speech Translation in the Medical Domain", in Proc. MedNLP, Nagoya, Japan, October 2013.
    • P. Arthur, G. Neubig, S. Sakti, T. Toda, and S. Nakamura, "Inter-Sentence Features and Thresholded Minimum Error Rate Training: NAIST at CLEF 2013 QA4MRE", in Proc. CLEF, Valencia, Spain, September 2013.
    • T. Kano, S. Takamichi, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Generalizing Continuous-space Translation of Paralinguistic Information", in Proc. INTERSPEECH, Lyon, France, August 2013.
    • T. Fujita, G. Neubig, S. Sakti, T. Toda, S. Nakamura. "Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation", in Proc. INTERSPEECH, Lyon, France, August 2013.
    • M. Ohgushi, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "An Empirical Comparison of Joint Optimization Techniques for Speech Translation", in Proc. INTERSPEECH, Lyon, France, August 2013.
    • S. Takamichi, T. Toda, Y. Shiga, S. Sakti, G. Neubig, S. Nakamura, "Improvements to HMM-Based Speech Synthesis Based on Parameter Generation with Rich Context Models", in Proc. INTERSPEECH, Lyon, France, August 2013.
    • K. Tanaka, T. Toda, G. Neubig, S. Sakti, S. Nakamura,"A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Spectral Subtraction and Statistical Voice Conversion", in Proc. INTERSPEECH, Lyon, France, August 2013.
    • K. Kubo, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Grapheme-to-phoneme Conversion based on Adaptive Regularization of Weight Vectors", in Proc. INTERSPEECH, Lyon, France, August 2013.
    • K. Kobayashi, H. Doi, T. Toda, T. Nakano, M. Goto, G. Neubig, S. Sakti, S. Nakamura, "An Investigation of Acoustic Features for Singing Voice Conversion based on Perceptual Age", in Proc. INTERSPEECH, Lyon, France, August 2013.
    • T. Moriguchi, T. Toda, M. Sano, Hiroshi Sato, G. Neubig, S. Sakti, S. Nakamura, "A Digital Signal Processor Implementation of Silent/Electrolaryngeal Speech Enhancement based on Real-Time Statistical Voice Conversion", in Proc. INTERSPEECH, Lyon, France, August 2013.
    • T. Inukai, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric", in Proc. ISCA SSW. Barcelona, Spain, August 2013.
    Year 2012
    • A. Sani, S. Sakti, G. Neubig, T. Toda, A. Mulyanto, S. Nakamura, "Towards Language Preservation: Preliminary Collection and Vowel Analysis of Indonesian Ethnic Speech Data", in Proc. Oriental COCOSDA, pp. 128-122, Macau, China, December 2012.
    • Lasguido, S. Sakti, G. Neubig, T. Toda, Mirna Adriani, S. Nakamura, "Developing Non-Goal Dialog System based on Examples of Drama Television", in Proc. IWSDS, pp. 315-320, Paris, France, December 2012.
    • H. Tanaka, S. Sakti, G. Neubig, T. Toda, N. Campbell, S. Nakamura, "Non-verbal Cognitive Skills and Autistic Conditions: An Analysis and Training Tool", in Proc. IEEE CogInfoCom, pp. 41-46, Kosice, Slovakia, December 2012.
    • T. Kano, S. Sakti, S. Takamichi, G. Neubig, T. Toda, S. Nakamura. "A Method for Translation of Paralinguistic Information", in Proc. IWSLT, pp. 158-163, Hong Kong, December 2012.
    • G. Neubig, K. Duh, M. Ogushi, T. Kano, T. Kiso, S. Sakti, T. Toda, S. Nakamura, "The NAIST Machine Translation System for IWSLT 2012", in Proc. IWSLT, pp. 54-60, Hong Kong, December 2012.
    • M. Heck, K. Kubo, M. Sperber, S. Sakti, S. Stueker, C. Saam, K. Kilgour, C. Mohr, G. Neubig, T. Toda, S. Nakamura, A. Waibel, "The KIT-NAIST (Contrastive) English ASR System for IWSLT 2012", in Proc. IWSLT, pp.91-95 Hong Kong, December 2012.
    • C. Saam, C. Mohr, K. Kilgour, M. Heck, M. Sperber, K. Kubo, S. St将臾er, S. Sakti, G. Neubig, T. Toda, S. Nakamura, A. Waibel, "The 2012 KIT and KIT-NAIST English ASR Systems for the IWSLT Evaluation", in Proc. IWSLT, pp. 87-90, Hong Kong, December 2012.
    • M. Kishimoto, T. Toda, H. Doi, S. Sakti, S. Nakamura, "Model training using parallel data with mismatched pause positions in statistical esophageal speech enhancement", in Proc. ICSP, pp. 590-594, Beijing, China, Oct. 2012.
    • S. Takamichi, T. Toda, Y. Shiga, H. Kawai, S. Sakti, S. Nakamura, "An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-based Speech Synthesis", in Proc. INTERSPEECH, Portland, USA, pp. 2614-2618, September 2012.
    • Y. Yamauchi, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "Answer Sentence Generation using Relationships between Words for Guiding Users to New Topics in Spoken Dialog Systems", in Proc. ASJ Autumn Meeting, pp. 81-82, September 2012. [In JAPANESE]
    • T. Hiraoka, G. Neubig, S. Sakti, T. Toda, S. Nakamura, "A Study on Dialog Management in Persuasive Dialog Systems", in Proc. ASJ Autumn Meeting, pp. 83-84, Nagano, Japan, September 2012. [In JAPANESE]
    • T. Kano, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "A Duration-Sensitive Speech Translation System", in Proc. ASJ Autumn Meeting, pp. 181-182, Nagano, Japan, September 2012. [In JAPANESE]
    • T. Moriguchi, T. Toda, M. Sano, H. Sato, G. Neubig, S. Sakti, S. Nakamura, "Implementation of Real-Time Body-Conducted Voice Conversion on DSP", in Proc. ASJ Autumn Meeting, pp. 217-218, Nagano, Japan, September 2012. [In JAPANESE]
    • S. Takamichi, T. Toda, Y. Shiga, S. Sakti, G. Neubig, S. Nakamura, "A Study on a Selection Method of Rich Context Models in HMM-based Speech Synthesis",in Proc. of ASJ Autumn Meeting, pp. 273-274, Nagano, Japan, September 2012. [In JAPANESE]
    • Tatsuo Inukai, T. Toda, G. Neubig, S. Sakti, S. Nakamura, "Spectral Parameter Variation Between Utterances of the Same Sentence by a Single Speaker and Its Prediction" in Proc. ASJ Autumn Meeting, pp. 291-292, Nagano, Japan, September 2012. [In JAPANESE]
    • S. Ishii, T. Toda, H. Saruwatari, S. Sakti, S. Nakamura, "Stereo Signal Integration in Blind Noise Suppression for Non-Audible Murmur Recognition", in Proc. ASJ Spring Meeting, pp. 27-28, Yokohama, Japan, March 2012. [In JAPANESE]
    • S. Takamichi, T. Toda, Y. Shiga, H. Kawai, S. Sakti, S. Nakamura, "A Study on the Effectiveness of Full-context Models with Tied-covariance Matrices in HMM-based Speech Synthesis", in Proc. of ASJ Spring Meeting, pp. 301-302, Yokohama, Japan, March 2012. [In JAPANESE]
    • M. Kishimoto, H. Doi, T. Toda, S. Sakti, S. Nakamura, "Model Training using Training Data Including Mismatched Pause Positions in Statistical Esophageal Speech Enhancement" in Proc. ASJ Spring Meeting, pp. 367-368, Yokohama, Japan, March 2012.
    Year 2011
    • S. Ishii, T. Toda, H. Saruwatari, S. Sakti, S. Nakamura, "Blind Noise Suppression for Non-Audible Murmur Recognition with Stereo Signal Processing", in Proc. ASRU, pp. 494-499, Hawaii, USA, December 2011
    • Y. Yamauchi, K. Sugiura, N. Iwahashi, S. Sakti, T. Toda, S. Nakamura, "Motion Generation and Obstacle Avoidance Using HMMs in Object Manipulation Tasks", in Proc. SICE System Integration Division, pp. 1614-1617, Kyoto, Japan, December 2011.
    • S. Ishii, T. Toda, H. Saruwatari, S. Sakti, S. Nakamura, "Blind noise suppression for non-audible murmur recognition with stereo signal processing", in Proc. ASRU, pp. 494-499, Hawaii, USA, December 2011.
    • S. Sakti, A. Finch, C. Hori, H. Kashioka, S. Nakamura, "Conditional Random Fields for Modeling Korean Pronunciation Variation", in Proc. IWSDS: Workshop on Paralinguistic Information and its Integration in Spoken Dialogue Systems, Springer, pp. 49-54, Granada, Spain, September 2011.
    • S. Sakti, A. Finch, R. Isotani, H. Kawai, S. Nakamura, "Unsupervised Determination of Efficient Korean LVCSR Units Using a Bayesian Dirichlet Process Model", in Proc. ICASSP, pp. 4664-4667, Prague, Czech Republic, May 2011.
    Year 2010
    • S. Sakti, R. Isotani, H. Kawai, S. Nakamura, "The Use of Indonesian Speech Corpora for Developing Filipino Continuous Speech Recognition System", in Proc. O-COCOSDA, pp. 56-61, Kathmandu, Nepal, November 2010.
    • S. Sakti, R. Isotani, H. Kawai, S. Nakamura, "Korean Pronunciation Variation Modeling with Probabilistic Bayesian Network", in Proc. IUCS, pp. 52-57, Beijing, China, October 2010.
    • H. Hofmann, S. Sakti, R. Isotani, H. Kawai, S. Nakamura, W. Minker, "Improving Spontanoues English ASR Using a Joint-Sequence Pronunciation Model", in Proc. IUCS, pp. 58-61, Beijing, China, October 2010.
    • S. Sakti, R. Isotani, H. Kawai, S. Nakamura, "Utilizing a Noisy-Channel Approach for Korean LVCSR", in Proc. INTERSPEECH, pp. 1513-1516, Makuhari, Japan, September 2010.
    • K. Abe, S. Sakti, H. Kawai, R. Isotani, S. Nakamura, "Brazilian Portuguese Acoustic Model Training Based on Data Borrowing From Other Languages", in Proc. INTERSPEECH, pp. 861-864, Makuhari, Japan, September 2010.
    • S. Sakti, S. Sakai, R. Isotani, H. Kawai, S. Nakamura, "Quality and Intelligibility Assessment of Indonesian HMM-Basaed Speech Synthesis System", in Proc. MALINDO, pp. 51-57, Jakarta, Indonesia, August 2010.
    • K. Abe, S. Sakti, H. Kawai, R. Isotani, S. Nakamura, "Acoustic Model Training for Portuguese Speech Recognition System", in Proc. ASJ Spring Meeting, pp. 221-222, Tokyo, Japan, March 2010.
    Year 2009
    • S. Sakti, N. Kimura, M. Paul, C. Hori, E. Sumita, S. Nakamura, J. Park, C. Wutiwiwachai, B. Xu, H. Riza, K. Arora, C. Luong and H. Li, "The Asian Network-based Speech-to-Speech Translation System", in Proc. ASRU, pp. 507-512, Merano, Italy, December 2009.
    • S. Sakti, M. Paul, R. Maia, S.Sakai, N. Kimura, Y. Ashikari, E. Sumita, S. Nakamura, "Toward Translating Indonesian Spoken Utterances to/from Other Languages", in Proc. O-COCOSDA, pp. 137-142, Beijing, China, August 2009.
    • S. Sakti, T. Vu, A. Finch, M. Paul, R. Maia, S. Sakai, T. Hayashi, S. Matsuda, N. Kimura, Y. Ashikari, E. Sumita, S. Nakamura, "NICT/ATR Asian Spoken Language Translation System for Multi-Party Travel Conversation", in Proc. TCAST, pp. 26-30, Singapore, August 2009.
    • S. Sakti, M. Paul, R. Maia, S.Sakai, N. Kimura, Y.Ashikari, E. Sumita, S. Nakamura, "Development of Indonesian Spoken Language Technologies for Multilingual Speech-to-Speech Translation System", in Proc. MALINDO, pp. 49-54, Singapore, August 2009.
    • S. Sakti, R. Maia, S. Sakai, T. Shimizu, S. Nakamura, "HMM-based Speech Synthesis of Indonesian Language", in Proc. ASJ Spring Meeting, pp. 301-302, Tokyo, Japan, March 2009.
    Year 2008
    • S. Sakti, K. Markov, S. Nakamura, "Probabilistic Pronunciation Variation Model Based on Bayesian Network for Conversational Speech Recognition", in Proc. ISUC, pp. 405-410, Osaka, Japan, December 2008.
    • S. Sakti, R. Maia, S. Sakai, T. Shimizu, S. Nakamura, "Development of HMM-based Indonesian Speech Synthesis", in Proc. O-COCOSDA, pp. 215-220, Kyoto, Japan, November, 2008.
    • S. Sakti, E. Kelana, H. Riza, S. Sakai, K. Markov, S. Nakamura, "Recent Progress in Developing Indonesian Large-Vocabulary Corpora and LVCSR System", pp. 40-45, Cyberjaya-Selangor, Malaysia, June, 2008.
    • S. Sakti, E. Kelana, H. Riza, S. Sakai, K. Markov, S. Nakamura, "Development of Indonesian Large Vocabulary Continuous Speech Recognition System within A-STAR Project", in Proc. TCAST, pp. 19-24, Hyderabad, India, January 2008.
    Year 2007
    • S. Sakti, E. Kelana, H. Riza, S. Nakamura, "Large Vocabulary ASR for Indonesian Language in the A-STAR Project", in Proc. ASJ Autumn Meeting, pp. 47-48, Yamanashi, Japan, 2007.
    • S. Nakamura, E. Sumita, T. Shimizu, S. Sakti, S. Sakai, J. Zhang, A. Finch, N. Kimura, Y. Asikari, "A-STAR: Asia Speech Translation Consortium", in Proc. ASJ Autumn Meeting, pp. 45-46, Yamanashi, Japan, 2007.
    • S. Sakti, K. Markov, S. Nakamura, "A method to Integrate Additional Knowledge Sources Into HMM Based on Junction Tree Decomposition", in Proc. EUSIPCO, pp. 2404-2408, Poznan, Poland, 2007.
    • S. Sakti, K. Markov, S. Nakamura, "An HMM Acoustic Model Incorporating Various Additional Knowledge Sources", in Proc. INTERSPEECH, pp. 2117-2120, Antwerp, Belgium, 2007.
    • S. Sakti, K. Markov, S. Nakamura, "Utilizing Junction Tree Decomposition for Incorporating Accent, Gender, and Wide-Context Dependency Information", in Proc. ASJ Spring Meeting, pp.45-46, Tokyo, Japan, 2007.
    Year 2006
    • S. Sakti, K. Markov, S. Nakamura, "Utilizing Bayesian Network and Junction Tree Decomposition for Incorporating Additional Knowledge Sources into a Statistical Acoustic Model", in Proc. 8th IEICE Speech Processing Symposium, pp.7-12, Nagoya, Japan, 2006
    • S. Sakti, K. Markov, S. Nakamura, "The Use of Bayesian Network for Incorporating Accent, Gender and Wide-Context Dependency Information", in Proc. ICSLP, pp. 1563-1566, Pittsburgh PA, USA, 2006.
    • S. Sakti, K. Markov, S. Nakamura, "Incorporation of Pentaphone-Context Dependency Based on Hybrid HMM/BN Acoustic Modeling Framework", in Proc. ICASSP, pp. 1177-1180, Toulouse, France, 2006.
    • S. Sakti, K. Markov, S. Nakamura, "A Hybrid Pentaphone HMM/BN Acoustic Model", in Proc. ASJ Spring Meeting, pp. 41-42, Tokyo, Japan, 2006.
    Year 2005
    • S. Sakti, K. Markov, S. Nakamura, "Rapid Development of Initial Indonesia Phoneme-Based Speech Recognition Using Cross-Language Approach", in Proc. O-COCOSDA, pp. 38-43, Jakarta, Indonesia, 2005.
    • S. Sakti, K. Markov, S. Nakamura, "Modeling Quasi-Pentaphone Units with the Hybrid HMM/BN Acoustic Model", in Proc. SPECOM, pp. 135-138, Patras, Greece, 2005.
    • S. Sakti, S. Nakamura, K. Markov, "Composing a Wide Phonetic Context Unit based on Bayesian Framework", in Proc. ASJ Autumn Meeting, pp. 99-100, Sendai, Japan, 2005.
    • S. Sakti, S. Nakamura, K. Markov, "Incorporating a Bayesian Wide Phonetic Context Model for Acoustic Rescoring", in Proc. EUROSPEECH, pp. 1629-1632, Lisbon, Portugal, 2005.
    Year 2004
    • S. Sakti, P. Hutagaol, A. Arman, S. Nakamura, "Development of Speech Corpus and Speech Recognition System for Indonesian Language", in Proc. IEICE, Kyoto, Japan, 2004.
    • S. Sakti, P. Hutagaol, A. Arman, S. Nakamura, "Indonesian Speech Recognition for Hearing and Speaking Impaired People", in Proc. ICSLP, Jeju, Korea, 2004.
    Year 2003
    • S. Sakti, S. Nakamura. "A View of Automatic Speech Recognition", BPPT Seminar on Language Technology, Jakarta-Indonesia, September 2003.