Acoustic Information Science, Unoki Laboratory

Publication (by Contents, by Time, by Field)

Paper
International conference
Invited lecture
Workshop in Japan
Verbal presentation
Gloss
Books
Other
JAIST Technical Report
Patent

Paper

Akagi, M. (1993). "Modeling of contextual effects based on spectral peak interaction", J. of Acoust. Society of America, 93, 2, 1076-1086.
Kitamura, T. and Akagi, M. (1995). "Speaker individualities in speech spectral envelopes", J. Acoust. Soc. Jpn. (E), 16, 5, 283-289.
Akagi, M. and Ienaga, T. (1997). "Speaker individuality in fundamental frequency contours and its control", J. Acoust. Soc. Jpn. (E), 18, 2 73-80.
Unoki, M. and Akagi, M. (1997). ��A method for signal extraction from noise-added signals��, Electronics and Communications in Japan, Part 3, 80, 11, 1-11.
Unoki, M. and Akagi, M. (1998). ��A method of signal extraction from noisy signal based on auditory scene analysis,�� Speech Communication, 27, 3-4, 261-279.
Mizumachi, M. and Akagi, M. (2000). "The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises," J. Acoust. Soc. Jpn. (E), 21, 5 251-258.
A. C. R. Nandasena, P. C. Nguyen, and M. Akagi (2001). " Spectral stability based event localizing temporal decomposition", Computer Speech & Language, Vol. 15, No. 4, 381-401
Akagi, M., Suzuki, N., Hayashi, K., Saito, H., and Michi, K. (2001). " Perception of Lateral Misarticulation and Its Physical Correlates", Folia Phoniatrica et Logopaedica, 53, 6, 291-307
Nguyen, P. C., Ochi, T., and Akagi, M.. (2003). ��Modified Restricted Temporal Decomposition and its Application of Low Rate Speech Coding,�� IEICE Trans. Inf. & Syst., E86-D, 3, 397-405.
Ishimoto, Y. and Akagi, M. (2004). ��Fundamental frequency estimation for noisy speech using entropy-weighted periodic and harmonic features,�� IEICE Trans. Inf. & Syst., E87-D, 1, 205-214.
Unoki, M., Furukawa, M., Sakata, K. and Akagi, M. (2004). "An improved method based on the MTF concept for restoring the power envelope from a reverberant signal," Acoust. Sci. & Tech., 25, 4, 232-242.
Unoki, M., M., Sakata, Furukawa, K. and Akagi, M. (2004). "A speech dereverberation method based on the MTF concept in power envelope restoration," Acoust. Sci. & Tech., 25, 4, 243-254.
Li, J. and Akagi, M. (2006). "Noise reduction method based on generalized subtractive beamformer," Acoust. Sci. & Tech., 27, 4, 206-215.
Nakanishi, J., Unoki, M., and Akagi, M. (2006). "Effect of ITD and component frequencies on perception of alarm signals in noisy environments," Journal of Signal Processing, 10, 4, 231-234.
Nishimoto, H. and Akagi, M. (2006). "Effects of complicated vocal tract shapes on vocal tract transfer functions," Journal of Signal Processing, 10, 4, 267-270.
Saitou, T., Unoki, M., and Akagi, M. (2006). "Analysis of acoustic features affecting singing-voice perception and its application to singing-voice synthesis from speaking-voice using STRAIGHT," J. Acoust. Soc. Am., 120, 5, Pt. 2, 3029.
Akagi, M., Dang, J., Lu, X., and Uchiyamada, T. (2006). "Investigation of interaction between speech perception and production using auditory feedback," J. Acoust. Soc. Am., 120, 5, Pt. 2, 3253.
Unoki, M., Toi, M., Shibano, Y., and Akagi, M. (2006). "Suppression of speech intelligibility loss through a modulation transfer function-based speech dereverberation method," J. Acoust. Soc. Am., 120, 5, Pt. 2, 3360.
Vu, T., Unoki, M., and Akagi, M. (2006). "A Study on Restoration of Bone-Conducted Speech with MTF-Based and LP-based Models," Journal of Signal Processing, 10, 6, 407-417.
Unoki, M., Kubo, M., Haniu, A., and Akagi, M. (2006). "A Model-Concept of the Selective Sound Segregation: - A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments -," Journal of Signal Processing, 10, 6, 419-431.
Nguyen B. P. and Akagi M. (2007). "Spectral Modification for Voice Gender Conversion using Temporal Decomposition," Journal of Signal Processing, 11, 4, 333-336.
Tomoike, S. and Akagi, M. (2008). "Estimation of local peaks based on particle filter in adverse environments," Journal of Signal Processing, 12, 4, 303-306.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008). "Adaptive -order generalized spectral subtraction for speech enhancement," Signal Processing, vol. 88, no. 11, pp. 2764-2776, 2008.
Lu, X., Unoki, M., and Akagi, M. (2008/11/1). "Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems," Acoustical Science and Technology, 29, 6, 351-361.
Nguyen, B. P. and Akagi, M. (2009/5/1) "A flexible spectral modification method based on temporal decomposition and Gaussian mixture model," Acoust. Sci. & Tech., 30, 3, 170-179.
Kinugasa, K., Unoki, M., and Akagi, M. (2009/07/01). "An MTF-based method for Blind Restoration for Improving Intelligibility of Bone-conducted Speech," Journal of Signal Processing, 13, 4, 339-342.
Hamada, Y., Kitamura, T., and Akagi, M. (2010/07/01). "A study of brain activities elicited by synthesized emotional voices controlled with prosodic features," Journal of Signal Processing, 14, 4, 265-268.
Morita, S., Unoki, M., and Akagi, M. (2010/07/01). "A study on the IMTF-based filtering on the modulation spectrum of reverberant signal," Journal of Signal Processing, 14, 4, 269-272.
Kuroda, N., Li, J., Iwaya, Y., Unoki, M., and Akagi, M. (2011). "Effects of spatial cues on detectability of alarm signals in noisy environments," In Principles and applications of spatial hearing (Eds. Suzuki, Y., Brungart, D., Iwaya, Y., Iida, K., Cabrera, D., and Kato, H.), World Scientific, 484-493.
Li, J., Yang, L., Zhang, J., Yan, Y., Hu, Y., Akagi, M., and Loizou, P. C. (2011/05). "Comparative intelligibility investigation of single-channel noise-reduction algorithms for Chinese, Japanese, and English," J. Acoust. Soc. Am., 129, 3291-3301.
Chau, D. T., Li, J., and Akagi, M. (2011/07/01). "Towards intelligent binaural speech enhancement by meaningful sound extraction," Journal of Signal Processing, 15, 4, 291-294.
Phung, T. N., Luong, M. C., and Akagi, M. (2012/08). "An investigation on speech perception under effects of coarticulation," International Journal of Computer and Electrical Engineering, Vol. 4, No. 4, 532-536.
Phung, T. N., Luong, M. C., and Akagi, M. (2012/08). "On the stability of spectral targets under effects of coarticulation," International Journal of Computer and Electrical Engineering, Vol. 4, No. 4, 537-541.
Phung, T. N., Unoki, M., and Akagi, M. (2012/09/01). "A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model," Journal of Signal Processing, 16, 5, 409-417.

International conference

Akagi, M. and Ienaga, T. (1995). "Speaker individualities in fundamental frequency contours and its control", Proc. EUROSPEECH95, 439-442.
Yonezawa, Y. and Akagi, M. (1996). "Modeling of contextual effects and its application to word spotting", Proc. Int. Conf. Spoken Lang. Process. 96, 2063-2066.
Kitamura, T. and Akagi, M. (1996). "Relationship between physical characteristics and speaker individualities in speech spectral envelopes", Proc ASA-ASJ Joint Meeting, 833-838.
Akagi, M., Kitamura, T., Suzuki, N. and Michi, K. (1996). "Perception of lateral misarticulation and its physical correlates", Proc ASA-ASJ Joint Meeting, 933-936.
Maki, K. and Akagi, M. (1997). "A functional model of the auditory peripheral system", Proc. ASVA97, Tokyo, 703-710.
Unoki, M. and Akagi, M. (1997). "A method of signal extraction from noisy signal based on auditory scene analysis", Proc. CASA97, IJCAI-97, Nagoya, 93-102.
Akagi, M. and Mizumachi, M. (1997). "Noise Reduction by Paired Microphones", Proc. EUROSPEECH97, 335-338.
Unoki, M. and Akagi, M. (1997). "A method of signal extraction from noisy signal", Proc. EUROSPEECH97, 2587-2590.
Nandasena, A.C.R. and Akagi, M. (1998). ��Spectral stability based event localizing temporal decomposition,�� Proc. ICASSP98, II, 957-960
Mizumachi, M. and Akagi, M. (1998). ��Noise reduction by paired-microphones using spectral subtraction,�� Proc. ICASSP98, II, 1001-1004
Maki, K., Hirota, K. and Akagi, M. (1998). ��A functional model of the auditory peripheral system: Responses to simple and complex stimuli,�� Computational Hearing, Italy, 13-18.
Itoh, K. and Akagi, M. (1998). ��A computational model of auditory sound localization,�� Computational Hearing, Italy, 67-72
Unoki, M. and Akagi, M. (1998). ��A computational model of co-modulation masking release,�� Computational Hearing, Italy, 129-134.
Unoki, M. and Akagi, M. (1998). ��Signal extraction from noisy signal based on auditory scene analysis,�� ICSLP98, Sydney, Vol.5, 2115-2118.
Akagi, M., Iwaki, M. and Sakaguchi, N. (1998). ��Spectral sequence compensation based on continuity of spectral sequence,�� Proc. ICSLP98, Sydney, Vol.4, 1407-1410.
Akagi, M., Iwaki, M. and Minakawa, T. (1998). ��Fundamental frequency fluctuation in continuous vowel utterance and its perception,�� ICSLP98, Sydney, Vol.4, 1519-1522.
Mizumachi, M. and Akagi, M. (1999). "Noise reduction method that is equipped for robust direction finder in adverse environments," Proc. Workshop on Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, 179-182.
Ito, K. and Akagi, M. (1999). "A computational model of auditory sound localization based on ITD," Abstracts of Symposium on Recent Developments in Auditory Mechanics, Sendai, Japan, 29P01, 156-157.
Maki, K., Akagi, M. and Hirota, K. (1999). "Effect of the basilar membrane nonlinearities on rate-place representation of vowel in the cochlear nucleus: A modeling approach," Abstracts of Symposium on Recent Developments in Auditory Mechanics, Sendai, Japan, 29P06, 166-167.
Unoki, M. and Akagi, M. (1999). "Segregation of vowel in background noise using the model of segregating two acoustic sources based on auditory scene analysis", Proc. CASA99, IJCAI-99, Stockholm, 51-60.
Unoki, M. and Akagi, M. (1999). "Segregation of vowel in background noise using the model of segregating two acoustic sources based on auditory scene analysis", Proc. EUROSPEECH99, 2575-2578.
Mizumachi, M. and Akagi, M. (1999). "An objective distortion estimator for hearing aids and its application to noise reduction," Proc. EUROSPEECH99, 2619-2622.
Ishimoto, Y. and Akagi, M. (2000). "A fundamental frequency estimation method for noisy speech," Proc. WESTPRAC7, 161-164.
Ito, K. and Akagi, M. (2000). "A study on temporal information based on the synchronization index using a computational model," Proc. WESTPRAC7, 263-266.
Mizumachi, M. and Akagi, M. (2000). "Noise reduction using a small-scale microphone array under non-stationary signal conditions," Proc. WESTPRAC7, 421-424.
Akagi, M. and Kitakaze, H. (2000). "Perception of synthesized singing voices with fine fluctuations in their fundamental frequency contours," Proc. ICSLP2000, Beijing, III-458-461.
Mizumachi, M., Akagi, M. and Nakamura, S. (2000). "Design of robust subtractive beamformer for noisy speech recognition," Proc. ICSLP2000, Beijing, IV-57-60.
Akagi, M., Mizumachi, M.,Ishimoto, Y., and Unoki, M. (2000). "Speech enhancement and segregation based on human auditory mechanisms", Proc. IS2000, Aizu, 246-253.
Ito, K. and Akagi, M. (2000). "A computational model of binaural coincidence detection using impulses based on synchronization index." Proc, ISA2000 (BIS2000), Wollongong, Australia.
Ishimoto, Y., Unoki, M., and Akagi, M. (2001). "A fundamental frequency estimation method for noisy speech based on periodicity and harmonicity", Proc. ICASSP2001, SPEECH-SF3, Salt Lake City.
Akagi, M., Kakehi, M., Kawaguchi, M., Nishinuma, M., and Ishigami, A. (2001). "Noisiness estimation of machine working noise using human auditory model", Proc. Internoise2001, 2451-2454.
Ishimoto, Y., Unoki, M., and Akagi, M. (2001). "A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency ", Proc. CRAC, Aalborg.
Ishimoto, Y., Unoki, M., and Akagi, M. (2001). "A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency", Proc. EUROSPEECH2001, Aalborg, 2439-2442.
Akagi, M. and Kago, T. (2002). " Noise reduction using a small-scale microphone array in multi noise source environment," Proc. ICASSP2002, Orlando, I-909-912.
Nguyen, P. C. and Akagi, M.. (2002). "Improvement of the restricted temporal decomposition method for line spectral frequency parameters," Proc. ICASSP2002, Orlando, I-265-268.
Nishimoto, H., Akagi, M., Kitamura, T., and Suzuki, N. (2002). "FEM analyses of three dimensional vocal tract models after tongue and mouth floor resection," NATO Advanced Study Institute 2002 Dynamics of Speech Production and Perception.
Unoki, M., Saitou, T., and Akagi, M. (2002). "Effect of F0 fluctuations and development of F0 control model in singing voice perception," NATO Advanced Study Institute 2002 Dynamics of Speech Production and Perception.
Saitou, T., Unoki, M., and Akagi, M. (2002). "Extraction of F0 dynamic characteristics and development of F0 control model in singing voice," Proc. ICAD2002, Kyoto.
Nguyen, P. C. and Akagi, M.. (2002). "Limited error based event localizing temporal decomposition," Proc. EUSIPCO2002, Toulouse, 190.
Nguyen, P. C. and Akagi, M.. (2002). "Coding speech at very low rates using STRAIGHT and temporal decomposition," Proc. ICSLP2002, Denver, 1849-1852.
Akagi, M. (2002). "Perception of fundamental frequency fluctuation," HEA-02-003-IP, Forum Acousticum Sevilla 2002 (Invited).
Unoki, M., Furukawa, M., and Akagi, M. (2002). "A method for recovering the power envelope from reverberant speech," SPA-Gen-002, Forum Acousticum Sevilla 2002.
Nguyen, P. C. and Akagi, M.. (2002). "Variable rate speech coding using STRAIGHT and temporal decomposition," Proc. SCW2002, Tsukuba, 26-28.
Nguyen, P. C., Akagi, M., and Ho, T. B. (2003). "Temporal decomposition: A promising approach to VQ-based speaker identification," Proc. ICASSP2003, Hong Kong, I-184-187.
Unoki, M., Furukawa, M., Sakata, K., and Akagi, M. (2003). "A method based on the MTF concept for dereverberating the power envelope from the reverberant signal," Proc. ICASSP2003, Hong Kong, I-840-843.
Nguyen, P. C., Akagi, M., and Ho, T. B. (2003). "Temporal decomposition: A promising approach to VQ-based speaker identification," Proc. ICME2003, Baltimore, V.III, 617-620.
Maki, K. and Akagi, M. (2003). ��A computational model of cochlear nucleus neurons,�� Proc. ISH2003, 70-76.
Ito, K. and Akagi, M. (2003). ��Study on improving regularity of neural phase locking in single neuron of AVCN via computational model,�� Proc. ISH2003, 77-83.
Nguyen, P. C. and Akagi, M. (2003). ��Efficient quantization of speech excitation parameters using temporal decomposition,�� Proc. EUROSPEECH2003, Geneva, 449-452.
Unoki, M., Sakata, K. and Akagi, M. (2003). ��A speech dereverberation method based on the MTF concept,�� Proc. EUROSPEECH2003, Geneva, 1417-1420.
Unoki, M., Kubo, M., and Akagi, M. (2003). ��A model for selective segregation of a target instrument sound from the mixed sound of various instruments,�� Proc. ICMC2003, Singapore, 295-298.
Akagi, M. and Nguyen, P. C. (2004). ��Temporal decomposition of speech and its application to speech coding and modification,�� Proc. Special Workshop in MAUI (SWIM), 1-4, 2004.
Unoki, M., Sakata, K., Toi, M., and Akagi, M. (2004). ��Speech dereverberation based on the concept of the modulation transfer function,�� Proc. NCSP2004, Hawaii, 423-426.
Saitou, T., Unoki, M., and Akagi, M. (2004). ��Development of the F0 control method for singing-voices synthesis,�� Proc. SP2004, Nara, 491-494.
Saitou, T., Unoki, M., and Akagi, M. (2004). ��Control methods of acoustic parameters for singing-voice synthesis,�� Proc. ICA2004, 501-504.
Nishimoto, H., Akagi, M., Kitamura, T. and Suzuki, N. (2004). ��Estimation of transfer function of vocal tract extracted from MRI data by FEM,�� Proc. ICA2004, 1473-1476.
Ito, S., Dang, J., and Akagi, M. (2004). ��Investigation of the acoustic features of emotional speech using physiological articulatory model,�� Proc. ICA2004, 2225-2226.
Kozaki, Y., Suzuki, N., Amagasa, T., and Akagi, M. (2004). ��Perception of hypernasality and its physical correlates,�� Proc. ICA2004. 3313-3316.
Unoki, M., Toi, M., and Akagi, M. (2004). "A speech dereverberation method based on the MTF concept using adaptive time-frequency divisions," Proc. EUSIPCO2004, 1689-1692.
Akagi, M., Nguyen, P. C., Saitou, T., Tsuji, N., and Unoki, M. (2004). "Temporal decomposition of speech and its application to speech coding and modification," Proc. KEST2004, 280-288.
Saitou, T., Tsuji, N., Unoki, M. and Akagi, M. (2004). "Analysis of acoustic features affecting "singing-ness" and its application to singing-voice synthesis from speaking-voice," Proc. ICSLP2004, Cheju, Korea.
Li, J. and Akagi, M. (2004). "Noise reduction using hybrid noise estimation technique and post-filtering," Proc. ICSLP2004, Cheju, Korea.
Toi, M., Unoki, M. and Akagi, M. (2005). "Development of adaptive time-frequency divisions and a carrier reconstruction in the MTF-based speech dereverberation method," Proc. NCSP05, Hawaii, 355-358.
Haniu, A., Unoki, M. and Akagi, M. (2005). "A study on a speech recognition method based on the selective sound segregation in noisy environment," Proc. NCSP05, Hawaii, 403-406.
Kimura, K., Unoki, M. and Akagi, M. (2005). "A study on a bone-conducted speech restoration method with the modulation filterbank," Proc. NCSP05, Hawaii, 411-414.
Li, J. and Akagi, M. (2005). "Suppressing localized and non-localized noises in arbitrary noise environments," Proc. HSCMA2005, Piscataway.
Li, J., Lu, X., and Akagi, M. (2005). "A noise reduction system in arbitrary noise environments and its application to speech enhancement and speech recognition," Proc. ICASSP2005, Philadelphia, III-277-280.
Huang, C. F. and Akagi, M. (2005). "A Multi-Layer fuzzy logical model for emotional speech Perception," Proc. EuroSpeech2005, Lisbon, Portugal, 417-420.
Unoki, M., Kubo, M., Haniu, A., and Akagi, M. (2005). "A model for selective segregation of a target instrument sound from the mixed sound of various instruments," Proc. EuroSpeech2005, Lisbon, Portugal, 2097-2100.
Li, J. and Akagi, M. (2005). "A hybrid microphone array post-filter in a diffuse noise field," Proc. EuroSpeech2005, Lisbon, Portugal, 2313-2316.
Li, J. and Akagi, M. (2005). "Theoretical analysis of microphone arrays with postfiltering for coherent and incoherent noise suppression in noisy environments," Proc. IWAENC2005, Eindhoven, The Netherlands, 85-88.
Nakanishi, J., Unoki, M., and Akagi, M. (2006). "Effect of ITD and component frequencies on perception of alarm signals in noisy environments," Proc. NCSP2006, 37-40.
Vu, T. T., Unoki, M., and Akagi, M. (2006). "A study on an LPC-based restoration model for improving the voice-quality of bone-conducted speech," Proc. NCSP2006, 110-113.
Nishimoto, H. and Akagi, M. (2006). "Effects of complicated vocal tract shapes on vocal tract transfer functions," Proc. NCSP2006, 114-117.
Takeyama, Y., Unoki, M., Akagi, M., and Kaminuma, A. (2006). "Synthesis of mimic speech sounds uttered in noisy car environments," Proc. NCSP2006, 118-121.
Lu, X., Unoki, M., and Akagi, M. (2006). "MTF-based sub-band power envelope restoration in reverberant environment for robust speech recognition, " Proc. NCSP2006, 162-165.
Unoki, M., Toi, M., and Akagi, M. (2006). "Refinement of an MTF-based speech dereverberation method using an optimal inverse-MTF filter," SPECOM2006, St. Petersburg, 323-326.
Li, J., Akagi, M., and Suzuki, Y. (2006). "Noise reduction based on generalized subtractive beamformer for speech enhancement," WESPAC2006, Seoul
Li, J, Akagi, M., and Suzuki, Y. (2006). "Improved hybrid microphone array post-filter by integrating a robust speech absence probability estimator for speech enhancement," Proc. ICSLP2006, Pittsburgh, USA, 2130-2133.
Lu, X., Unoki, M., and Akagi, M. (2006). "A robust feature extraction based on the MTF concept for speech recognition in reverberant environment," Proc. ICSLP2006, Pittsburgh, USA, 2546-2549.
Vu, T., Unoki, M., and Akagi, M. (2006). "A study on an LP-based model for restoring bone-conducted speech," Proc. HUT-ICCE2006, Hanoi.
Li, J., Akagi, M., and Suzuki, Y. (2006). "Noise reduction based on microphone array and post-filtering for robust speech recognition," Proc. ICSP, Guilin.
Minowa A., Unoki M., and Akagi M. (2007). "A study on physical conditions for auditory segregation/integration of speech signals based on auditory scene analysis," Proc. NCSP2007, 313-316.
Nguyen B. P. and Akagi M. (2007). "Spectral Modification for Voice Gender Conversion using Temporal Decomposition," Proc. NCSP2007, 481-484.
Uchiyama H., Unoki M., and Akagi M. (2007). "A study on perception of alarm signal in car environments," Proc. NCSP2007, 389-392.
Akagi, M., Saitou, T., and Huang, C-F. (2007). "Voice conversion to add non-linguistic information into speaking voices," Proc. JCA2007, CD-ROM.
Haniu, A., Unoki, M. and Akagi, M. (2007). "A study on a speech recognition method based on the selective sound segregation in noisy environment," Proc. JCA2007, CD-ROM.
Huang, C-H. and Akagi, M. (2007). "The building and verification of a three-layered model for expressive speech perception," Proc. JCA2007, CD-ROM.
Sawamura K., Dang J., Akagi M., Erickson D., Li, A., Sakuraba, K., Minematsu, N., and Hirose, K. (2007). "Common factors in emotion perception among different cultures," Proc. ICPhS2007, 2113-2116.
Nguyen, P. C., Akagi, M., and Nguyen, P. B. (2007). "Limited error based event localizing temporal decomposition and its application to variable-rate speech coding," Speech Communication, 49, 292-304.
Nguyen B. P. and Akagi M. (2007). "A flexible spectral modification method based on temporal decomposition and Gaussian mixture model," Proc. Interspeech2007, 538-541.
Huang, C. F. and Akagi, M. (2007). "A rule-based speech morphing for verifying an expressive speech perception model," Proc. Interspeech2007, 2661-2664.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2007). "Noise reduction based on adaptive ��-order generalized spectral subtraction for speech enhancement," Proc. Interspeech2007, 802-805.
Saitou, T., Goto, M., Unoki, M., and Akagi, M. (2007). "Vocal conversion from speaking voice to singing voice using STRAIGHT," Proc. Interspeech2007, Singing Challenge.
Vu, T. T., Seide, G., Unoki, M., and Akagi, M. (2007). "Method of LP-based blind restoration for improving intelligibility of bone-conducted speech," Proc. Interspeech2007, 966-969.
Haniu, A., Unoki, M. and Akagi, M. (2007). "A study on a speech recognition method based on the selective sound segregation in various noisy environments," Proc. NOLTA2007, Vancouver, 445-448.
Vu, T. T., Unoki, M., and Akagi, M. (2007). "A blind restoration model for bone-conducted speech based on a linear prediction scheme," Proc. NOLTA2007, Vancouver, 449-452.
Uchiyama, H., Unoku, M., and Akagi, M. (2007). "Improvement in detectability of alarm signals in noisy environments by utilizing spatial cues," Proc. WASPAA2007, New Paltz, NY, pp.74-77.
Saitou, T., Goto, M., Unoku, M., and Akagi, M. (2007). "Speech-to-singing synthesis: converting speaking voices to singing voices by controlling acoustic features unique to singing voices," Proc. WASPAA2007, New Paltz, NY, pp.215-218
Vu, T. T. Unoki, M. and Akagi, M. (2007). "The Construction of Large-scale Bone-conducted and Air-conducted Speech Databases for Speech Intelligibility Tests," Proc. Oriental COCOSDA2007, 88-91.
Kusaba, M., Unoki, M., and Akagi, M. (2008/3/6). "A study on detectability of target signal in background noise by utilizing similarity of temporal envelopes in auditory search," Proc. NCSP08, 13-16.
Haniu, A., Unoki, M., and Akagi, M. (2008/3/6). "A speech recognition method based on the selective sound segregation in various noisy environments," Proc. NCSP08, 168-171.
Tezuka, T. and Akagi, M. (2008/3/6). "Influence of spectrum envelope on phoneme perception," Proc. NCSP08, 176-179.
Shibata, T. and Akagi, M. (2008/3/6). "A study on voice conversion method for synthesizing stimuli to perform gender perception experiments of speech," Proc. NCSP08, 180-183.
Nguyen B. P. and Akagi M. (2008/3/7). "Control of spectral dynamics using temporal decomposition in voice conversion and concatenative speech synthesis," Proc. NCSP08, 279-282.
Vu, T. T., Unoki, M., and Akagi, M. (2008/3/7). "A study of blind model for restoring bone-conducted speech based on liner prediction scheme," Proc. NCSP08, 287-290.
Tomoike, S. and Akagi, M. (2008/3/7). "Estimation of local peaks based on particle filter in adverse environments," Proc. NCSP08, 391-394
Akagi, M. (2008/6/5). "Voice conversion to add non-linguistic information into speaking voices," ICCE2008, Tutorial (Hoian, Vietnam).
Vu, T. T. Unoki, M. and Akagi, M. (2008/6/5). "An LP-based blind model for restoring bone-conducted speech," Proc. ICCE2008, 212-217.
Nguyen B. P. and Akagi M. (2008/6/6). "Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture model," Proc. ICCE2008, 224-229.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008/06/30). "A two-stage binaural speech enhancement approach for hearing aids with preserving binaural benefits in noisy environments," Acoustics2008, Paris, 723-727.
Lu, X., Unoki, M., and Akagi, M. (2008/07/01). "An MTF-based blind restoration for temporal power envelopes as a front-end processor for automatic speech recognition systems in reverberant environments," Acoustics2008, Paris, 1419-1424.
Huang, C. F., Erickson, D., and Akagi, M. (2008/07/01). "Comparison of Japanese expressive speech perception by Japanese and Taiwanese listeners," Acoustics2008, Paris, 2317-2322.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008/08/16). "Improved two-stage binaural speech enhancement based on accurate interference estimation for hearing aids," IHCON2008
Li, J., Jiang, H., and Akagi, M. (2008/09/23). "Psychoacoustically-motivated adaptive ��-order generalized spectral subtraction based on data-driven optimization," Proc. InterSpeech2008, Brisbane, 171-174.
Petric, R., Lu, X., Unoki, M., Akagi, M., and Hoffmann, R. (2008/09/24). "Robust front end processing for speech recognition in reverberant environments: Utilization of speech characteristics," Proc. InterSpeech2008, Brisbane, 658-661.
Nguyen, B. P., Shibata, T., and Akagi, M. (2008/09/24). "High-quality analysis/synthesis method based on Temporal decomposition for speech modification," Proc. InterSpeech2008, Brisbane, 662-665.
Huang, C-F. and Akagi, M. (2008/10) "A three-layered model for expressive speech perception," Speech Communication 50, 810-828.
Kuroda, N., Li, J., Iwaya, Y., Unoki, M., and Akagi, M. (2009/03/01). "Effects from Spatial Cues on Detectability of Alarm Signals in Car Environments," Proc. NCSP'09, 45-48.
Kinugasa, K., Unoki, M., and Akagi, M. (2009/03/01). "An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech," Proc. NCSP'09, 105-108.
Aoki, Y., Huang, C-F., and Akagi, M. (2009/03/01). "An emotional speech recognition system based on multi-layer emotional speech perception model," Proc. NCSP'09, 133-136.
Nakamura, T., Kitamura, T. and Akagi, M. (2009/03/01). "A study on nonlinguistic feature in singing and speaking voices by brain activity measurement," Proc. NCSP'09, 217-220.
Li, J., Fu, Q-J., Jiang, H., and Akagi, M. (2009/04/24). "Psychoacoustically-motivated adaptive ��-order generalized spectral subtraction for cochlear implant patients," Proc ICASSP2009, 4665-4668.
Kinugasa, K., Unoki, M., and Akagi, M. (2009/6/24). "An MTF-based method for blindly restoring bone-conducted speech," Proc. SPECOM2009, St. Petersburg, Russia, 199-204.
Akagi, M. (2009/08/14). "Multi-layer model for expressive speech perception and its application to expressive speech synthesis," Plenary lecture, NCMMSC2009, Lanzhou, China.
Saitou, T., Goto, M., Unoki, M., and Akagi, M. "Speech-to-Singing Synthesis System: Vocal Conversion from Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices," NCMMSC2009, Lanzhou, China (2009/08/15).
Unoki, M., Yamasaki, Y., and Akagi, M. (2009/08/25). "MTF-based power envelope restoration in noisy reverberant environments," Proc. EUSIPCO2009, Glasgow, Scotland, 228-232.
Nguyen B. P. and Akagi M. (2009/9/9). "Efficient modeling of temporal structure of speech for applications in voice transformation," Proc. InterSpeech2009, Brighton, 1631-1634.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2009/9/21). "Advancement of two-stage binaural speech enhancement (TS-BASE) for high-quality speech communication," Proc. WESPAC2009, Beijing, CD-ROM.
Haniu, A., Unoki, M., and Akagi, M. (2009/9/21). "A psychoacoustically-motivated conceptual model for automatic speech recognition," Proc. WESPAC2009, Beijing, CD-ROM.
Akagi, M. (2009/10/6). "Analysis of production and perception characteristics of non-linguistic information in speech and its application to inter-language communications," Proc. APSIPA2009, Sapporo, 513-519.
Dang, J., Li, A., Erickson, D., Suemitsu, A., Akagi, M., Sakuraba, K., Minematsu, N., and Hirose, K. (2009/10/6). "Comparison of emotion perception among different cultures," Proc. APSIPA2009, Sapporo, 538-544.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2009/10/20). "Two-stage binaural speech enhancement with Wiener filter based on equalization-cancellation model," Proc. WASPAA, New Palts, NY, 133-136.
Naoki Kuroda, Junfeng Li, Yukio Iwaya, Masashi Unoki, and Masato Akagi, "Effects of spatial cues on detectability of alarm signals in noisy environments," Proc. IWPASH2009, P7. Zaou, Japan, Nov. 2009 (CDROM).
Morita, S., Unoki, M., and Akagi, M. (2010/03/04). "A study on the MTF-based inverse filtering for the modulation spectrum of reverberant speech," Proc. NCSP10, Hawaii, USA, 265-268.
Li, J. Sasaki, Y., Akagi, M. and Yan, Y. (2010/03/04). "Experimental evaluations of TS-BASE/WF in reverberant conditions," Proc. NCSP10, Hawaii, USA, 269-272.
Hamada, Y., Kitamura, T., and Akagi, M. (2010/03/04). "A study on brain activities elicited by synthesized emotional voices controlled with prosodic features," Proc. NCSP10, Hawaii, USA, 472-475.
Ishida, M. and Akagi, M. (2010/03/04). "Pitch perception of complex sounds with varied fundamental frequency and spectral tilt," Proc. NCSP10, Hawaii, USA, 480-483.
Hamada, Y., Kitamura, T., and Akagi, M. (2010/08/24). "A study on brain activities elicited by emotional voices with various F0 contours," Proc. ICA2010, Sydney, Australia.
Chau, D. T., Li, J., and Akagi, M. (2010/09/30). "A DOA estimation algorithm based on equalization-cancellation theory," Proc. INTERSPEECH2010, Makuhari, 2770-2773.
Akagi, M. (2010/11/29). "Rule-based voice conversion derived from expressive speech perception model: How do computers sing a song joyfully?" Tutorial, ISCSLP2010, Tainan, Taiwan �ʾ��Թֱ��.
Li, J., Chau, D. T., Akagi, M., Yang, L., Zhang, J., and Yan, Y. (2010/11/30). "Intelligibility Investigation of Single-Channel Noise Reduction Algorithms for Chinese and Japanese," in: Proc. ISCSLP2010, Tainan, Taiwan.
Phung, T. N., Unoki, M. and Akagi, M. (2010/12/14). "Improving Bone-Conducted Speech Restoration in noisy environment based on LP scheme," Proc. APSIPA2010, Student Symposium, 12.
Trung-Nghia Phung, Mai Chi Luong, and Masato Akagi (2011/02). "An investigation on speech perception over coarticulation," Proc. ICSAP2011, VI, 507-511.
Trung-Nghia Phung, Mai Chi Luong, and Masato Akagi (2011/02). "An investigation on perceptual line spectral frequency (PLP-LSF) target stability against the vowel neutralization phenomenon," Proc. ICSAP2011, VI, 512-514.
Kosugi, T., Haniu, A., Miyauchi, R., Unoki, M., and Akagi, M. (2011/03/01). "Study on suitable-architecture of IIR all-pass filter for digital-audio watermarking technique based on cochlear-delay characteristics,"Proc. NCSP2011, Tianjin, China, 135-138.
Mizukawa, S. and Akagi, M. (2011/03/02). "A binaural model accounting for spatial masking release," Proc. NCSP2011, Tianjin, China, 179-182.
Yano, Y., Miyauchi, R., Unoki, M., and Akagi, M. (2011/03/02). "Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals," Proc. NCSP2011, Tianjin, China, 231-234.
Ikeda, T., Unoki, M., and Akagi, M. (2011/03/02). "Study on blind estimation of Speech Transmission Index in room acoustics,"Proc. NCSP2011, Tianjin, China, 235-238.
Morita, S., Lu, X., Unoki, M., and Akagi, M. (2011/03/02). "Study on MTF-based power envelope restoration in noisy reverberant environments," Proc. NCSP2011, Tianjin, China, 247-250.
Shih, T, Suemitsu, A., and Akagi, M. (2011/03/03). "Influences of transformed auditory feedback with first three formant frequencies," Proc. NCSP2011, Tianjin, China, 340-343.
Chau, D. T., Li, J., and Akagi, M. (2011/03/03). "Towards an intelligent binaural speech enhancement system by integrating meaningful signal extraction,"Proc. NCSP2011, Tianjin, China, 344-347.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2011/06). "Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication," Speech Communication 53 677-689.
Unoki, M., Ikeda, T., and Akagi, M. (2011/06/27). "Blind estimation method of speech transmission index in room acoustics," Proc. Forum Acousticum 2011, Aalborg, Denmark, 1973-1978.
Unoki, M., Lu, X., Petrick, R., Morita, S., Akagi, M., and Hoffmann R. (2011/08/30). "Voice activity detection in MTF-based power envelope restoration," Proc. INTERSPEECH 2011, Florence, Italy, 2609-2612.
Shota Morita, Xugang Lu, Masashi Unoki, Masato Akagi, and Ruediger Hoffmann, "MTF-based sub-band power-envelope restoration for robust speech recognition in noisy reverberant environments," Proc. APSIPA2011, Xi'an, Oct. 2011 (CDROM).
Sasaki, Y. and Akagi, M. (2012/03/05), "Speech enhancement technique in noisy reverberant environment using two microphone arrays," Proc. NCSP2012, Honolulu, HW, 333-336.
Izumida, T. and Akagi, M. (2012/03/05). "Study on hearing impression of speaker identification focusing on dynamic features," Proc. NCSP2012, Honolulu, HW, 401-404.
Yano, Y., Miyauchi, R., Unoki, M., and Akagi, M. (2012/03/06). "Study on detectability of signals by utilizing differences in their amplitude modulation," Proc. NCSP2012, Honolulu, HW, 611-614.
Xia, R., Li, J., Akagi, M., and Yan, Y. (2012/03/28). "Evaluation of objective intelligibility prediction measures for noise-reduced signals in Mandarin," Proc. ICASSP2012, Kyoto, 4465-4468.
Akagi, M. and Irie, Y. (2012/08/22). "Privacy protection for speech based on concepts of auditory scene analysis," Proc. INTERNOISE2012, New York, 485.
Elbarougy, R. and Akagi, M. (2012/12/04). "Speech Emotion Recognition System Based on a Dimensional Approach Using a Three-Layered Model," Proc. APSIPA2012, Hollywood, USA.
Phung, T. N., Luong, M. C., and Akagi, M. (2012/12/06). "A concatenative speech synthesis for monosyllabic languages with limited data," Proc. APSIPA2012, Hollywood, USA.
Phung, T. N., Luong, M. C., and Akagi, M. (2012/12/12). "Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages," Proc. O-COCOSDA2012, Macau, 129-134.
Motoda, H. and Akagi, M. (2013/03/05). "A singing voices synthesis system to characterize vocal registers using ARX-LF model," Proc. NCSP2013, Hawaii, USA, 93-96.
Hisatsune, H. and Akagi, M. (2013/03/05). "A Study on individualization of Head-Related Transfer Function in the median plane," Proc. NCSP2013, Hawaii, USA, 161-164.
Kubo, R. and Akagi, M. (2013/06/04). "Exploring auditory aging can exclusively explain Japanese adults�� age-related decrease in training effects of American English /r/-/l/," Proc. ICA2013, 2aSC34, Montreal.
Unoki, M., Ikeda, T., Sasaki, K., Miyauchi, R., Akagi, M., and Kim, N-S. (2013/07/08). "Blind method of estimating speech transmission index in room acoustics based on concept of modulation transfer function," Proc. ChinaSIP2013, Beijing, 308-312.
Chau, D. T., Li, J., and Akagi, M. (2013/07/08). "Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio," Proc. ChinaSIP2013, Beijing, 322-326.
Li, J., Akagi, M., and Yan, Y. (2013/07/09). "Objective Japanese intelligibility prediction for noisy speech signals before and after noise-reduction processing," Proc. ChinaSIP2013, Beijing, 352-355.
Li, J, Chen, F., Akagi, M., and Yan, Y. (2013/08/27), "Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese," Proc. InterSpeech2013, Lyon, 1184-1187.
Phung, T. N., Luong, M. C., and Akagi, M. (2013/09/02). "A Hybrid TTS between Unit Selection and HMM-based TTS under limited data conditions," Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain 281-284.
Nishie, S. and Akagi, M. (2013/09/11). "Acoustic sound source tracking for a moving object using precise Doppler-shift measurement," Proc. EUSIPCO2013, Marrakesh, Morocco.
Akagi, M. and Hisatsune, H. (2013/10/17). "Admissible range for individualization of head-related transfer function in median plane," Proc. IIHMSP2013, Beijing.
Elbarougy, R. and Akagi, M. (2013/11/01). "Cross-lingual speech emotion recognition system based on a three-layer model for human perception," Proc. APSIPA2013, Kaohsiung, Taiwan.

Invited lecture

Akagi, M. (2007/12/6). "Conversion of speaking voice into singing voice," ICT Forum Hanoi, Invited talk (Hanoi, Vietnam)

Workshop in Japan

Li, J. and Akagi, M. (2004). "A noise reduction system in localized and non-localized noise environments," Tech. Report of IEICE, EA2004-34.
Li, J. and Akagi, M. (2005). "A noise reduction method based on a generalized subtractive beamformer," Tech. Report of IEICE, EA2005-44.
Vu T. T., Unoki M., and Akagi, M. (2007). "An LP-based blind restoration method for improving intelligibility of bone-conducted speech," �ŻҾ��̿��ز񵻽��SP2006-172.
Nguyen B. P. and Akagi M. (2007). "A flexible temporal decomposition-based spectral modification method using asymmetric Gaussian mixture model," �ŻҾ��̿��ز񵻽��SP2007-25.
Huang C. F., Erickson, D., and Akagi M. (2007). "A study on expressive speech and perception of semantic primitives: Comparison between Taiwanese and Japanese," �ŻҾ��̿��ز񵻽��SP2007-32.
Unoki M., Vu T. T., Seide J., and Akagi, M. (2007). "Evaluation of an LP-based blind restoration method to improve intelligibility of BC speech," �ŻҾ��̿��ز񵻽��SP2007-41.
Petric, R., Lu, X., Unoki, M., Akagi, M., and Hoffmann, R. (2008/07/17). "Robust front end processing for speech recognition in reverberant environments: Utilization of Speech Properties," �ŻҾ��̿��ز񵻽��SP2008-44.
Yang, L., Li, J., Zhang, J., Yan, Y., and Akagi, M. (2009/6/26). "Effects of single-channel enhancement algorithms on Mandarin speech intelligibility," IEICE Tech. Report, EA2009-32.
Chau, D. T., Li, J., and Akagi, M. (2010/06/11). "A DOA estimation algorithm based on equalization-cancellation theory," IEICE Tech. Report, EA2010-28.
Phung, T. N., Unoki, M., and Akagi, M. (2010/06/11). "Comparative evaluation of bone-conducted-speech restoration based on linear prediction scheme," IEICE Tech. Report, EA2010-31.
Zhou, Y., Li, J., Sun, Y., Zhang, J., Yan, Y., and Akagi, M. (2010/10). "A hybrid speech emotion recognition system based on spectral and prosodic features," IEICE Trans. Info. & Sys., E93D (10): 2813-2821.
Dang, J., Li, A., Erickson, D., Suemitsu, A., Akagi, M., Sakuraba, K., Mienmatasu, N., and Hirose, K. (2010/11/01). "Comparison of emotion perception among different cultures," Acoust. Sci. & Tech. 31, 6, 394-402.
Phung, T. N., Luong, M. C., and Akagi, M. (2012/06/14). "A low-cost concatenative TTS for monosyllabic languages," IEICE Tech. Report, SP-2012-35.
Elbarougy, R. and Akagi, M. (2012/06/14). "Comparison of methods for emotion dimensions estimation in speech using a three-layered model," IEICE Tech. Report, SP-2012-36.
Elbarougy, R. and Akagi, M. (2013/03/01). "Automatic Speech Emotion Recognition Using A Three Layer Model," �ŻҾ��̿��ز񵻽��SP2012-127.
Chau, D. T., Li, J., and Akagi, M. (2013/05/16). "Adaptive equalization-cancellation model and its application to sound localization in noisy reverberant environments," IEICE Tech. Report, EA2013-24, SP-2013-24.

Verbal presentation

Nandasena, A.C.R. and Akagi, M. (1997). "A new approach to temporal decomposition of speech", Proc. ASJ '97 Spring Meeting, 2-7-6.
Nandasena, A.C.R. and Akagi, M. (1997). "S2BEL Temporal Decomposition for Efficient Spectral Coding", Proc. ASJ '97 Fall Meeting, 1-P-21.
Nandasena, A.C.R. and Akagi, M. (1998). "Temporal decomposition of speech excitation parameters," Proc. ASJ '98 Spring Meeting, 3-7-12.
Nguyen, P. C. and Akagi, M. (2001). "Limited error based event localizing temporal decomposition", Proc. ASJ '2002 Spring Meeting, 3-10-12.
Nguyen, P. C. and Akagi, M.. (2002). "Variable rate speech coding based on STRAIGHT using temporal decomposition," Proc. ASJ '2002 Fall Meeting, 1-10-4.
Nguyen, P. C. and Akagi, M. (2003). "On the application of temporal decomposition to VQ-based speaker identification," Proc. ASJ '2003 Spring Meeting, 1-10-4.
Li, J. and Akagi, M. (2004). ��A hybrid noise reduction method using single- and multi-channel techniques,�� Proc. ASJ '2004 Spring Meeting, 3-P-2.
Huang, C. F. and Akagi, M. (2004). "A perceptual model of emotional speech build by Fuzzy logic," Proc. ASJ '2004 Fall Meeting, 2-2-8.
Li, J. and Akagi, M. (2004). "Multi-channel post-filtering in diffuse noise environment," Proc. ASJ '2004 Fall Meeting, 2-3-10.
Huang, C. F. and Akagi, M. (2004). "A multi-layer fuzzy logical model for emotional speech perception," Trans. Psycho. & Physio. Acoust., ASJ, H-2004-95.
Huang, C. F. and Akagi, M. (2005). "Rule-Based Speech Morphing for Evaluating Linguistic Descriptions of Emotional Speech Perception," Proc. ASJ '2005 Fall Meeting, 1-6-3.
Li, J. and Akagi, M. (2005). "A noise reduction method based on a generalized subtractive beamformer," Proc. ASJ '2005 Fall Meeting, 2-2-19.
Vu, T. T., Unoki, M., and Akagi, M. (2006). "A method for restoring bone-conducted speech base on LPC model," Proc. ASJ '2006 Spring Meeting, 1-3-3
Li, J., Akagi, M., and Suzuki, Y. (2006). "Two-microphone noise reduction with preserving ITD cues in highly non-stationary multi-noise-source environments," Proc. ASJ '2006 Spring Meeting, 3-5-10
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2006). "Generalized spectral subtraction based on sub-band SNR," Proc. ASJ '2006 Fall Meeting, 1-1-21.
Vu, T., Unoki, M., and Akagi, M. (2006). "A study on predicting parameters of LP-based model for restoring bone-conducted speech," Proc. ASJ '2006 Fall Meeting, 2-5-1.
Nguyen B. P., Huang C. F., and Akagi M. (2007). "Temporal decomposition-based spectral modification and its application to emotional speech synthesis," Proc. ASJ '2007 Spring Meeting, 3-8-8.
Huang C. F., Nguyen B. P., and Akagi M. (2007). "Rule-Based Speech Morphing for Evaluating Emotional Speech Perception Model," Proc. ASJ '2007 Spring Meeting, 3-8-9.
Huang, C. F., Erickson, D., and Akagi, M. (2007). "Perception of Japanese expressive speech: Comparison between Japanese and Taiwanese listeners," Proc. ASJ '2007 Fall Meeting, 1-4-6.
Nguyen B. P. and Akagi M. (2007). "Temporal decomposition-based speech spectra modeling using asymmetric Gaussian mixture model," Proc. ASJ '2007 Fall Meeting, 3-4-6.
Lu, X., Unoki, M., and Akagi, M. (2008/3/17), "Comparative evaluation of modulation transfer function based dereverberation for robust speech recognition," Proc. ASJ '2008 Spring Meeting, 1-10-12.
Nguyen B. P. and Akagi M. (2008/3/17). "Improvement of Peak Estimation using Gaussian Mixture Model for Speech Modification," Proc. ASJ '2008 Spring Meeting, 1-11-27.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008/3/19). "A two-stage binaural speech enhancement approach with adaptive filter and Wiener filter: Theory, implementation and evaluation," Proc. ASJ '2008 Spring Meeting, 3-6-8.
Li, J., Jiang, H., Fu, Q., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008/09/12). "Adaptive -order generalized spectral subtraction-based speech enhancement for cochlear implant patients," Proc. ASJ '2008 Fall Meeting, 3-8-5.
Zhou, Y., Li, J., Akagi, M., and Yan, Y. (2009/9/15). "Physiologically-Inspired Feature Extraction for Emotion Recognition," Proc. ASJ '2009 Fall Meeting, 1-R-11.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2009/9/16). "Subjective evaluation of TS-BASE/WF for speech enhancement and sound localization," Proc. ASJ '2009 Fall Meeting, 2-4-3.
Li, J., Yang, L., Zhang, J., Yan, Y., and Akagi, M. (2009/9/16). "Comparative evaluations of single-channel speech enhancement algorithms on Mandarin and English speech intelligibility," Proc. ASJ '2009 Fall Meeting, 2-P-24.
Li, J. and Akagi, M. (2009/3/8). "Intelligibility investigation of single-channel speech enhancement algorithms using Japanese corpus," Proc. ASJ '2010 Spring Meeting, 1-9-3.
Shih, T., Suemitsu, A., and Akagi, M. (2010/10/16). "Influences of real-time auditory feedback on formant perturbations," Proc. Auditory Research Meeting, ASJ, 40, 8, H-2010-121.
Phung, T. N., Luong, M. C., and Akagi, M. (2013/03/13). "Improving the flexibility of unit-selection speech synthesis with Temporal Decomposition," Proc. ASJ '2013 Spring Meeting, 1-7-16.
Chau, T. D., Li, J. and Akagi, M. (2013/03/13). "Binaural multiple-source localization in noisy reverberant environments based on Equalization-Cancellation model," Proc. ASJ '2013 Spring Meeting, 1-P-44.
Elbarougy, R., Tokuda, I., and Akagi, M. (2013/03/13). "Acoustic Analysis of Register Transition between Chest-to-Head Register in Singing Voice," Proc. ASJ '2013 Spring Meeting, 1-Q-7c.
Elbarougy, R. and Akagi, M. (2013/09/25). "Cross-lingual Speech Emotion Dimensions Estimation Based on a Three-Layer Model," Proc. ASJ '2013 Fall Meeting, 1-P-1a.
Phung, T. N. and Akagi, M. (2013/09/25). "Improving the naturalness of speech synthesized by HMM-based systems by producing an appropriate smoothness," Proc. ASJ '2013 Fall Meeting, 2-7-9.

Gloss

Nothing in English

Books

Ito, K. and Akagi, M. (2000). "A computational model of auditory sound localization based on ITD," In Recent Developments in Auditory Mechanics, World Scientific Publishing, 483-489.
Maki, K., Akagi, M. and Hirota, K. (2000). "Effect of the basilar membrane nonlinearities on rate-place representation of vowel in the cochlear nucleus: A modeling approach," In Recent Developments in Auditory Mechanics, World Scientific Publishing, 490-496.

Other

Matsuoka, R., Lu, X., Dang, J., and Akagi, M. (2004). "Investigation of interaction between speech perception and speech production," Proc. KIT Int. Sympo. Brain and Language 2004, 27-28.
Kozaki-Yamaguchi, Y., Suzuki, N., Fujita, Y., Yoshimasu, H., Akagi, M., and Amagasa, T. (2005). "Perception of hypernasality and its physical correlates," Oral Science International, 2, 1, 21-35.
Saitou, T., Unoki, M. and Akagi, M. (2005). "Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis," Speech Communication 46, 405-417.
Unoki, M., Toi, M., and Akagi, M. (2005). "Development of the MTF-based speech dereverberation method using adaptive time-frequency division," Proc. Forum Acousticum 2005, 51-56.
Li, J., Lu, X., and Akagi, M. (2005). "Noise reduction based on microphone array and post-filtering for robust speech recognition in car environments," Proc. Workshop DSPinCar2005, S2-9
Maki, K. and Akagi, M. (2005). "A computational model of cochlear nucleus neurons," In Auditory Signal Processing, Springer, 84-90.
Ito, K. and Akagi, M. (2005). "Study on improving regularity of neural phase locking in single neurons of AVCN via a computational model," In Auditory Signal Processing, Springer, 91-99.
Huang, C. F. and Akagi, M. (2005). "Toward a rule-based synthesis of emotional speech on linguistic description of perception," Affective Computing and Intelligent Interaction, Springer LNCS 3784, 366-373.
Li, J., Akagi, M., and Suzuki, Y. (2006). "Multi-channel noise reduction in noisy environments," Chinese Spoken Language Processing, Proc. ISCSLP2006, Springer LNCS 4274, 258-269.
Dang, J., Akagi, M., and Honda, K. (2006). "Communication between speech production and perception within the brain - Observation and simulation," J. Comp. Sci. & Tech., 21, 1, 95-105.
Li, J. and Akagi, M. (2006). "A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments," Speech Communication, 48, 111-126.
Vu, T. T., Unoki, M., and Akagi, M. (2006). "A study on restoration of bone-conducted speech with the lpc-based model," Proc. Int. Sympo. Frontiers in Speech and Hearing Research, 67-72.
Lu, X., Unoki, M., and Akagi, M. (2006). "Sub-band temporal envelope restoration for ASR in reverberation environment," Proc. Int. Sympo. Frontiers in Speech and Hearing Research, 73-78
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2006). "Adaptive -order generalized spectral subtraction for speech enhancement," Tech. Report of IEICE, EA2006-42.
Vu, T., Unoki, M., and Akagi, M. (2006). "A parameter estimation method for a bone-conducted speech restoration based on the linear presiction," Trans. Tech. Comm. Psychol. Physiol. Acoust., ASJ, 36, 7, H-2006-104.
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2007). "A speech enhancement approach for binaural hearing aids," Proc. 22th SIP Symposium, Sendai, 263-268.
Vu, T. T., Unoki, M., and Akagi, M. (2008/3/20) "A study on the LP-based blind model in restoring bone-conducted speech," Asian Student workshop, Tokyo SP2007-189
Haniu, A., Unoki, M., and Akagi, M. (2008/3/20) "Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments," Asian Student workshop, Tokyo SP2007-196
Li, J. and Akagi, M. (2008). "A hybrid microphone array post-filter in a diffuse noise field," Applied Acoustics 69, 546-557.
Li, J., Akagi, M., and Suzuki, Y. (2008). "A two-microphone noise reduction method in highly non-stationary multiple-noise-source environments," IEICE Trans. Fundamentals, E91-A, 6, 1337-1346.
Nguyen, B. P. and Akagi, M. (2009/02/20). "Applications of Temporal Decomposition to Voice Transformation," International symposium on biomechanical and physiological modeling and speech science, 19-24.
Akagi, M. (2009/02/20). "Introduction of SCOPE project: Analysis of production and perception characteristics of non-linguistic information in speech and its application to inter-language communications," International symposium on biomechanical and physiological modeling and speech science, 51-62.

JAIST Technical Report

Unoki, M. and Akagi, M. (1998). "A method of signal extraction from noisy signal based on auditory scene analysis", JAIST Tech. Report, IS-RR-98-0005P.
Unoki, M. and Akagi, M. (1998). "A computational model of co-modulation masking release", JAIST Tech. Report, IS-RR-98-0006P.
Ishimoto, Y., Unoki, M., and Akagi, M. (2005). "Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency," JAIST Tech. Report, IS-RR-2005-006.

Patent

Nothing in English

Japan Advanced Institute of Science and Technology Acoustic Information Science, Unoki Laboratory Japanese
TOP RESEARCH MEMBER PUBLICATION EQUIPMENT LINK	Publication (by Contents, by Time, by Field) Paper International conference Invited lecture Workshop in Japan Verbal presentation Gloss Books Other JAIST Technical Report Patent Paper Akagi, M. (1993). "Modeling of contextual effects based on spectral peak interaction", J. of Acoust. Society of America, 93, 2, 1076-1086. Kitamura, T. and Akagi, M. (1995). "Speaker individualities in speech spectral envelopes", J. Acoust. Soc. Jpn. (E), 16, 5, 283-289. Akagi, M. and Ienaga, T. (1997). "Speaker individuality in fundamental frequency contours and its control", J. Acoust. Soc. Jpn. (E), 18, 2 73-80. Unoki, M. and Akagi, M. (1997). ��A method for signal extraction from noise-added signals��, Electronics and Communications in Japan, Part 3, 80, 11, 1-11. Unoki, M. and Akagi, M. (1998). ��A method of signal extraction from noisy signal based on auditory scene analysis,�� Speech Communication, 27, 3-4, 261-279. Mizumachi, M. and Akagi, M. (2000). "The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises," J. Acoust. Soc. Jpn. (E), 21, 5 251-258. A. C. R. Nandasena, P. C. Nguyen, and M. Akagi (2001). " Spectral stability based event localizing temporal decomposition", Computer Speech & Language, Vol. 15, No. 4, 381-401 Akagi, M., Suzuki, N., Hayashi, K., Saito, H., and Michi, K. (2001). " Perception of Lateral Misarticulation and Its Physical Correlates", Folia Phoniatrica et Logopaedica, 53, 6, 291-307 Nguyen, P. C., Ochi, T., and Akagi, M.. (2003). ��Modified Restricted Temporal Decomposition and its Application of Low Rate Speech Coding,�� IEICE Trans. Inf. & Syst., E86-D, 3, 397-405. Ishimoto, Y. and Akagi, M. (2004). ��Fundamental frequency estimation for noisy speech using entropy-weighted periodic and harmonic features,�� IEICE Trans. Inf. & Syst., E87-D, 1, 205-214. Unoki, M., Furukawa, M., Sakata, K. and Akagi, M. (2004). "An improved method based on the MTF concept for restoring the power envelope from a reverberant signal," Acoust. Sci. & Tech., 25, 4, 232-242. Unoki, M., M., Sakata, Furukawa, K. and Akagi, M. (2004). "A speech dereverberation method based on the MTF concept in power envelope restoration," Acoust. Sci. & Tech., 25, 4, 243-254. Li, J. and Akagi, M. (2006). "Noise reduction method based on generalized subtractive beamformer," Acoust. Sci. & Tech., 27, 4, 206-215. Nakanishi, J., Unoki, M., and Akagi, M. (2006). "Effect of ITD and component frequencies on perception of alarm signals in noisy environments," Journal of Signal Processing, 10, 4, 231-234. Nishimoto, H. and Akagi, M. (2006). "Effects of complicated vocal tract shapes on vocal tract transfer functions," Journal of Signal Processing, 10, 4, 267-270. Saitou, T., Unoki, M., and Akagi, M. (2006). "Analysis of acoustic features affecting singing-voice perception and its application to singing-voice synthesis from speaking-voice using STRAIGHT," J. Acoust. Soc. Am., 120, 5, Pt. 2, 3029. Akagi, M., Dang, J., Lu, X., and Uchiyamada, T. (2006). "Investigation of interaction between speech perception and production using auditory feedback," J. Acoust. Soc. Am., 120, 5, Pt. 2, 3253. Unoki, M., Toi, M., Shibano, Y., and Akagi, M. (2006). "Suppression of speech intelligibility loss through a modulation transfer function-based speech dereverberation method," J. Acoust. Soc. Am., 120, 5, Pt. 2, 3360. Vu, T., Unoki, M., and Akagi, M. (2006). "A Study on Restoration of Bone-Conducted Speech with MTF-Based and LP-based Models," Journal of Signal Processing, 10, 6, 407-417. Unoki, M., Kubo, M., Haniu, A., and Akagi, M. (2006). "A Model-Concept of the Selective Sound Segregation: - A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments -," Journal of Signal Processing, 10, 6, 419-431. Nguyen B. P. and Akagi M. (2007). "Spectral Modification for Voice Gender Conversion using Temporal Decomposition," Journal of Signal Processing, 11, 4, 333-336. Tomoike, S. and Akagi, M. (2008). "Estimation of local peaks based on particle filter in adverse environments," Journal of Signal Processing, 12, 4, 303-306. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008). "Adaptive -order generalized spectral subtraction for speech enhancement," Signal Processing, vol. 88, no. 11, pp. 2764-2776, 2008. Lu, X., Unoki, M., and Akagi, M. (2008/11/1). "Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems," Acoustical Science and Technology, 29, 6, 351-361. Nguyen, B. P. and Akagi, M. (2009/5/1) "A flexible spectral modification method based on temporal decomposition and Gaussian mixture model," Acoust. Sci. & Tech., 30, 3, 170-179. Kinugasa, K., Unoki, M., and Akagi, M. (2009/07/01). "An MTF-based method for Blind Restoration for Improving Intelligibility of Bone-conducted Speech," Journal of Signal Processing, 13, 4, 339-342. Hamada, Y., Kitamura, T., and Akagi, M. (2010/07/01). "A study of brain activities elicited by synthesized emotional voices controlled with prosodic features," Journal of Signal Processing, 14, 4, 265-268. Morita, S., Unoki, M., and Akagi, M. (2010/07/01). "A study on the IMTF-based filtering on the modulation spectrum of reverberant signal," Journal of Signal Processing, 14, 4, 269-272. Kuroda, N., Li, J., Iwaya, Y., Unoki, M., and Akagi, M. (2011). "Effects of spatial cues on detectability of alarm signals in noisy environments," In Principles and applications of spatial hearing (Eds. Suzuki, Y., Brungart, D., Iwaya, Y., Iida, K., Cabrera, D., and Kato, H.), World Scientific, 484-493. Li, J., Yang, L., Zhang, J., Yan, Y., Hu, Y., Akagi, M., and Loizou, P. C. (2011/05). "Comparative intelligibility investigation of single-channel noise-reduction algorithms for Chinese, Japanese, and English," J. Acoust. Soc. Am., 129, 3291-3301. Chau, D. T., Li, J., and Akagi, M. (2011/07/01). "Towards intelligent binaural speech enhancement by meaningful sound extraction," Journal of Signal Processing, 15, 4, 291-294. Phung, T. N., Luong, M. C., and Akagi, M. (2012/08). "An investigation on speech perception under effects of coarticulation," International Journal of Computer and Electrical Engineering, Vol. 4, No. 4, 532-536. Phung, T. N., Luong, M. C., and Akagi, M. (2012/08). "On the stability of spectral targets under effects of coarticulation," International Journal of Computer and Electrical Engineering, Vol. 4, No. 4, 537-541. Phung, T. N., Unoki, M., and Akagi, M. (2012/09/01). "A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model," Journal of Signal Processing, 16, 5, 409-417. International conference Akagi, M. and Ienaga, T. (1995). "Speaker individualities in fundamental frequency contours and its control", Proc. EUROSPEECH95, 439-442. Yonezawa, Y. and Akagi, M. (1996). "Modeling of contextual effects and its application to word spotting", Proc. Int. Conf. Spoken Lang. Process. 96, 2063-2066. Kitamura, T. and Akagi, M. (1996). "Relationship between physical characteristics and speaker individualities in speech spectral envelopes", Proc ASA-ASJ Joint Meeting, 833-838. Akagi, M., Kitamura, T., Suzuki, N. and Michi, K. (1996). "Perception of lateral misarticulation and its physical correlates", Proc ASA-ASJ Joint Meeting, 933-936. Maki, K. and Akagi, M. (1997). "A functional model of the auditory peripheral system", Proc. ASVA97, Tokyo, 703-710. Unoki, M. and Akagi, M. (1997). "A method of signal extraction from noisy signal based on auditory scene analysis", Proc. CASA97, IJCAI-97, Nagoya, 93-102. Akagi, M. and Mizumachi, M. (1997). "Noise Reduction by Paired Microphones", Proc. EUROSPEECH97, 335-338. Unoki, M. and Akagi, M. (1997). "A method of signal extraction from noisy signal", Proc. EUROSPEECH97, 2587-2590. Nandasena, A.C.R. and Akagi, M. (1998). ��Spectral stability based event localizing temporal decomposition,�� Proc. ICASSP98, II, 957-960 Mizumachi, M. and Akagi, M. (1998). ��Noise reduction by paired-microphones using spectral subtraction,�� Proc. ICASSP98, II, 1001-1004 Maki, K., Hirota, K. and Akagi, M. (1998). ��A functional model of the auditory peripheral system: Responses to simple and complex stimuli,�� Computational Hearing, Italy, 13-18. Itoh, K. and Akagi, M. (1998). ��A computational model of auditory sound localization,�� Computational Hearing, Italy, 67-72 Unoki, M. and Akagi, M. (1998). ��A computational model of co-modulation masking release,�� Computational Hearing, Italy, 129-134. Unoki, M. and Akagi, M. (1998). ��Signal extraction from noisy signal based on auditory scene analysis,�� ICSLP98, Sydney, Vol.5, 2115-2118. Akagi, M., Iwaki, M. and Sakaguchi, N. (1998). ��Spectral sequence compensation based on continuity of spectral sequence,�� Proc. ICSLP98, Sydney, Vol.4, 1407-1410. Akagi, M., Iwaki, M. and Minakawa, T. (1998). ��Fundamental frequency fluctuation in continuous vowel utterance and its perception,�� ICSLP98, Sydney, Vol.4, 1519-1522. Mizumachi, M. and Akagi, M. (1999). "Noise reduction method that is equipped for robust direction finder in adverse environments," Proc. Workshop on Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, 179-182. Ito, K. and Akagi, M. (1999). "A computational model of auditory sound localization based on ITD," Abstracts of Symposium on Recent Developments in Auditory Mechanics, Sendai, Japan, 29P01, 156-157. Maki, K., Akagi, M. and Hirota, K. (1999). "Effect of the basilar membrane nonlinearities on rate-place representation of vowel in the cochlear nucleus: A modeling approach," Abstracts of Symposium on Recent Developments in Auditory Mechanics, Sendai, Japan, 29P06, 166-167. Unoki, M. and Akagi, M. (1999). "Segregation of vowel in background noise using the model of segregating two acoustic sources based on auditory scene analysis", Proc. CASA99, IJCAI-99, Stockholm, 51-60. Unoki, M. and Akagi, M. (1999). "Segregation of vowel in background noise using the model of segregating two acoustic sources based on auditory scene analysis", Proc. EUROSPEECH99, 2575-2578. Mizumachi, M. and Akagi, M. (1999). "An objective distortion estimator for hearing aids and its application to noise reduction," Proc. EUROSPEECH99, 2619-2622. Ishimoto, Y. and Akagi, M. (2000). "A fundamental frequency estimation method for noisy speech," Proc. WESTPRAC7, 161-164. Ito, K. and Akagi, M. (2000). "A study on temporal information based on the synchronization index using a computational model," Proc. WESTPRAC7, 263-266. Mizumachi, M. and Akagi, M. (2000). "Noise reduction using a small-scale microphone array under non-stationary signal conditions," Proc. WESTPRAC7, 421-424. Akagi, M. and Kitakaze, H. (2000). "Perception of synthesized singing voices with fine fluctuations in their fundamental frequency contours," Proc. ICSLP2000, Beijing, III-458-461. Mizumachi, M., Akagi, M. and Nakamura, S. (2000). "Design of robust subtractive beamformer for noisy speech recognition," Proc. ICSLP2000, Beijing, IV-57-60. Akagi, M., Mizumachi, M.,Ishimoto, Y., and Unoki, M. (2000). "Speech enhancement and segregation based on human auditory mechanisms", Proc. IS2000, Aizu, 246-253. Ito, K. and Akagi, M. (2000). "A computational model of binaural coincidence detection using impulses based on synchronization index." Proc, ISA2000 (BIS2000), Wollongong, Australia. Ishimoto, Y., Unoki, M., and Akagi, M. (2001). "A fundamental frequency estimation method for noisy speech based on periodicity and harmonicity", Proc. ICASSP2001, SPEECH-SF3, Salt Lake City. Akagi, M., Kakehi, M., Kawaguchi, M., Nishinuma, M., and Ishigami, A. (2001). "Noisiness estimation of machine working noise using human auditory model", Proc. Internoise2001, 2451-2454. Ishimoto, Y., Unoki, M., and Akagi, M. (2001). "A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency ", Proc. CRAC, Aalborg. Ishimoto, Y., Unoki, M., and Akagi, M. (2001). "A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency", Proc. EUROSPEECH2001, Aalborg, 2439-2442. Akagi, M. and Kago, T. (2002). " Noise reduction using a small-scale microphone array in multi noise source environment," Proc. ICASSP2002, Orlando, I-909-912. Nguyen, P. C. and Akagi, M.. (2002). "Improvement of the restricted temporal decomposition method for line spectral frequency parameters," Proc. ICASSP2002, Orlando, I-265-268. Nishimoto, H., Akagi, M., Kitamura, T., and Suzuki, N. (2002). "FEM analyses of three dimensional vocal tract models after tongue and mouth floor resection," NATO Advanced Study Institute 2002 Dynamics of Speech Production and Perception. Unoki, M., Saitou, T., and Akagi, M. (2002). "Effect of F0 fluctuations and development of F0 control model in singing voice perception," NATO Advanced Study Institute 2002 Dynamics of Speech Production and Perception. Saitou, T., Unoki, M., and Akagi, M. (2002). "Extraction of F0 dynamic characteristics and development of F0 control model in singing voice," Proc. ICAD2002, Kyoto. Nguyen, P. C. and Akagi, M.. (2002). "Limited error based event localizing temporal decomposition," Proc. EUSIPCO2002, Toulouse, 190. Nguyen, P. C. and Akagi, M.. (2002). "Coding speech at very low rates using STRAIGHT and temporal decomposition," Proc. ICSLP2002, Denver, 1849-1852. Akagi, M. (2002). "Perception of fundamental frequency fluctuation," HEA-02-003-IP, Forum Acousticum Sevilla 2002 (Invited). Unoki, M., Furukawa, M., and Akagi, M. (2002). "A method for recovering the power envelope from reverberant speech," SPA-Gen-002, Forum Acousticum Sevilla 2002. Nguyen, P. C. and Akagi, M.. (2002). "Variable rate speech coding using STRAIGHT and temporal decomposition," Proc. SCW2002, Tsukuba, 26-28. Nguyen, P. C., Akagi, M., and Ho, T. B. (2003). "Temporal decomposition: A promising approach to VQ-based speaker identification," Proc. ICASSP2003, Hong Kong, I-184-187. Unoki, M., Furukawa, M., Sakata, K., and Akagi, M. (2003). "A method based on the MTF concept for dereverberating the power envelope from the reverberant signal," Proc. ICASSP2003, Hong Kong, I-840-843. Nguyen, P. C., Akagi, M., and Ho, T. B. (2003). "Temporal decomposition: A promising approach to VQ-based speaker identification," Proc. ICME2003, Baltimore, V.III, 617-620. Maki, K. and Akagi, M. (2003). ��A computational model of cochlear nucleus neurons,�� Proc. ISH2003, 70-76. Ito, K. and Akagi, M. (2003). ��Study on improving regularity of neural phase locking in single neuron of AVCN via computational model,�� Proc. ISH2003, 77-83. Nguyen, P. C. and Akagi, M. (2003). ��Efficient quantization of speech excitation parameters using temporal decomposition,�� Proc. EUROSPEECH2003, Geneva, 449-452. Unoki, M., Sakata, K. and Akagi, M. (2003). ��A speech dereverberation method based on the MTF concept,�� Proc. EUROSPEECH2003, Geneva, 1417-1420. Unoki, M., Kubo, M., and Akagi, M. (2003). ��A model for selective segregation of a target instrument sound from the mixed sound of various instruments,�� Proc. ICMC2003, Singapore, 295-298. Akagi, M. and Nguyen, P. C. (2004). ��Temporal decomposition of speech and its application to speech coding and modification,�� Proc. Special Workshop in MAUI (SWIM), 1-4, 2004. Unoki, M., Sakata, K., Toi, M., and Akagi, M. (2004). ��Speech dereverberation based on the concept of the modulation transfer function,�� Proc. NCSP2004, Hawaii, 423-426. Saitou, T., Unoki, M., and Akagi, M. (2004). ��Development of the F0 control method for singing-voices synthesis,�� Proc. SP2004, Nara, 491-494. Saitou, T., Unoki, M., and Akagi, M. (2004). ��Control methods of acoustic parameters for singing-voice synthesis,�� Proc. ICA2004, 501-504. Nishimoto, H., Akagi, M., Kitamura, T. and Suzuki, N. (2004). ��Estimation of transfer function of vocal tract extracted from MRI data by FEM,�� Proc. ICA2004, 1473-1476. Ito, S., Dang, J., and Akagi, M. (2004). ��Investigation of the acoustic features of emotional speech using physiological articulatory model,�� Proc. ICA2004, 2225-2226. Kozaki, Y., Suzuki, N., Amagasa, T., and Akagi, M. (2004). ��Perception of hypernasality and its physical correlates,�� Proc. ICA2004. 3313-3316. Unoki, M., Toi, M., and Akagi, M. (2004). "A speech dereverberation method based on the MTF concept using adaptive time-frequency divisions," Proc. EUSIPCO2004, 1689-1692. Akagi, M., Nguyen, P. C., Saitou, T., Tsuji, N., and Unoki, M. (2004). "Temporal decomposition of speech and its application to speech coding and modification," Proc. KEST2004, 280-288. Saitou, T., Tsuji, N., Unoki, M. and Akagi, M. (2004). "Analysis of acoustic features affecting "singing-ness" and its application to singing-voice synthesis from speaking-voice," Proc. ICSLP2004, Cheju, Korea. Li, J. and Akagi, M. (2004). "Noise reduction using hybrid noise estimation technique and post-filtering," Proc. ICSLP2004, Cheju, Korea. Toi, M., Unoki, M. and Akagi, M. (2005). "Development of adaptive time-frequency divisions and a carrier reconstruction in the MTF-based speech dereverberation method," Proc. NCSP05, Hawaii, 355-358. Haniu, A., Unoki, M. and Akagi, M. (2005). "A study on a speech recognition method based on the selective sound segregation in noisy environment," Proc. NCSP05, Hawaii, 403-406. Kimura, K., Unoki, M. and Akagi, M. (2005). "A study on a bone-conducted speech restoration method with the modulation filterbank," Proc. NCSP05, Hawaii, 411-414. Li, J. and Akagi, M. (2005). "Suppressing localized and non-localized noises in arbitrary noise environments," Proc. HSCMA2005, Piscataway. Li, J., Lu, X., and Akagi, M. (2005). "A noise reduction system in arbitrary noise environments and its application to speech enhancement and speech recognition," Proc. ICASSP2005, Philadelphia, III-277-280. Huang, C. F. and Akagi, M. (2005). "A Multi-Layer fuzzy logical model for emotional speech Perception," Proc. EuroSpeech2005, Lisbon, Portugal, 417-420. Unoki, M., Kubo, M., Haniu, A., and Akagi, M. (2005). "A model for selective segregation of a target instrument sound from the mixed sound of various instruments," Proc. EuroSpeech2005, Lisbon, Portugal, 2097-2100. Li, J. and Akagi, M. (2005). "A hybrid microphone array post-filter in a diffuse noise field," Proc. EuroSpeech2005, Lisbon, Portugal, 2313-2316. Li, J. and Akagi, M. (2005). "Theoretical analysis of microphone arrays with postfiltering for coherent and incoherent noise suppression in noisy environments," Proc. IWAENC2005, Eindhoven, The Netherlands, 85-88. Nakanishi, J., Unoki, M., and Akagi, M. (2006). "Effect of ITD and component frequencies on perception of alarm signals in noisy environments," Proc. NCSP2006, 37-40. Vu, T. T., Unoki, M., and Akagi, M. (2006). "A study on an LPC-based restoration model for improving the voice-quality of bone-conducted speech," Proc. NCSP2006, 110-113. Nishimoto, H. and Akagi, M. (2006). "Effects of complicated vocal tract shapes on vocal tract transfer functions," Proc. NCSP2006, 114-117. Takeyama, Y., Unoki, M., Akagi, M., and Kaminuma, A. (2006). "Synthesis of mimic speech sounds uttered in noisy car environments," Proc. NCSP2006, 118-121. Lu, X., Unoki, M., and Akagi, M. (2006). "MTF-based sub-band power envelope restoration in reverberant environment for robust speech recognition, " Proc. NCSP2006, 162-165. Unoki, M., Toi, M., and Akagi, M. (2006). "Refinement of an MTF-based speech dereverberation method using an optimal inverse-MTF filter," SPECOM2006, St. Petersburg, 323-326. Li, J., Akagi, M., and Suzuki, Y. (2006). "Noise reduction based on generalized subtractive beamformer for speech enhancement," WESPAC2006, Seoul Li, J, Akagi, M., and Suzuki, Y. (2006). "Improved hybrid microphone array post-filter by integrating a robust speech absence probability estimator for speech enhancement," Proc. ICSLP2006, Pittsburgh, USA, 2130-2133. Lu, X., Unoki, M., and Akagi, M. (2006). "A robust feature extraction based on the MTF concept for speech recognition in reverberant environment," Proc. ICSLP2006, Pittsburgh, USA, 2546-2549. Vu, T., Unoki, M., and Akagi, M. (2006). "A study on an LP-based model for restoring bone-conducted speech," Proc. HUT-ICCE2006, Hanoi. Li, J., Akagi, M., and Suzuki, Y. (2006). "Noise reduction based on microphone array and post-filtering for robust speech recognition," Proc. ICSP, Guilin. Minowa A., Unoki M., and Akagi M. (2007). "A study on physical conditions for auditory segregation/integration of speech signals based on auditory scene analysis," Proc. NCSP2007, 313-316. Nguyen B. P. and Akagi M. (2007). "Spectral Modification for Voice Gender Conversion using Temporal Decomposition," Proc. NCSP2007, 481-484. Uchiyama H., Unoki M., and Akagi M. (2007). "A study on perception of alarm signal in car environments," Proc. NCSP2007, 389-392. Akagi, M., Saitou, T., and Huang, C-F. (2007). "Voice conversion to add non-linguistic information into speaking voices," Proc. JCA2007, CD-ROM. Haniu, A., Unoki, M. and Akagi, M. (2007). "A study on a speech recognition method based on the selective sound segregation in noisy environment," Proc. JCA2007, CD-ROM. Huang, C-H. and Akagi, M. (2007). "The building and verification of a three-layered model for expressive speech perception," Proc. JCA2007, CD-ROM. Sawamura K., Dang J., Akagi M., Erickson D., Li, A., Sakuraba, K., Minematsu, N., and Hirose, K. (2007). "Common factors in emotion perception among different cultures," Proc. ICPhS2007, 2113-2116. Nguyen, P. C., Akagi, M., and Nguyen, P. B. (2007). "Limited error based event localizing temporal decomposition and its application to variable-rate speech coding," Speech Communication, 49, 292-304. Nguyen B. P. and Akagi M. (2007). "A flexible spectral modification method based on temporal decomposition and Gaussian mixture model," Proc. Interspeech2007, 538-541. Huang, C. F. and Akagi, M. (2007). "A rule-based speech morphing for verifying an expressive speech perception model," Proc. Interspeech2007, 2661-2664. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2007). "Noise reduction based on adaptive ��-order generalized spectral subtraction for speech enhancement," Proc. Interspeech2007, 802-805. Saitou, T., Goto, M., Unoki, M., and Akagi, M. (2007). "Vocal conversion from speaking voice to singing voice using STRAIGHT," Proc. Interspeech2007, Singing Challenge. Vu, T. T., Seide, G., Unoki, M., and Akagi, M. (2007). "Method of LP-based blind restoration for improving intelligibility of bone-conducted speech," Proc. Interspeech2007, 966-969. Haniu, A., Unoki, M. and Akagi, M. (2007). "A study on a speech recognition method based on the selective sound segregation in various noisy environments," Proc. NOLTA2007, Vancouver, 445-448. Vu, T. T., Unoki, M., and Akagi, M. (2007). "A blind restoration model for bone-conducted speech based on a linear prediction scheme," Proc. NOLTA2007, Vancouver, 449-452. Uchiyama, H., Unoku, M., and Akagi, M. (2007). "Improvement in detectability of alarm signals in noisy environments by utilizing spatial cues," Proc. WASPAA2007, New Paltz, NY, pp.74-77. Saitou, T., Goto, M., Unoku, M., and Akagi, M. (2007). "Speech-to-singing synthesis: converting speaking voices to singing voices by controlling acoustic features unique to singing voices," Proc. WASPAA2007, New Paltz, NY, pp.215-218 Vu, T. T. Unoki, M. and Akagi, M. (2007). "The Construction of Large-scale Bone-conducted and Air-conducted Speech Databases for Speech Intelligibility Tests," Proc. Oriental COCOSDA2007, 88-91. Kusaba, M., Unoki, M., and Akagi, M. (2008/3/6). "A study on detectability of target signal in background noise by utilizing similarity of temporal envelopes in auditory search," Proc. NCSP08, 13-16. Haniu, A., Unoki, M., and Akagi, M. (2008/3/6). "A speech recognition method based on the selective sound segregation in various noisy environments," Proc. NCSP08, 168-171. Tezuka, T. and Akagi, M. (2008/3/6). "Influence of spectrum envelope on phoneme perception," Proc. NCSP08, 176-179. Shibata, T. and Akagi, M. (2008/3/6). "A study on voice conversion method for synthesizing stimuli to perform gender perception experiments of speech," Proc. NCSP08, 180-183. Nguyen B. P. and Akagi M. (2008/3/7). "Control of spectral dynamics using temporal decomposition in voice conversion and concatenative speech synthesis," Proc. NCSP08, 279-282. Vu, T. T., Unoki, M., and Akagi, M. (2008/3/7). "A study of blind model for restoring bone-conducted speech based on liner prediction scheme," Proc. NCSP08, 287-290. Tomoike, S. and Akagi, M. (2008/3/7). "Estimation of local peaks based on particle filter in adverse environments," Proc. NCSP08, 391-394 Akagi, M. (2008/6/5). "Voice conversion to add non-linguistic information into speaking voices," ICCE2008, Tutorial (Hoian, Vietnam). Vu, T. T. Unoki, M. and Akagi, M. (2008/6/5). "An LP-based blind model for restoring bone-conducted speech," Proc. ICCE2008, 212-217. Nguyen B. P. and Akagi M. (2008/6/6). "Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture model," Proc. ICCE2008, 224-229. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008/06/30). "A two-stage binaural speech enhancement approach for hearing aids with preserving binaural benefits in noisy environments," Acoustics2008, Paris, 723-727. Lu, X., Unoki, M., and Akagi, M. (2008/07/01). "An MTF-based blind restoration for temporal power envelopes as a front-end processor for automatic speech recognition systems in reverberant environments," Acoustics2008, Paris, 1419-1424. Huang, C. F., Erickson, D., and Akagi, M. (2008/07/01). "Comparison of Japanese expressive speech perception by Japanese and Taiwanese listeners," Acoustics2008, Paris, 2317-2322. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008/08/16). "Improved two-stage binaural speech enhancement based on accurate interference estimation for hearing aids," IHCON2008 Li, J., Jiang, H., and Akagi, M. (2008/09/23). "Psychoacoustically-motivated adaptive ��-order generalized spectral subtraction based on data-driven optimization," Proc. InterSpeech2008, Brisbane, 171-174. Petric, R., Lu, X., Unoki, M., Akagi, M., and Hoffmann, R. (2008/09/24). "Robust front end processing for speech recognition in reverberant environments: Utilization of speech characteristics," Proc. InterSpeech2008, Brisbane, 658-661. Nguyen, B. P., Shibata, T., and Akagi, M. (2008/09/24). "High-quality analysis/synthesis method based on Temporal decomposition for speech modification," Proc. InterSpeech2008, Brisbane, 662-665. Huang, C-F. and Akagi, M. (2008/10) "A three-layered model for expressive speech perception," Speech Communication 50, 810-828. Kuroda, N., Li, J., Iwaya, Y., Unoki, M., and Akagi, M. (2009/03/01). "Effects from Spatial Cues on Detectability of Alarm Signals in Car Environments," Proc. NCSP'09, 45-48. Kinugasa, K., Unoki, M., and Akagi, M. (2009/03/01). "An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech," Proc. NCSP'09, 105-108. Aoki, Y., Huang, C-F., and Akagi, M. (2009/03/01). "An emotional speech recognition system based on multi-layer emotional speech perception model," Proc. NCSP'09, 133-136. Nakamura, T., Kitamura, T. and Akagi, M. (2009/03/01). "A study on nonlinguistic feature in singing and speaking voices by brain activity measurement," Proc. NCSP'09, 217-220. Li, J., Fu, Q-J., Jiang, H., and Akagi, M. (2009/04/24). "Psychoacoustically-motivated adaptive ��-order generalized spectral subtraction for cochlear implant patients," Proc ICASSP2009, 4665-4668. Kinugasa, K., Unoki, M., and Akagi, M. (2009/6/24). "An MTF-based method for blindly restoring bone-conducted speech," Proc. SPECOM2009, St. Petersburg, Russia, 199-204. Akagi, M. (2009/08/14). "Multi-layer model for expressive speech perception and its application to expressive speech synthesis," Plenary lecture, NCMMSC2009, Lanzhou, China. Saitou, T., Goto, M., Unoki, M., and Akagi, M. "Speech-to-Singing Synthesis System: Vocal Conversion from Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices," NCMMSC2009, Lanzhou, China (2009/08/15). Unoki, M., Yamasaki, Y., and Akagi, M. (2009/08/25). "MTF-based power envelope restoration in noisy reverberant environments," Proc. EUSIPCO2009, Glasgow, Scotland, 228-232. Nguyen B. P. and Akagi M. (2009/9/9). "Efficient modeling of temporal structure of speech for applications in voice transformation," Proc. InterSpeech2009, Brighton, 1631-1634. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2009/9/21). "Advancement of two-stage binaural speech enhancement (TS-BASE) for high-quality speech communication," Proc. WESPAC2009, Beijing, CD-ROM. Haniu, A., Unoki, M., and Akagi, M. (2009/9/21). "A psychoacoustically-motivated conceptual model for automatic speech recognition," Proc. WESPAC2009, Beijing, CD-ROM. Akagi, M. (2009/10/6). "Analysis of production and perception characteristics of non-linguistic information in speech and its application to inter-language communications," Proc. APSIPA2009, Sapporo, 513-519. Dang, J., Li, A., Erickson, D., Suemitsu, A., Akagi, M., Sakuraba, K., Minematsu, N., and Hirose, K. (2009/10/6). "Comparison of emotion perception among different cultures," Proc. APSIPA2009, Sapporo, 538-544. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2009/10/20). "Two-stage binaural speech enhancement with Wiener filter based on equalization-cancellation model," Proc. WASPAA, New Palts, NY, 133-136. Naoki Kuroda, Junfeng Li, Yukio Iwaya, Masashi Unoki, and Masato Akagi, "Effects of spatial cues on detectability of alarm signals in noisy environments," Proc. IWPASH2009, P7. Zaou, Japan, Nov. 2009 (CDROM). Morita, S., Unoki, M., and Akagi, M. (2010/03/04). "A study on the MTF-based inverse filtering for the modulation spectrum of reverberant speech," Proc. NCSP10, Hawaii, USA, 265-268. Li, J. Sasaki, Y., Akagi, M. and Yan, Y. (2010/03/04). "Experimental evaluations of TS-BASE/WF in reverberant conditions," Proc. NCSP10, Hawaii, USA, 269-272. Hamada, Y., Kitamura, T., and Akagi, M. (2010/03/04). "A study on brain activities elicited by synthesized emotional voices controlled with prosodic features," Proc. NCSP10, Hawaii, USA, 472-475. Ishida, M. and Akagi, M. (2010/03/04). "Pitch perception of complex sounds with varied fundamental frequency and spectral tilt," Proc. NCSP10, Hawaii, USA, 480-483. Hamada, Y., Kitamura, T., and Akagi, M. (2010/08/24). "A study on brain activities elicited by emotional voices with various F0 contours," Proc. ICA2010, Sydney, Australia. Chau, D. T., Li, J., and Akagi, M. (2010/09/30). "A DOA estimation algorithm based on equalization-cancellation theory," Proc. INTERSPEECH2010, Makuhari, 2770-2773. Akagi, M. (2010/11/29). "Rule-based voice conversion derived from expressive speech perception model: How do computers sing a song joyfully?" Tutorial, ISCSLP2010, Tainan, Taiwan �ʾ��Թֱ��. Li, J., Chau, D. T., Akagi, M., Yang, L., Zhang, J., and Yan, Y. (2010/11/30). "Intelligibility Investigation of Single-Channel Noise Reduction Algorithms for Chinese and Japanese," in: Proc. ISCSLP2010, Tainan, Taiwan. Phung, T. N., Unoki, M. and Akagi, M. (2010/12/14). "Improving Bone-Conducted Speech Restoration in noisy environment based on LP scheme," Proc. APSIPA2010, Student Symposium, 12. Trung-Nghia Phung, Mai Chi Luong, and Masato Akagi (2011/02). "An investigation on speech perception over coarticulation," Proc. ICSAP2011, VI, 507-511. Trung-Nghia Phung, Mai Chi Luong, and Masato Akagi (2011/02). "An investigation on perceptual line spectral frequency (PLP-LSF) target stability against the vowel neutralization phenomenon," Proc. ICSAP2011, VI, 512-514. Kosugi, T., Haniu, A., Miyauchi, R., Unoki, M., and Akagi, M. (2011/03/01). "Study on suitable-architecture of IIR all-pass filter for digital-audio watermarking technique based on cochlear-delay characteristics,"Proc. NCSP2011, Tianjin, China, 135-138. Mizukawa, S. and Akagi, M. (2011/03/02). "A binaural model accounting for spatial masking release," Proc. NCSP2011, Tianjin, China, 179-182. Yano, Y., Miyauchi, R., Unoki, M., and Akagi, M. (2011/03/02). "Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals," Proc. NCSP2011, Tianjin, China, 231-234. Ikeda, T., Unoki, M., and Akagi, M. (2011/03/02). "Study on blind estimation of Speech Transmission Index in room acoustics,"Proc. NCSP2011, Tianjin, China, 235-238. Morita, S., Lu, X., Unoki, M., and Akagi, M. (2011/03/02). "Study on MTF-based power envelope restoration in noisy reverberant environments," Proc. NCSP2011, Tianjin, China, 247-250. Shih, T, Suemitsu, A., and Akagi, M. (2011/03/03). "Influences of transformed auditory feedback with first three formant frequencies," Proc. NCSP2011, Tianjin, China, 340-343. Chau, D. T., Li, J., and Akagi, M. (2011/03/03). "Towards an intelligent binaural speech enhancement system by integrating meaningful signal extraction,"Proc. NCSP2011, Tianjin, China, 344-347. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2011/06). "Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication," Speech Communication 53 677-689. Unoki, M., Ikeda, T., and Akagi, M. (2011/06/27). "Blind estimation method of speech transmission index in room acoustics," Proc. Forum Acousticum 2011, Aalborg, Denmark, 1973-1978. Unoki, M., Lu, X., Petrick, R., Morita, S., Akagi, M., and Hoffmann R. (2011/08/30). "Voice activity detection in MTF-based power envelope restoration," Proc. INTERSPEECH 2011, Florence, Italy, 2609-2612. Shota Morita, Xugang Lu, Masashi Unoki, Masato Akagi, and Ruediger Hoffmann, "MTF-based sub-band power-envelope restoration for robust speech recognition in noisy reverberant environments," Proc. APSIPA2011, Xi'an, Oct. 2011 (CDROM). Sasaki, Y. and Akagi, M. (2012/03/05), "Speech enhancement technique in noisy reverberant environment using two microphone arrays," Proc. NCSP2012, Honolulu, HW, 333-336. Izumida, T. and Akagi, M. (2012/03/05). "Study on hearing impression of speaker identification focusing on dynamic features," Proc. NCSP2012, Honolulu, HW, 401-404. Yano, Y., Miyauchi, R., Unoki, M., and Akagi, M. (2012/03/06). "Study on detectability of signals by utilizing differences in their amplitude modulation," Proc. NCSP2012, Honolulu, HW, 611-614. Xia, R., Li, J., Akagi, M., and Yan, Y. (2012/03/28). "Evaluation of objective intelligibility prediction measures for noise-reduced signals in Mandarin," Proc. ICASSP2012, Kyoto, 4465-4468. Akagi, M. and Irie, Y. (2012/08/22). "Privacy protection for speech based on concepts of auditory scene analysis," Proc. INTERNOISE2012, New York, 485. Elbarougy, R. and Akagi, M. (2012/12/04). "Speech Emotion Recognition System Based on a Dimensional Approach Using a Three-Layered Model," Proc. APSIPA2012, Hollywood, USA. Phung, T. N., Luong, M. C., and Akagi, M. (2012/12/06). "A concatenative speech synthesis for monosyllabic languages with limited data," Proc. APSIPA2012, Hollywood, USA. Phung, T. N., Luong, M. C., and Akagi, M. (2012/12/12). "Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages," Proc. O-COCOSDA2012, Macau, 129-134. Motoda, H. and Akagi, M. (2013/03/05). "A singing voices synthesis system to characterize vocal registers using ARX-LF model," Proc. NCSP2013, Hawaii, USA, 93-96. Hisatsune, H. and Akagi, M. (2013/03/05). "A Study on individualization of Head-Related Transfer Function in the median plane," Proc. NCSP2013, Hawaii, USA, 161-164. Kubo, R. and Akagi, M. (2013/06/04). "Exploring auditory aging can exclusively explain Japanese adults�� age-related decrease in training effects of American English /r/-/l/," Proc. ICA2013, 2aSC34, Montreal. Unoki, M., Ikeda, T., Sasaki, K., Miyauchi, R., Akagi, M., and Kim, N-S. (2013/07/08). "Blind method of estimating speech transmission index in room acoustics based on concept of modulation transfer function," Proc. ChinaSIP2013, Beijing, 308-312. Chau, D. T., Li, J., and Akagi, M. (2013/07/08). "Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio," Proc. ChinaSIP2013, Beijing, 322-326. Li, J., Akagi, M., and Yan, Y. (2013/07/09). "Objective Japanese intelligibility prediction for noisy speech signals before and after noise-reduction processing," Proc. ChinaSIP2013, Beijing, 352-355. Li, J, Chen, F., Akagi, M., and Yan, Y. (2013/08/27), "Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese," Proc. InterSpeech2013, Lyon, 1184-1187. Phung, T. N., Luong, M. C., and Akagi, M. (2013/09/02). "A Hybrid TTS between Unit Selection and HMM-based TTS under limited data conditions," Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain 281-284. Nishie, S. and Akagi, M. (2013/09/11). "Acoustic sound source tracking for a moving object using precise Doppler-shift measurement," Proc. EUSIPCO2013, Marrakesh, Morocco. Akagi, M. and Hisatsune, H. (2013/10/17). "Admissible range for individualization of head-related transfer function in median plane," Proc. IIHMSP2013, Beijing. Elbarougy, R. and Akagi, M. (2013/11/01). "Cross-lingual speech emotion recognition system based on a three-layer model for human perception," Proc. APSIPA2013, Kaohsiung, Taiwan. Invited lecture Akagi, M. (2007/12/6). "Conversion of speaking voice into singing voice," ICT Forum Hanoi, Invited talk (Hanoi, Vietnam) Workshop in Japan Li, J. and Akagi, M. (2004). "A noise reduction system in localized and non-localized noise environments," Tech. Report of IEICE, EA2004-34. Li, J. and Akagi, M. (2005). "A noise reduction method based on a generalized subtractive beamformer," Tech. Report of IEICE, EA2005-44. Vu T. T., Unoki M., and Akagi, M. (2007). "An LP-based blind restoration method for improving intelligibility of bone-conducted speech," �ŻҾ��̿��ز񵻽��SP2006-172. Nguyen B. P. and Akagi M. (2007). "A flexible temporal decomposition-based spectral modification method using asymmetric Gaussian mixture model," �ŻҾ��̿��ز񵻽��SP2007-25. Huang C. F., Erickson, D., and Akagi M. (2007). "A study on expressive speech and perception of semantic primitives: Comparison between Taiwanese and Japanese," �ŻҾ��̿��ز񵻽��SP2007-32. Unoki M., Vu T. T., Seide J., and Akagi, M. (2007). "Evaluation of an LP-based blind restoration method to improve intelligibility of BC speech," �ŻҾ��̿��ز񵻽��SP2007-41. Petric, R., Lu, X., Unoki, M., Akagi, M., and Hoffmann, R. (2008/07/17). "Robust front end processing for speech recognition in reverberant environments: Utilization of Speech Properties," �ŻҾ��̿��ز񵻽��SP2008-44. Yang, L., Li, J., Zhang, J., Yan, Y., and Akagi, M. (2009/6/26). "Effects of single-channel enhancement algorithms on Mandarin speech intelligibility," IEICE Tech. Report, EA2009-32. Chau, D. T., Li, J., and Akagi, M. (2010/06/11). "A DOA estimation algorithm based on equalization-cancellation theory," IEICE Tech. Report, EA2010-28. Phung, T. N., Unoki, M., and Akagi, M. (2010/06/11). "Comparative evaluation of bone-conducted-speech restoration based on linear prediction scheme," IEICE Tech. Report, EA2010-31. Zhou, Y., Li, J., Sun, Y., Zhang, J., Yan, Y., and Akagi, M. (2010/10). "A hybrid speech emotion recognition system based on spectral and prosodic features," IEICE Trans. Info. & Sys., E93D (10): 2813-2821. Dang, J., Li, A., Erickson, D., Suemitsu, A., Akagi, M., Sakuraba, K., Mienmatasu, N., and Hirose, K. (2010/11/01). "Comparison of emotion perception among different cultures," Acoust. Sci. & Tech. 31, 6, 394-402. Phung, T. N., Luong, M. C., and Akagi, M. (2012/06/14). "A low-cost concatenative TTS for monosyllabic languages," IEICE Tech. Report, SP-2012-35. Elbarougy, R. and Akagi, M. (2012/06/14). "Comparison of methods for emotion dimensions estimation in speech using a three-layered model," IEICE Tech. Report, SP-2012-36. Elbarougy, R. and Akagi, M. (2013/03/01). "Automatic Speech Emotion Recognition Using A Three Layer Model," �ŻҾ��̿��ز񵻽��SP2012-127. Chau, D. T., Li, J., and Akagi, M. (2013/05/16). "Adaptive equalization-cancellation model and its application to sound localization in noisy reverberant environments," IEICE Tech. Report, EA2013-24, SP-2013-24. Verbal presentation Nandasena, A.C.R. and Akagi, M. (1997). "A new approach to temporal decomposition of speech", Proc. ASJ '97 Spring Meeting, 2-7-6. Nandasena, A.C.R. and Akagi, M. (1997). "S2BEL Temporal Decomposition for Efficient Spectral Coding", Proc. ASJ '97 Fall Meeting, 1-P-21. Nandasena, A.C.R. and Akagi, M. (1998). "Temporal decomposition of speech excitation parameters," Proc. ASJ '98 Spring Meeting, 3-7-12. Nguyen, P. C. and Akagi, M. (2001). "Limited error based event localizing temporal decomposition", Proc. ASJ '2002 Spring Meeting, 3-10-12. Nguyen, P. C. and Akagi, M.. (2002). "Variable rate speech coding based on STRAIGHT using temporal decomposition," Proc. ASJ '2002 Fall Meeting, 1-10-4. Nguyen, P. C. and Akagi, M. (2003). "On the application of temporal decomposition to VQ-based speaker identification," Proc. ASJ '2003 Spring Meeting, 1-10-4. Li, J. and Akagi, M. (2004). ��A hybrid noise reduction method using single- and multi-channel techniques,�� Proc. ASJ '2004 Spring Meeting, 3-P-2. Huang, C. F. and Akagi, M. (2004). "A perceptual model of emotional speech build by Fuzzy logic," Proc. ASJ '2004 Fall Meeting, 2-2-8. Li, J. and Akagi, M. (2004). "Multi-channel post-filtering in diffuse noise environment," Proc. ASJ '2004 Fall Meeting, 2-3-10. Huang, C. F. and Akagi, M. (2004). "A multi-layer fuzzy logical model for emotional speech perception," Trans. Psycho. & Physio. Acoust., ASJ, H-2004-95. Huang, C. F. and Akagi, M. (2005). "Rule-Based Speech Morphing for Evaluating Linguistic Descriptions of Emotional Speech Perception," Proc. ASJ '2005 Fall Meeting, 1-6-3. Li, J. and Akagi, M. (2005). "A noise reduction method based on a generalized subtractive beamformer," Proc. ASJ '2005 Fall Meeting, 2-2-19. Vu, T. T., Unoki, M., and Akagi, M. (2006). "A method for restoring bone-conducted speech base on LPC model," Proc. ASJ '2006 Spring Meeting, 1-3-3 Li, J., Akagi, M., and Suzuki, Y. (2006). "Two-microphone noise reduction with preserving ITD cues in highly non-stationary multi-noise-source environments," Proc. ASJ '2006 Spring Meeting, 3-5-10 Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2006). "Generalized spectral subtraction based on sub-band SNR," Proc. ASJ '2006 Fall Meeting, 1-1-21. Vu, T., Unoki, M., and Akagi, M. (2006). "A study on predicting parameters of LP-based model for restoring bone-conducted speech," Proc. ASJ '2006 Fall Meeting, 2-5-1. Nguyen B. P., Huang C. F., and Akagi M. (2007). "Temporal decomposition-based spectral modification and its application to emotional speech synthesis," Proc. ASJ '2007 Spring Meeting, 3-8-8. Huang C. F., Nguyen B. P., and Akagi M. (2007). "Rule-Based Speech Morphing for Evaluating Emotional Speech Perception Model," Proc. ASJ '2007 Spring Meeting, 3-8-9. Huang, C. F., Erickson, D., and Akagi, M. (2007). "Perception of Japanese expressive speech: Comparison between Japanese and Taiwanese listeners," Proc. ASJ '2007 Fall Meeting, 1-4-6. Nguyen B. P. and Akagi M. (2007). "Temporal decomposition-based speech spectra modeling using asymmetric Gaussian mixture model," Proc. ASJ '2007 Fall Meeting, 3-4-6. Lu, X., Unoki, M., and Akagi, M. (2008/3/17), "Comparative evaluation of modulation transfer function based dereverberation for robust speech recognition," Proc. ASJ '2008 Spring Meeting, 1-10-12. Nguyen B. P. and Akagi M. (2008/3/17). "Improvement of Peak Estimation using Gaussian Mixture Model for Speech Modification," Proc. ASJ '2008 Spring Meeting, 1-11-27. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008/3/19). "A two-stage binaural speech enhancement approach with adaptive filter and Wiener filter: Theory, implementation and evaluation," Proc. ASJ '2008 Spring Meeting, 3-6-8. Li, J., Jiang, H., Fu, Q., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2008/09/12). "Adaptive -order generalized spectral subtraction-based speech enhancement for cochlear implant patients," Proc. ASJ '2008 Fall Meeting, 3-8-5. Zhou, Y., Li, J., Akagi, M., and Yan, Y. (2009/9/15). "Physiologically-Inspired Feature Extraction for Emotion Recognition," Proc. ASJ '2009 Fall Meeting, 1-R-11. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2009/9/16). "Subjective evaluation of TS-BASE/WF for speech enhancement and sound localization," Proc. ASJ '2009 Fall Meeting, 2-4-3. Li, J., Yang, L., Zhang, J., Yan, Y., and Akagi, M. (2009/9/16). "Comparative evaluations of single-channel speech enhancement algorithms on Mandarin and English speech intelligibility," Proc. ASJ '2009 Fall Meeting, 2-P-24. Li, J. and Akagi, M. (2009/3/8). "Intelligibility investigation of single-channel speech enhancement algorithms using Japanese corpus," Proc. ASJ '2010 Spring Meeting, 1-9-3. Shih, T., Suemitsu, A., and Akagi, M. (2010/10/16). "Influences of real-time auditory feedback on formant perturbations," Proc. Auditory Research Meeting, ASJ, 40, 8, H-2010-121. Phung, T. N., Luong, M. C., and Akagi, M. (2013/03/13). "Improving the flexibility of unit-selection speech synthesis with Temporal Decomposition," Proc. ASJ '2013 Spring Meeting, 1-7-16. Chau, T. D., Li, J. and Akagi, M. (2013/03/13). "Binaural multiple-source localization in noisy reverberant environments based on Equalization-Cancellation model," Proc. ASJ '2013 Spring Meeting, 1-P-44. Elbarougy, R., Tokuda, I., and Akagi, M. (2013/03/13). "Acoustic Analysis of Register Transition between Chest-to-Head Register in Singing Voice," Proc. ASJ '2013 Spring Meeting, 1-Q-7c. Elbarougy, R. and Akagi, M. (2013/09/25). "Cross-lingual Speech Emotion Dimensions Estimation Based on a Three-Layer Model," Proc. ASJ '2013 Fall Meeting, 1-P-1a. Phung, T. N. and Akagi, M. (2013/09/25). "Improving the naturalness of speech synthesized by HMM-based systems by producing an appropriate smoothness," Proc. ASJ '2013 Fall Meeting, 2-7-9. Gloss Nothing in English Books Ito, K. and Akagi, M. (2000). "A computational model of auditory sound localization based on ITD," In Recent Developments in Auditory Mechanics, World Scientific Publishing, 483-489. Maki, K., Akagi, M. and Hirota, K. (2000). "Effect of the basilar membrane nonlinearities on rate-place representation of vowel in the cochlear nucleus: A modeling approach," In Recent Developments in Auditory Mechanics, World Scientific Publishing, 490-496. Other Matsuoka, R., Lu, X., Dang, J., and Akagi, M. (2004). "Investigation of interaction between speech perception and speech production," Proc. KIT Int. Sympo. Brain and Language 2004, 27-28. Kozaki-Yamaguchi, Y., Suzuki, N., Fujita, Y., Yoshimasu, H., Akagi, M., and Amagasa, T. (2005). "Perception of hypernasality and its physical correlates," Oral Science International, 2, 1, 21-35. Saitou, T., Unoki, M. and Akagi, M. (2005). "Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis," Speech Communication 46, 405-417. Unoki, M., Toi, M., and Akagi, M. (2005). "Development of the MTF-based speech dereverberation method using adaptive time-frequency division," Proc. Forum Acousticum 2005, 51-56. Li, J., Lu, X., and Akagi, M. (2005). "Noise reduction based on microphone array and post-filtering for robust speech recognition in car environments," Proc. Workshop DSPinCar2005, S2-9 Maki, K. and Akagi, M. (2005). "A computational model of cochlear nucleus neurons," In Auditory Signal Processing, Springer, 84-90. Ito, K. and Akagi, M. (2005). "Study on improving regularity of neural phase locking in single neurons of AVCN via a computational model," In Auditory Signal Processing, Springer, 91-99. Huang, C. F. and Akagi, M. (2005). "Toward a rule-based synthesis of emotional speech on linguistic description of perception," Affective Computing and Intelligent Interaction, Springer LNCS 3784, 366-373. Li, J., Akagi, M., and Suzuki, Y. (2006). "Multi-channel noise reduction in noisy environments," Chinese Spoken Language Processing, Proc. ISCSLP2006, Springer LNCS 4274, 258-269. Dang, J., Akagi, M., and Honda, K. (2006). "Communication between speech production and perception within the brain - Observation and simulation," J. Comp. Sci. & Tech., 21, 1, 95-105. Li, J. and Akagi, M. (2006). "A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments," Speech Communication, 48, 111-126. Vu, T. T., Unoki, M., and Akagi, M. (2006). "A study on restoration of bone-conducted speech with the lpc-based model," Proc. Int. Sympo. Frontiers in Speech and Hearing Research, 67-72. Lu, X., Unoki, M., and Akagi, M. (2006). "Sub-band temporal envelope restoration for ASR in reverberation environment," Proc. Int. Sympo. Frontiers in Speech and Hearing Research, 73-78 Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2006). "Adaptive -order generalized spectral subtraction for speech enhancement," Tech. Report of IEICE, EA2006-42. Vu, T., Unoki, M., and Akagi, M. (2006). "A parameter estimation method for a bone-conducted speech restoration based on the linear presiction," Trans. Tech. Comm. Psychol. Physiol. Acoust., ASJ, 36, 7, H-2006-104. Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y. (2007). "A speech enhancement approach for binaural hearing aids," Proc. 22th SIP Symposium, Sendai, 263-268. Vu, T. T., Unoki, M., and Akagi, M. (2008/3/20) "A study on the LP-based blind model in restoring bone-conducted speech," Asian Student workshop, Tokyo SP2007-189 Haniu, A., Unoki, M., and Akagi, M. (2008/3/20) "Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments," Asian Student workshop, Tokyo SP2007-196 Li, J. and Akagi, M. (2008). "A hybrid microphone array post-filter in a diffuse noise field," Applied Acoustics 69, 546-557. Li, J., Akagi, M., and Suzuki, Y. (2008). "A two-microphone noise reduction method in highly non-stationary multiple-noise-source environments," IEICE Trans. Fundamentals, E91-A, 6, 1337-1346. Nguyen, B. P. and Akagi, M. (2009/02/20). "Applications of Temporal Decomposition to Voice Transformation," International symposium on biomechanical and physiological modeling and speech science, 19-24. Akagi, M. (2009/02/20). "Introduction of SCOPE project: Analysis of production and perception characteristics of non-linguistic information in speech and its application to inter-language communications," International symposium on biomechanical and physiological modeling and speech science, 51-62. JAIST Technical Report Unoki, M. and Akagi, M. (1998). "A method of signal extraction from noisy signal based on auditory scene analysis", JAIST Tech. Report, IS-RR-98-0005P. Unoki, M. and Akagi, M. (1998). "A computational model of co-modulation masking release", JAIST Tech. Report, IS-RR-98-0006P. Ishimoto, Y., Unoki, M., and Akagi, M. (2005). "Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency," JAIST Tech. Report, IS-RR-2005-006. Patent Nothing in English
modified by isoyama-tjaist.ac.jp at Apr. 01 2022 JST

Acoustic Information Science,Unoki Laboratory

Publication (by Contents, by Time, by Field)

Acoustic Information Science,
Unoki Laboratory