Activities

1.         PC member of AI 2006 (the 19th Australian Joint Conference on Artificial Intelligence)

2.         PC member of PACLIC19 (the 19th Pacific Asia Conference on Language, Information and Computation)

3.         PC member of PRICAI 2008 (The 10th Pacific Rim International Conference on Artificial Intelligence)

PhD Thesis:  “Statistical Machine Learning Approaches to Cross-language Text Summarization”, Japan Advanced Institute of Science and Technology, 2004.

 

SOFTWARES

 

FlexCRFs (NEW): A Flexible Conditional Random Field Toolkit for Sequential Learning Applications.

 

Publications in DBLP

 

Journal papers

 

o    T.T. Nguyen, M.L. Nguyen, A. Shimazu, “Using Semi-supervised Learning for Question Classification”,  Journal Natural Language Processing 15(1) (2008)

o    P.T. Nguyen, A. Shimazu, M.L. Nguyen, V.V. Nguyen, “A Syntactic Transformation Model for Statistical Machine Translation”, International Journal of Computer Processing of Oriental Languages 20 (2), pp 79-99 (2007).

o    C. A. Le,  A. Shimazu, V.N. Huynh,  M.L. Nguyen, “Semi-Supervised Learning Integrated with Classifier Combination for Word Sense Disambiguation”, Computer Speech & Language  (2008)  pp 330-345

o    M.L. Nguyen, H.X. Phan, S. Horiguchi, S., and A. Shimazu, “A New Sentence Reduction Technique Based on a Decision Tree Model", International Journal on Artificial Intelligence Tools 16(1): 129-138 (2007)

o    H.X. Phan, M.L. Nguyen, Y. Inoguchi, and S. Horiguchi: High-Performance Training Conditional Random Fields for Large-Scale Applications of Labeling Sequence Data,IEICE Transactions on Information and Systems, Vol.E90-D, No.1, pp.13-21, 2007.

o    Phan, X.H., Nguyen, L.M., Inoguchi, Y., Ho, T.B., and Horiguchi, S.: Improving Discriminative Sequential Learning by Discovering Important Associations of Statistics. ACM Transactions on Asian Language and Information Processing 5(4): 413-438 (2006)

o    Phan, X. H., Nguyen, L. M., and Horiguchi, S.: Personal Name Resolution Crossover Documents by A Semantics-Based Approach. IEICE Transactions on Information and Systems, Vol.E89-D, No.2, pp.825-836, 2005.

o    M.L. Nguyen and S. Horiguchi, “Accuracy Enhancement for the Decomposition of Human-Written Summary”, International Journal of Computer Processing of Oriental Languages (IJCPOL), Vol.18, No.1 (2005) 1-22.

o    M.L. Nguyen, M. Fukushi, and S. Horiguchi, “A Probabilistic Sentence Reduction Using Maximum Entropy Model",  IEICE Transactions On Information and Systems, Japan, vol. E88-D, no. 2, pp.278--288, 2005.

o    M. L. Nguyen, S. Horiguchi, A. Shimazu, B.T. Ho, “Example-Based Sentence Reduction Using the Hidden Markov Model”,  ACM Transactions on Asian Language Processing, September, Issue 3, Vol 3, pp. 146-158, 2004.

Papers in Conferences

2008

o    Xuan-Hieu Phan, Le-Minh Nguyen, and Susumu Horiguchi. Learning to Classify Short and Sparse Text & Web with Hidden Topics from Large-scale Data Collections. In Proc. of The 17th International World Wide Web Conference (WWW 2008)    

 

o    Nguyen, P.T., Shimazu, A., Ho, T.B., Nguyen, L.M., Nguyen, V.V. (2008). A Tree-to-String Phrase-based Model for Statistical Machine Translation, Twelfth Conference on Computational Natural Language Learning, Manchester, 16-17 August (in press).

 

o    Vinh Van Nguyen, Thai Phuong Nguyen, Akira Shimazu, and Minh Le Nguyen.  Reordering  Phrase-Based Machine Translation over Chunks. In Proceedings of RIVF-08. July 13-17, 2008, Ho Chi Minh, Vietnam.

 

o    Vinh Van Nguyen, Thai Phuong Nguyen, Akira Shimazu, and Minh Le Nguyen.  A Reordering Model for Phrase-based Machine Translation. In Proceedings of Advanced in Natural Language-08, LNCS/LNAI vol 5521. August 25-27, 2008, Gothenburg, Sweden.

 

2007

 

o    Phan, X.H., Nguyen, M.L, Nguyen, C.T., Horiguchi, S.  Semantic Analysis of Entity Context Towards Open Named Entity Classification On The Web, PACLING 2007

 

o    Nguyen, V.V., Nguyen,  M.L.,  Shimazu, A.,  Using Conditional Random Fields For Clause Splitting,  PACLING 2007

 

o    Le-Minh Nguyen; Akira Shimazu; Phuong-Thai Nguyen; Xuan-Hieu Phan A Multilingual Dependency Analysis System Using Online Passive-Aggressive Learning, Shared Task paper Proceedings of EMNLP-CONLL 2007, pp 1149—1155

 

o    Nguyen, M.L., Shimazu, A., Nguyen, T.T.: Subtree mining for question classification problem. Twentieth International Joint Conference on Artificial Intelligence 

            (IJCAI 2007) Hyderabad, India, January 6-12, 2007,  pp 1695-1700.

2006

o    Nguyen, M.L., Shimazu, A., and Phan, X.H.: Semantic Parsing with Structured SVM Ensemble Classification Models. The 44th Annual Meeting of The Association for Computational Linguistics (ACL) and the 21st International Conference on Computational Linguistics (COLING), 17th – 21st July 2006, Sydney, Australia, pp 619-626.

 

o   Le, A.C., Shimazu, A., Nguyen, M.L.: Investigating Problems of Semi-Supervised Learning for Word Sense Disambiguation. 21st International Conference on the Computer Processing of Oriental Languages (ICCPOL2006) pp 482-489.

 

o    Nguyen, T.T., Shimazu,A, Le, A.C, Nguyen, M.L.: Applying RST Relations to Semantic Search. Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD 2006), pp 189-196.

 

o   Nguyen, T.T., Nguyen, M.L, and Shimazu, A.: Using Semi-supervised Learning for Question Classification. 21st International Conference on the Computer Processing of Oriental Languages (ICCPOL2006) pp 31-41.

 

o   Nguyen, P.T., Shimazu,A, Nguyen,M.L, Le, A.C : Rule-Based Transformation for Improving  Phrase-Based SMT from English to Vietnamese  The First International Conference on Knowledge, Information and Creativity Support Systems KICSS'06, 1-4 August, Ayutthaya .

 

o    Phan, X.H., Nguyen, L.M., Horiguchi, S., Inoguchi, Y., Ho, T.B.(2006). Parallel Training of CRFs: A Practical Approach to Build Large-Scale Prediction Models for Sequence Data, Workshop on Parallel Data Mining, ECML/PKDD 2006

 

o    C.T. Nguyen, T.K. Nguyen, X.H. Phan, L.M. Nguyen, and Q.T. Ha: Vietnamese Word Segmentation with CRFs and SVMs: An Investigation, The 20th Pacific Asia Conference on Language, Information, and Computation (PACLIC), 1st-3rd November, 2006, Wuhan, China

 

 

o    Phan, X.H., Nguyen, L.M., Inoguchi, Y., Ho, T.B., Horiguchi, S. (2006). High-Performance Training of Conditional Random Fields for Large-Scale Sequential Labeling Applications, International Conference on High Performance Scientific Computing, March 6-10, 2006, Hanoi.

 

2005

o    Nguyen, M. L., Shimazu, A., and Phan, X. H.: A Maximum Entropy Model for Transforming Sentences to Logical Form. The 18th Australian Joint Conference on Artificial Intelligence (AI05), pp.800-804, December 5-9, 2005, Sydney, Australia.

 

o    Nguyen, M.L., Shimazu, A., and Phan, X.H.: A Structured SVM Semantic Parser Augmented by Semantic Tagging with Conditional Random Fields. The 19th Pacific Asia Conference on Language Information and Computation, pp 167-178. December 1-3, 2005, Academia Sinica, Taipei.

 

o    Phan, X. H., Nguyen, M. L., Horiguchi, S., Ho, T. B., Inoguchi, Y.: Classification with Maximum Entropy Modeling of Predictive Association Rules. The 16th European Conference on Machine Learning (ECML-2005), pp.682-689, October 3-7, 2005, Porto, Portugal, LNAI, Springer.

 

o    Phan, X.H., Nguyen, L.M., Ho, T.B., Inoguchi, Y., Horiguchi, S. (2005). Co-Training of Conditional Random Fields for Segmenting Sequence Data, First World Congress of the International Federation for Systems Research (IFSR'05), Symposium on Data/Text Mining from Large Databases, Kobe, 15-17 November, S5-2-4 (Best student paper award).

 

o    Nguyen, L.M., Shimazu, A., Ho, T.B., Phan, X.H., Horiguchi, S. (2005). Sentence extraction with support vector machine ensemble, First World Congress of the International Federation for Systems Research (IFSR'05), Symposium on Data/Text Mining from Large Databases, Kobe, 15-17 November, S5-2-4.

 

o    Phan, X. H., Nguyen, M. L., Ho, T. B., and Horiguchi, S.: Improving Discriminative Sequential Learning with Rare-but-Important Associations. The 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.304-313, August 21-24, 2005, Chicago, IL, USA.

 

2004

o    M.L. Nguyen, A. Shimazu, S. Horiguchi, B.T. Ho, M. Fukushi,  "Probabilistic Sentence Reduction Using Support Vector Machines", The 20th International Conference on Computational Linguistics COLING 2004, 23-27 August, Geneva, pp 743-749, 2004.

 

o    Phan, X. H., Horiguchi, S., Ho, T. B., and Nguyen, M. L.: An Unsupervised Approach to Coreference Resolution. The 5th International Symposium on Knowledge and System Sciences. November 10-12, 2004, Ishikawa, Japan.

 

o    M.L. Nguyen, A. Shimazu, B.T. Ho, S. Horiguchi, H. X. Phan, “A cross-language text summarization system using statistical machine learning”,  The 5th International Symposium on Knowledge and System Sciences. November 10-12, 2004, Ishikawa, Japan. pp 132-136. 

 

2003

o    M.L. Nguyen and S. Horiguchi, A Sentence Reduction Using Syntax Control, Proc. of 6th Information Retrieval with Asian Language, pp. 139-146, 2003

 

o    M.L. Nguyen and S. Horiguchi, Sentence Reduction and Query Translation for Cross Language Information Retrieval: An Internet Search Application, Proc. of the International Symposium on Towards Peta-Bit Ultra-Network, pp. 103-107, 2003.

 

o    M.L. Nguyen, A. Shimazu and S. Horiguchi, Translation Template Learning Using Hidden Markov Modeling,  Proc. of 17th Pacific Asia Conference on Language, Information and Computation, pp. 269-276, 2003.

 

o    M.L. Nguyen and S. Horiguchi, A New Sentence Reduction Based on Decision Tree Model, Proc. of 17th Pacific Asia Conference on Language, Information and Computation, pp. 269-276, 2003.

 

o    N.H. Pham, M.L. Nguyen, A.C. Le, N.P. Thai, N.V. Vinh, and H.S. Dam, ``LVT: An English Vietnamese Translation System", First National Conference on Fundamental and Applied Research in Information Technology FAIR-03, Hanoi, October 2003.

2002

o    M.L. Nguyen and S. Horiguchi, “An Efficient Decomposition of Human-Written Summary Sentence”,  Proc. of 9th International Conference on Neural Information Processing, November, Singapore, Vol.2, pp.705-710, 2002.

 

o    M.L. Nguyen, S. Horiguchi, T. B. Ho,  A. C. Le,  V. V. Nguyen and P.T. Nguyen, “ Sentence Reduction Using Semantic Parsing with Rich Knowledge Base”, Proc of Joint Third International Conference on Intelligent Technologies and Third Vietnam-Japan Symposium on Fuzzy Systems and Application, Hanoi, pp. 376-379, 2002.

 

o    M.L. Nguyen and S. Horiguchi, “Approach to scalable statistical text summarization", JAIST Reasearch Report, IS-RR-2002-016, 2002, April. pp.1-16, 2002