2019
Sinh, Vu Trong; Minh, Nguyen Le
A Study on Self-attention Mechanism for AMR-to-text Generation Journal Article
In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11608 LNCS, pp. 321-328, 2019, (cited By 0).
Abstract | Links | BibTeX | Tags:
@article{Sinh2019321,
  title     = {A Study on Self-attention Mechanism for {AMR}-to-text Generation},
  author    = {Vu Trong Sinh and Nguyen Le Minh},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85068326115&doi=10.1007%2f978-3-030-23281-8_27&partnerID=40&md5=099dc2e96c585494a492a5d3c3dca829},
  doi       = {10.1007/978-3-030-23281-8_27},
  year      = {2019},
  date      = {2019-01-01},
  journal   = {Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)},
  volume    = {11608 LNCS},
  pages     = {321--328},
  abstract  = {Introduced by Vaswani et al., transformer architecture, with the effective use of self-attention mechanism, has shown outstanding performance in translating sequence of text from one language to another. In this paper, we conduct experiments using the self-attention in converting an abstract meaning representation (AMR) graph, a semantic representation, into a natural language sentence, also known as the task of AMR-to-text generation. On the benchmark dataset for this task, we obtain promising results comparing to existing deep learning methods in the literature. © 2019, Springer Nature Switzerland AG.},
  note      = {cited By 0},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Nguyen, Minh-Tien; Tran, Viet Cuong; Nguyen, Xuan Hoai; Nguyen, Le-Minh
Web document summarization by exploiting social context with matrix co-factorization Journal Article
In: Information Processing and Management, vol. 56, no. 3, pp. 495-515, 2019, (cited By 14).
Abstract | Links | BibTeX | Tags:
@article{Nguyen2019495,
  title     = {Web document summarization by exploiting social context with matrix co-factorization},
  author    = {Minh-Tien Nguyen and Viet Cuong Tran and Xuan Hoai Nguyen and Le-Minh Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85059551583&doi=10.1016%2fj.ipm.2018.12.006&partnerID=40&md5=65cb397e22003733491d825ae47b8f68},
  doi       = {10.1016/j.ipm.2018.12.006},
  year      = {2019},
  date      = {2019-01-01},
  journal   = {Information Processing and Management},
  volume    = {56},
  number    = {3},
  pages     = {495--515},
  abstract  = {In the context of social media, users usually post relevant information corresponding to the contents of events mentioned in a Web document. This information posses two important values in that (i) it reflects the content of an event and (ii) it shares hidden topics with sentences in the main document. In this paper, we present a novel model to capture the nature of relationships between document sentences and post information (comments or tweets) in sharing hidden topics for summarization of Web documents by utilizing relevant post information. Unlike previous methods which are usually based on hand-crafted features, our approach ranks document sentences and user posts based on their importance to the topics. The sentence-user-post relation is formulated in a share topic matrix, which presents their mutual reinforcement support. Our proposed matrix co-factorization algorithm computes the score of each document sentence and user post and extracts the top ranked document sentences and comments (or tweets) as a summary. We apply the model to the task of summarization on three datasets in two languages, English and Vietnamese, of social context summarization and also on DUC 2004 (a standard corpus of the traditional summarization task). According to the experimental results, our model significantly outperforms the basic matrix factorization and achieves competitive ROUGE-scores with state-of-the-art methods. © 2018 Elsevier Ltd},
  note      = {cited By 14},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Nguyen, Van-Nhat; Nguyen, Ha-Thanh; Vo, Dinh-Hieu; Nguyen, Le-Minh
Relation Extraction in Vietnamese Text via Piecewise Convolution Neural Network with Word-Level Attention Conference
2019, (cited By 0).
Abstract | Links | BibTeX | Tags:
@inproceedings{Nguyen201999,
  title     = {Relation Extraction in {Vietnamese} Text via Piecewise Convolution Neural Network with Word-Level Attention},
  author    = {Van-Nhat Nguyen and Ha-Thanh Nguyen and Dinh-Hieu Vo and Le-Minh Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85061922441&doi=10.1109%2fNICS.2018.8606824&partnerID=40&md5=667afcb4c73744ba20dc5d56012c5bb1},
  doi       = {10.1109/NICS.2018.8606824},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {NICS 2018 - Proceedings of 2018 5th NAFOSTED Conference on Information and Computer Science},
  pages     = {99--103},
  abstract  = {With the explosion of information technology, the Internet now contains enormous amounts of data, so the role of information extraction systems becomes very important. Relation Extraction is a sub-task of Information Extraction, which focuses on classifying the relationship between the entity pairs mentioned in the text. In recent years, despite the many new methods have been introduced, Relation Extraction still receives attention from researchers for languages in general and Vietnamese in particular.Relation Extraction can be addressed in a variety of ways, including supervised learning methods, unsupervised and semi-supervised methods. Recent studies in the English language have shown that Relation Extraction using deep learning method in the supervised or semi-supervised domains is achieving optimal and superior results over traditional non-deep learning methods. However, researches in Vietnamese are few and in the process of searching documents, the results of deep learning applying for Relation Extraction in Vietnamese are not found. Therefore, the research focuses on studying and research the method of using deep learning to solve Relation Extraction task in Vietnamese. In order to solve the Relation Extraction task, the research proposes and constructs a deep learning model named Piecewise Convolution Neural Network with Word-Level Attention. © 2018 IEEE.},
  note      = {cited By 0},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
Nguyen, Minh-Tien; Lai, Dac Viet; Nguyen, Huy Tien; Nguyen, Minh Le
Tsix: A Human-involved-creation Dataset for Tweet Summarization Conference
2019, (cited By 3).
Abstract | Links | BibTeX | Tags:
@inproceedings{Nguyen20193204,
  title     = {{Tsix}: A Human-involved-creation Dataset for Tweet Summarization},
  author    = {Minh-Tien Nguyen and Dac Viet Lai and Huy Tien Nguyen and Minh Le Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85059904115&partnerID=40&md5=4f12f4bd88889e99948ce9797be2d648},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {LREC 2018 - 11th International Conference on Language Resources and Evaluation},
  pages     = {3204--3208},
  abstract  = {We present a new dataset for tweet summarization. The dataset includes six events collected from Twitter from October 10 to November 9, 2016. Our dataset features two prominent properties. Firstly, human-annotated gold-standard references allow to correctly evaluate extractive summarization methods. Secondly, tweets are assigned into sub-topics divided by consecutive days, which facilitate incremental tweet stream summarization methods. To reveal the potential usefulness of our dataset, we compare several well-known summarization methods. Experimental results indicate that among extractive approaches, hybrid term frequency - document term frequency obtains competitive results in term of ROUGE-scores. The analysis also shows that polarity is an implicit factor of tweets in our dataset, suggesting that it can be exploited as a component besides tweet content quality in the summarization process. © LREC 2018 - 11th International Conference on Language Resources and Evaluation. All rights reserved.},
  note      = {cited By 3},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
Tran, Van-Khanh; Nguyen, Le-Minh
Gating mechanism based Natural Language Generation for spoken dialogue systems Journal Article
In: Neurocomputing, vol. 325, pp. 48-58, 2019, (cited By 4).
Abstract | Links | BibTeX | Tags:
@article{Tran201948,
  title     = {Gating mechanism based Natural Language Generation for spoken dialogue systems},
  author    = {Van-Khanh Tran and Le-Minh Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85055287829&doi=10.1016%2fj.neucom.2018.09.069&partnerID=40&md5=f945e3a0efa1a9db6d122fafc7145c56},
  doi       = {10.1016/j.neucom.2018.09.069},
  year      = {2019},
  date      = {2019-01-01},
  journal   = {Neurocomputing},
  volume    = {325},
  pages     = {48--58},
  abstract  = {Recurrent Neural Network (RNN) based approaches have recently shown promising in tackling Natural Language Generation (NLG) problems. This paper presents an approach to leverage gating mechanisms, in which we incrementally propose three additional semantic cells into a traditional RNN model: a Refinement cell to filter the sequential inputs before RNN computations, an Adjustment cell, and an Output cell to select semantic elements and gate a feature vector during generation. The proposed gating-based generators can learn from unaligned data by jointly training both sentence planning and surface realization to generate natural language utterances. We conducted extensive experiments on four different NLG domains in which the results empirically show that the proposed methods not only achieved better performance on all the NLG domains in comparison with previous gating-based, attention-based methods, but also obtained highly competitive results compared to a hybrid generator. © 2018 Elsevier B.V.},
  note      = {cited By 4},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Trieu, Hai-Long; Tran, Duc-Vu; Ittoo, Ashwin; Nguyen, Le-Minh
Leveraging additional resources for improving statistical machine translation on asian low-resource languages Journal Article
In: ACM Transactions on Asian and Low-Resource Language Information Processing, vol. 18, no. 3, 2019, (cited By 2).
Abstract | Links | BibTeX | Tags:
@article{Trieu2019,
  title     = {Leveraging additional resources for improving statistical machine translation on {Asian} low-resource languages},
  author    = {Hai-Long Trieu and Duc-Vu Tran and Ashwin Ittoo and Le-Minh Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85075758647&doi=10.1145%2f3314936&partnerID=40&md5=aace4f12b003e0135cdd783a3019482e},
  doi       = {10.1145/3314936},
  year      = {2019},
  date      = {2019-01-01},
  journal   = {ACM Transactions on Asian and Low-Resource Language Information Processing},
  volume    = {18},
  number    = {3},
  abstract  = {Phrase-based machine translation (MT) systems require large bilingual corpora for training. Nevertheless, such large bilingual corpora are unavailable for most language pairs in the world, causing a bottleneck for the development of MT. For the Asian language pairs-Japanese, Indonesian, Malay paired with Vietnamese-they are also not excluded from the case, in which there are no large bilingual corpora on these low-resource language pairs. Furthermore, although the languages are widely used in the world, there is no prior work on MT, which causes an issue for the development of MT on these languages. In this article, we conducted an empirical study of leveraging additional resources to improve MT for the Asian low-resource language pairs: Translation fromJapanese, Indonesian, andMalay to Vietnamese.We propose an innovative approach that lies in two strategies of building bilingual corpora from comparable data and phrase pivot translation on existing bilingual corpora of the languages paired with English. Bilingual corpora were built from Wikipedia bilingual titles to enhance bilingual data for the low-resource languages. Additionally,we introduced a combined model of the additional resources to create an effective solution to improveMT on the Asian low-resource languages. Experimental results show the effectiveness of our systems with the improvement of +2 to +7 BLEU points. This work contributes to the development of MT on low-resource languages, especially opening a promising direction for the progress of MT on the Asian language pairs. © 2019 Association for Computing Machinery.},
  note      = {cited By 2},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Nguyen, Huy Tien; Nguyen, Minh Le
An ensemble method with sentiment features and clustering support Journal Article
In: Neurocomputing, vol. 370, pp. 155-165, 2019, (cited By 14).
Abstract | Links | BibTeX | Tags:
@article{Nguyen2019155,
  title     = {An ensemble method with sentiment features and clustering support},
  author    = {Huy Tien Nguyen and Minh Le Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85071983415&doi=10.1016%2fj.neucom.2019.08.071&partnerID=40&md5=cd9de9ba61553dfaeb98217e8d75b8fd},
  doi       = {10.1016/j.neucom.2019.08.071},
  year      = {2019},
  date      = {2019-01-01},
  journal   = {Neurocomputing},
  volume    = {370},
  pages     = {155--165},
  abstract  = {Long Short Term Memory (LSTM) and Convolutional Neural Network (CNN) are efficiently applied to natural language processing, especially sentiment analysis. CNN employs filters to capture local dependencies while LSTM designs a cell to memorize long-distance information. However, integrating these advantages into one model is challenging because of overfitting in training. To avoid this problem, we propose a freezing technique to learn sentiment-specific vectors from CNN and LSTM. This technique is efficient for integrating the advantages of various deep learning models. We also observe that semantically clustering documents into groups is more beneficial for ensemble methods. According to the experiments, our method achieves competitive results on the five well-known datasets: Pang & Lee movie reviews, Stanford Twitter Sentiment and Stanford Sentiment Treebank for sentence level, IMDB large movie reviews, and SenTube for document level. © 2019},
  note      = {cited By 14},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Trieu, Hai Long; Tran, Duc-Vu; Nguyen, Minh Le
Investigating phrase-based and neural-based machine translation on low-resource settings Conference
2019, (cited By 2).
Abstract | Links | BibTeX | Tags:
@inproceedings{Trieu2019384,
  title     = {Investigating phrase-based and neural-based machine translation on low-resource settings},
  author    = {Hai Long Trieu and Duc-Vu Tran and Minh Le Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85072811470&partnerID=40&md5=5696f540c15611279faa6fa7af992eb9},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {PACLIC 2017 - Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation},
  pages     = {384--391},
  abstract  = {Neural-based and phrase-based methods have shown the effectiveness and promising results in the development of current machine translation. The two methods are compared on some European languages, which show the advantages of the neural machine translation. Nevertheless, there are few work of comparing the two methods on low-resource languages, which there are only small bilingual corpora. The problem of unavailable large bilingual corpora causes a bottleneck for machine translation for such language pairs. In this paper, we present a comparison of the phrase-based and neural-based machine translation methods on several Asian language pairs: Japanese-English, Indonesian-Vietnamese, and English-Vietnamese. Additionally, we extracted a bilingual corpus from Wikipedia to enhance machine translation performance. Experimental results showed that when using the extracted corpus to enlarge the training data, neural machine translation models achieved the higher improvement and outperformed the phrase-based models. This work can be useful as a basis for further development of machine translation on the low-resource languages. Copyright © 2017 Hai Long Trieu, Vu Tran and Nguyen Le Minh},
  note      = {cited By 2},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
Vu, Trong Sinh; Nguyen, Le Minh
An Empirical Evaluation of AMR Parsing for Legal Documents Journal Article
In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11717 LNAI, pp. 131-145, 2019, (cited By 0).
Abstract | Links | BibTeX | Tags:
@article{Vu2019131,
  title     = {An Empirical Evaluation of {AMR} Parsing for Legal Documents},
  author    = {Trong Sinh Vu and Le Minh Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85075556833&doi=10.1007%2f978-3-030-31605-1_11&partnerID=40&md5=43113718dc9930d7c0fa59501b957825},
  doi       = {10.1007/978-3-030-31605-1_11},
  year      = {2019},
  date      = {2019-01-01},
  journal   = {Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)},
  volume    = {11717 LNAI},
  pages     = {131--145},
  abstract  = {Many approaches have been proposed to tackle the problem of Abstract Meaning Representation (AMR) parsing, help solving various natural language processing issues recently. In our paper, we provide an overview of different methods in AMR parsing and their performances when analyzing legal documents. We conduct experiments of different AMR parsers on our annotated dataset extracted from the English version of Japanese Civil Code. Our results show the limitations as well as open a room for improvements of current parsing techniques when applying in this non-trivial domain. © 2019, Springer Nature Switzerland AG.},
  note      = {cited By 0},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Nguyen, Huy Xuan; Nguyen, Le Minh
Attention mechanism for recommender systems Conference
2019, (cited By 0).
Abstract | Links | BibTeX | Tags:
@inproceedings{Xuan2019174,
  title     = {Attention mechanism for recommender systems},
  author    = {Huy Xuan Nguyen and Le Minh Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85084953230&partnerID=40&md5=0001269121e6afeb17c2ae3eb89a861d},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {Proceedings of the 33rd Pacific Asia Conference on Language, Information and Computation, PACLIC 2019},
  pages     = {174--181},
  abstract  = {Sparseness of user item rating data affects the quality of recommender systems. To solve this problem, many approaches have been proposed. They added supplemental information to increase the accuracy. We propose a recommendation model namely attention matrix factorization (AMF) that integrates attention mechanism of the both item reviews document and item genre information into probabilistic matrix factorization (PMF). Consequently, AMF attends features which are mentioned in item reviews document and further increases the rating prediction accuracy by adding item genre information. Our experiments on the Movielens and Amazon instant video datasets show that AMF outperforms the previous traditional recommendation systems. This reveals that our model can capture subtle features of item reviews although the rating data is sparse. Copyright © 2019 Xuan-Huy Nguyen and Le-Minh Nguyen.},
  note      = {cited By 0},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
Vu, Sinh Trong; Nguyen, Minh Le; Satoh, Ken
Legal Text Generation from Abstract Meaning Representation Inproceedings
In: Legal Knowledge and Information Systems, pp. 229–234, IOS Press, 2019.
@inproceedings{vu2019legal,
  title     = {Legal Text Generation from Abstract Meaning Representation},
  author    = {Sinh Trong Vu and Minh Le Nguyen and Ken Satoh},
  url       = {https://ebooks.iospress.nl/pdf/doi/10.3233/FAIA190330},
  doi       = {10.3233/FAIA190330},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {Legal Knowledge and Information Systems},
  pages     = {229--234},
  publisher = {IOS Press},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {inproceedings}
}
Nguyen, Phuong Minh; Than, Khoat; Nguyen, Minh Le
Marking Mechanism in Sequence-to-sequence Model for Mapping Language to Logical Form Inproceedings
In: 2019 11th International Conference on Knowledge and Systems Engineering (KSE), pp. 1–7, IEEE 2019.
@inproceedings{nguyen2019marking,
  author       = {Phuong Minh Nguyen and Khoat Than and Minh Le Nguyen},
  title        = {Marking Mechanism in Sequence-to-sequence Model for Mapping Language to Logical Form},
  booktitle    = {2019 11th International Conference on Knowledge and Systems Engineering (KSE)},
  pages        = {1--7},
  organization = {IEEE},
  url          = {https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8919471},
  year         = {2019},
  date         = {2019-01-01},
  keywords     = {},
  pubstate     = {published},
  tppubtype    = {inproceedings}
}
Trong, Sinh Vu; Le, Minh Nguyen
An Empirical Evaluation of AMR Parsing for Legal Documents Journal Article
In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11717 LNAI, pp. 131-145, 2019, (cited By 0).
Abstract | Links | BibTeX | Tags:
@article{Vu2019131b,
  title         = {An Empirical Evaluation of {AMR} Parsing for Legal Documents},
  author        = {Trong Sinh Vu and Le Minh Nguyen},
  url           = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85075556833&doi=10.1007%2f978-3-030-31605-1_11&partnerID=40&md5=43113718dc9930d7c0fa59501b957825},
  doi           = {10.1007/978-3-030-31605-1_11},
  year          = {2019},
  date          = {2019-01-01},
  journal       = {Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)},
  volume        = {11717 LNAI},
  pages         = {131--145},
  abstract      = {Many approaches have been proposed to tackle the problem of Abstract Meaning Representation (AMR) parsing, help solving various natural language processing issues recently. In our paper, we provide an overview of different methods in AMR parsing and their performances when analyzing legal documents. We conduct experiments of different AMR parsers on our annotated dataset extracted from the English version of Japanese Civil Code. Our results show the limitations as well as open a room for improvements of current parsing techniques when applying in this non-trivial domain. © 2019, Springer Nature Switzerland AG.},
  note          = {cited By 0},
  internal-note = {review: exact duplicate of entry Vu2019131 (same DOI); author name order normalized to match; consider removing one of the two},
  keywords      = {},
  pubstate      = {published},
  tppubtype     = {article}
}
Chen, Laifu; Nguyen, Minh Le
Sentence selective neural extractive summarization with reinforcement learning Conference
2019, (cited By 5).
Abstract | Links | BibTeX | Tags:
@inproceedings{Chen2019,
  title     = {Sentence selective neural extractive summarization with reinforcement learning},
  author    = {Laifu Chen and Minh Le Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85077067618&doi=10.1109%2fKSE.2019.8919490&partnerID=40&md5=b1283c8536e749cd75370549da3f2e3e},
  doi       = {10.1109/KSE.2019.8919490},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {Proceedings of 2019 11th International Conference on Knowledge and Systems Engineering, KSE 2019},
  abstract  = {In this work we employed a common Recurrent Neural Network (RNN) based sequence model for single document summarization, composed of encoder-extractor hierarchical network architecture. We develop a sentence level selective encoding mechanism to select important feature before extracting sentences, and use a novel reinforcement learning based training algorithm to extend the sequence model. Besides, for single document extractive summarization task, most of researchers only pay attention to the main part of document. We analyze and explore the side information such as the headline and image caption in both CNN and Daily Mail news datasets. Empirical experiment results show the effect that our model outperforms the baseline model, and can be comparable with the state-of-the-art extractive systems when automatically evaluated in the ROUGE metric. The statistics analysis of the data set verifies our experiment results. © 2019 IEEE.},
  note      = {cited By 5},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
Nguyen, Ha-Thanh; Nguyen, Le-Minh
Swarm filter - A simple deep learning component inspired by swarm concept Conference
vol. 2019-November, 2019, (cited By 0).
Abstract | Links | BibTeX | Tags:
@inproceedings{Nguyen20191541,
  title     = {Swarm filter - A simple deep learning component inspired by swarm concept},
  author    = {Ha-Thanh Nguyen and Le-Minh Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85081086391&doi=10.1109%2fICTAI.2019.00221&partnerID=40&md5=c6de3d885820f4d0a9027c50823713a1},
  doi       = {10.1109/ICTAI.2019.00221},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI},
  volume    = {2019-November},
  pages     = {1541--1545},
  abstract  = {Swarm is a research topic not only of biologists but also for computer scientists for years. With the idea of swarm intelligence in nature, optimal algorithms are proposed to solve different problems. In addition to the proactive aspect, a swarm can provide useful hints for identification problems. There are features that only exist when an individual belongs to a swarm. An idea came to us, deep learning networks have the ability to automatically select features, so they can extract the characteristics of a swarm for identification problems. This is a new idea in the combination of swarm characteristic with deep learning model. The previous studies combined swarm intelligence with neural networks to find the optimal parameters and architecture for the model. When performing our experiments, we were surprised that this simple architecture got a state-of-the-art result. This interesting discovery can be applied to other tasks using deep learning. © 2019 IEEE.},
  note      = {cited By 0},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
Trang, Luu; Minh, Nguyen Le; Thuy, Nguyen Thanh
Message from the KSE’19 General & TPC Chairs Journal Article
In: 2019.
BibTeX | Tags:
@article{trangmessage,
  title     = {Message from the {KSE’19} General \& {TPC} Chairs},
  author    = {Luu Trang and Nguyen Le Minh and Nguyen Thanh Thuy},
  year      = {2019},
  date      = {2019-01-01},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Tran, Vu D; Nguyen, Minh L; Shirai, Kiyoaki; Satoh, Ken
An approach of rhetorical status recognition for judgments in court documents using deep learning models Conference
2019, (cited By 0).
Abstract | Links | BibTeX | Tags:
@inproceedings{Tran2019,
  title     = {An approach of rhetorical status recognition for judgments in court documents using deep learning models},
  author    = {Vu D Tran and Minh L Nguyen and Kiyoaki Shirai and Ken Satoh},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85077023547&doi=10.1109%2fKSE.2019.8919370&partnerID=40&md5=178cbadf39276c8ec3a4b3df9019f41f},
  doi       = {10.1109/KSE.2019.8919370},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {Proceedings of 2019 11th International Conference on Knowledge and Systems Engineering, KSE 2019},
  abstract  = {In a court document, the rhetorical status of a sentence conveys the intention of the sentence, whether is is a claim or contains supporting evidences, thus, is beneficial to court document processing systems, for example, court document retrieval systems. Besides, rhetorical structure analysis has high-impact applications in natural language processing, for instances, text summarization, sentiment analysis, question answering. The output structures of the analysis contain high-level relationship between clauses and so provides valuable information. Despite of a wide range of applications and the necessity for automatic court document processing, automatic rhetorical structure analysis has not been well noticed in the legal domain. We propose to use deep learning models for tackling the task of recognizing the rhetorical status of each sentence in a court document. Deep learning has been shown effective towards natural language processing tasks including discourse analysis. We have achieved promising results for the task, which suggests the applicability of artificial neural module embedding rhetorical information for other tasks, for example, summarization and information retrieval. © 2019 IEEE.},
  note      = {cited By 0},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
Tran, Nhu-Thuat; Nguyen, Le-Minh; Phan, Xuan-Hieu; others,
Learning to transform Vietnamese natural language queries into SQL commands Conference
2019, (cited By 2).
Abstract | Links | BibTeX | Tags:
@inproceedings{Vuong2019,
  title     = {Learning to transform {Vietnamese} natural language queries into {SQL} commands},
  author    = {Nhu-Thuat Tran and Le-Minh Nguyen and Xuan-Hieu Phan and others},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85077042704&doi=10.1109%2fKSE.2019.8919393&partnerID=40&md5=d22d5f85cda03484489e0550da616e08},
  doi       = {10.1109/KSE.2019.8919393},
  year      = {2019},
  date      = {2019-01-01},
  booktitle = {Proceedings of 2019 11th International Conference on Knowledge and Systems Engineering, KSE 2019},
  abstract  = {In the field of data management, users traditionally manipulates their data using structured query language (SQL). However, this method requires an understanding of relational database, data schema, and SQL syntax as well as the way it works. Database manipulation using natural language, therefore, is much more convenient since any normal user can interact with their data without a background of database and SQL. This is, however, really tough because transforming natural language commands into SQL queries is a challenging task in natural language processing and understanding. In this paper, we propose a novel two-phase approach to automatically analyzing and converting natural language queries into the corresponding SQL forms. In our approach, the first phase is component segmentation which identifies primary clauses in SQL such as SELECT, FROM, WHERE, ORDER BY, etc. The second phase is slot- filling that helps extract sub-components for each primary clause such as SELECT column(s), SELECT aggregation operation, etc. We carefully conducted an empirical evaluation for our method using conditional random fields (CRFs) on a medium-sized corpus of natural language queries in Vietnamese, and have achieved promising results with an average accuracy of more than 90%. © 2019 IEEE.},
  note      = {cited By 2},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
2018
Nguyen, Huy Tien; Nguyen, Minh Le
Multilingual opinion mining on YouTube--A convolutional N-gram BiLSTM word embedding Journal Article
In: Information Processing & Management, vol. 54, no. 3, pp. 451–462, 2018.
BibTeX | Tags:
@article{nguyen2018multilingual,
  title     = {Multilingual opinion mining on {YouTube}--A convolutional {N-gram} {BiLSTM} word embedding},
  author    = {Huy Tien Nguyen and Minh Le Nguyen},
  year      = {2018},
  date      = {2018-01-01},
  journal   = {Information Processing \& Management},
  volume    = {54},
  number    = {3},
  pages     = {451--462},
  publisher = {Pergamon},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Do, Khac-Phong; Nguyen, Le-Minh
A study on integrating distinct classifiers with bidirectional LSTM for Slot Filling task Conference
2018, (cited By 0).
Abstract | Links | BibTeX | Tags:
@inproceedings{Do2018312,
  title     = {A study on integrating distinct classifiers with bidirectional {LSTM} for Slot Filling task},
  author    = {Khac-Phong Do and Le-Minh Nguyen},
  url       = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85060380806&doi=10.1109%2fKSE.2018.8573382&partnerID=40&md5=4ffb4c3c4a121e21f0578f94427ea9ef},
  doi       = {10.1109/KSE.2018.8573382},
  year      = {2018},
  date      = {2018-01-01},
  booktitle = {Proceedings of 2018 10th International Conference on Knowledge and Systems Engineering, KSE 2018},
  pages     = {312--317},
  abstract  = {In spite of being investigated for decades, in Spoken Language Understanding, slot filling task perceived as sequential labeling in a specific domain is still challenging and attractive to many researchers. For this task, Recurrent Conditional Random Field (RCRF) is a popular model in order to learn latent representations of data which are then utilized as input to a classifier CRF. Our proposed model, in contrast, employed a variant of RNNs, called Long Short-Tem Memory Networks (LSTMs) which, more or less, tackle the downside of RNNs: vanishing gradients. Additionally, we also conducted experiments on the integration of bidirectional LSTM with distinct classifiers, e.g CRFs, SVMs; which then are trained simultaneously. The experimental results show that these combinations are beneficial on both dataset Airline Travel Information System (ATIS) and DARPA Communicator, compared with the state-of-the-art model. © 2018 IEEE.},
  note      = {cited By 0},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {conference}
}
Long, Dang Hoang; Nguyen, Minh-Tien; Bach, Ngo Xuan; Nguyen, Le-Minh; Phuong, Tu Minh
An entailment-based scoring method for content selection in document summarization Conference
2018, (cited By 1).
Abstract | Links | BibTeX | Tags:
@conference{Long2018122,
title = {An entailment-based scoring method for content selection in document summarization},
author = {Dang Hoang Long and Minh-Tien Nguyen and Ngo Xuan Bach and Le-Minh Nguyen and Tu Minh Phuong},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85059963311&doi=10.1145%2f3287921.3287976&partnerID=40&md5=e44073cf17dac5225fe08ebf62f60139},
doi = {10.1145/3287921.3287976},
year = {2018},
date = {2018-01-01},
booktitle = {ACM International Conference Proceeding Series},
pages = {122--129},
abstract = {This paper introduces a scoring method to improve the quality of content selection in an extractive summarization system. Different from previous models mainly using local information inside sentences such as sentence position or sentence length, our method judges the importance of a sentence based on its own information and the relation between sentences. For the relation between sentences, we utilize textual entailment, a relationship indicating that the meaning of a sentence can be inferred from another one. Unlike previous work on using textual entailment for summarization, we go a step further by looking at aligned words in an entailment sentence pair. Assuming that important words in a salient sentence can be aligned by several words in other sentences, word alignment scores are exploited to compute the entailment score of a sentence. To take advantage of local and neighbor information for facilitating the salient estimation of sentences, we combine entailment scores with sentence position scores. We validate the proposed scoring method with greedy or integer linear programming approaches for extracting summaries. Experiments on three datasets (including DUC 2001 and 2002) in two different domains show that our model obtains competitive ROUGE-scores with state-of-the-art methods for single-document summarization. © 2018 Association for Computing Machinery.},
note = {cited By 1},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Nguyen, Khanh Duy Tung; Tuan, Tran Minh; Le, Son Hai; Viet, Anh Phan; Ogawa, Mizuhito; Minh, Nguyen Le
Comparison of Three Deep Learning-based Approaches for IoT Malware Detection Conference
2018, (cited By 12).
Abstract | Links | BibTeX | Tags:
@conference{Nguyen2018382,
title = {Comparison of Three Deep Learning-based Approaches for IoT Malware Detection},
author = {Khanh Duy Tung Nguyen and Tran Minh Tuan and Son Hai Le and Anh Phan Viet and Mizuhito Ogawa and Nguyen Le Minh},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85060368530&doi=10.1109%2fKSE.2018.8573374&partnerID=40&md5=35105f35944599f281f9e4ae56235213},
doi = {10.1109/KSE.2018.8573374},
year = {2018},
date = {2018-01-01},
booktitle = {Proceedings of 2018 10th International Conference on Knowledge and Systems Engineering, KSE 2018},
pages = {382--387},
abstract = {The development of IoT brings many opportunities but also many challenges. Recently, increasingly more malware has appeared to target IoT devices. Machine learning is one of the typical techniques used in the detection of malware. In this paper, we survey three approaches for IoT malware detection based on the application of convolutional neural networks on different data representations including sequences, images, and assembly code. The comparison was conducted on the task of distinguishing malware from nonmalware. We also analyze the results to assess the pros/cons of each method. © 2018 IEEE.},
note = {cited By 12},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Le, Tung; Minh, Nguyen Le
Combined Objective Function in Deep Learning Model for Abstractive Summarization Inproceedings
In: Proceedings of the Ninth International Symposium on Information and Communication Technology, pp. 84–91, 2018.
BibTeX | Tags:
@inproceedings{le2018combined,
  title     = {Combined Objective Function in Deep Learning Model for Abstractive Summarization},
  author    = {Tung Le and Nguyen Le Minh},
  booktitle = {Proceedings of the Ninth International Symposium on Information and Communication Technology},
  pages     = {84--91},
  year      = {2018},
  date      = {2018-01-01},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {inproceedings}
}
Tran, Van-Khanh; Nguyen, Le-Minh
Dual latent variable model for low-resource natural language generation in dialogue systems Conference
2018, (cited By 5).
Abstract | Links | BibTeX | Tags:
@conference{Tran201821,
title = {Dual latent variable model for low-resource natural language generation in dialogue systems},
author = {Van-Khanh Tran and Le-Minh Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85072919986&doi=10.18653%2fv1%2fk18-1003&partnerID=40&md5=97328973bf26a7188c94b42eb472548a},
doi = {10.18653/v1/k18-1003},
year = {2018},
date = {2018-01-01},
booktitle = {CoNLL 2018 - 22nd Conference on Computational Natural Language Learning, Proceedings},
pages = {21--30},
abstract = {Recent deep learning models have shown improving results to natural language generation (NLG) irrespective of providing sufficient annotated data. However, a modest training data may harm such models’ performance. Thus, how to build a generator that can utilize as much of knowledge from a low-resource setting data is a crucial issue in NLG. This paper presents a variational neural-based generation model to tackle the NLG problem of having limited labeled dataset, in which we integrate a variational inference into an encoder-decoder generator and introduce a novel auxiliary auto-encoding with an effective training procedure. Experiments showed that the proposed methods not only outperform the previous models when having sufficient training dataset but also show strong ability to work acceptably well when the training data is scarce. © 2018 Association for Computational Linguistics.},
note = {cited By 5},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Nguyen, Minh-Tien; Tran, Duc-Vu; Phan, Viet-Anh; Nguyen, Le-Minh
Towards Social Context Summarization with Convolutional Neural Networks Journal Article
In: 2018.
BibTeX | Tags:
@article{nguyen12towards,
title = {Towards Social Context Summarization with Convolutional Neural Networks},
author = {Minh-Tien Nguyen and Duc-Vu Tran and Viet-Anh Phan and Le-Minh Nguyen},
year = {2018},
date = {2018-01-01},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Tran, Vu; Nguyen, Minh Le; Satoh, Ken
Automatic catchphrase extraction from legal case documents via scoring using deep neural networks Journal Article
In: arXiv preprint arXiv:1809.05219, 2018.
BibTeX | Tags:
@article{tran2018automatic,
title = {Automatic catchphrase extraction from legal case documents via scoring using deep neural networks},
author = {Vu Tran and Minh Le Nguyen and Ken Satoh},
year = {2018},
date = {2018-01-01},
journal = {arXiv preprint arXiv:1809.05219},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Nguyen, Huy-Tien; Vo, Quan-Hoang; Nguyen, Minh-Le
A Deep Learning Study of Aspect Similarity Recognition Conference
2018, (cited By 2).
Abstract | Links | BibTeX | Tags:
@conference{Nguyen2018181,
title = {A Deep Learning Study of Aspect Similarity Recognition},
author = {Huy-Tien Nguyen and Quan-Hoang Vo and Minh-Le Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85060388874&doi=10.1109%2fKSE.2018.8573326&partnerID=40&md5=730f4e067bbe31227aa03db04a9d01ec},
doi = {10.1109/KSE.2018.8573326},
year = {2018},
date = {2018-01-01},
booktitle = {Proceedings of 2018 10th International Conference on Knowledge and Systems Engineering, KSE 2018},
pages = {181--186},
abstract = {Aspect Similarity Recognition (ASR) is to identify whether two sentences express one or some aspects in common. This task is useful in review summarization where a summarized review needs to cover all aspects as well as avoid redundancy. To facilitate the application of supervised learning models for this task, we construct a dataset ASRCorpus containing two domains (i.e., LAPTOP and RESTAURANT). The four models i.e., Word Average, CNN, LSTM and BiLSTM) are employed to evaluate the performances of machine learning methods. According to the experimental results, the recurrent neural networks are the most efficient methods. We also analysis some typical samples to identify the task's challenges. © 2018 IEEE.},
note = {cited By 2},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Nguyen, Minh Le
A Study on Social Context Summarization Journal Article
In: 2018.
BibTeX | Tags:
@article{nguyen2018study,
  title     = {A Study on Social Context Summarization},
  author    = {Minh Le Nguyen},
  year      = {2018},
  date      = {2018-01-01},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Tran, Van-Khanh; Nguyen, Le-Minh
Adversarial domain adaptation for variational neural language generation in dialogue systems Conference
2018, (cited By 9).
Abstract | Links | BibTeX | Tags:
@conference{Tran20181205,
title = {Adversarial domain adaptation for variational neural language generation in dialogue systems},
author = {Van-Khanh Tran and Le-Minh Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85084071344&partnerID=40&md5=bee0f7ea5bb517077eed2fca51a2dc05},
year = {2018},
date = {2018-01-01},
booktitle = {COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings},
pages = {1205--1217},
abstract = {Domain Adaptation arises when we aim at learning from source domain a model that can perform acceptably well on a different target domain. It is especially crucial for Natural Language Generation (NLG) in Spoken Dialogue Systems when there are sufficient annotated data in the source domain, but there is a limited labeled data in the target domain. How to effectively utilize as much of existing abilities from source domains is a crucial issue in domain adaptation. In this paper, we propose an adversarial training procedure to train a Variational encoder-decoder based language generator via multiple adaptation steps. In this procedure, a model is first trained on a source domain data and then fine-tuned on a small set of target domain utterances under the guidance of two proposed critics. Experimental results show that the proposed method can effectively leverage the existing knowledge in the source domain to adapt to another related domain by using only a small amount of in-domain data. © 2018 COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings. All rights reserved.},
note = {cited By 9},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Tran, Hong Viet; Nguyen, Van Vinh; Vu, Thuong Huyen; Nguyen, Le Minh
Dependency-based Pre-ordering For English-Vietnamese Statistical Machine Translation Journal Article
In: 2018.
BibTeX | Tags:
@article{tran2018dependency,
title = {Dependency-based Pre-ordering For English-Vietnamese Statistical Machine Translation},
author = {Hong Viet Tran and Van Vinh Nguyen and Thuong Huyen Vu and Le Minh Nguyen},
year = {2018},
date = {2018-01-01},
publisher = {H.: ĐHQGHN},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Nguyen, Minh-Tien; Tran, Duc-Vu; Nguyen, Le-Minh; Phan, Xuan-Hieu
Exploiting user posts for web document summarization Journal Article
In: ACM Transactions on Knowledge Discovery from Data, vol. 12, no. 4, 2018, (cited By 4).
Abstract | Links | BibTeX | Tags:
@article{Nguyen2018,
title = {Exploiting user posts for web document summarization},
author = {Minh-Tien Nguyen and Duc-Vu Tran and Le-Minh Nguyen and Xuan-Hieu Phan},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85052604688&doi=10.1145%2f3186566&partnerID=40&md5=fe0600de0f08d9d4493a239a4e43b464},
doi = {10.1145/3186566},
year = {2018},
date = {2018-01-01},
journal = {ACM Transactions on Knowledge Discovery from Data},
volume = {12},
number = {4},
abstract = {Relevant user posts such as comments or tweets of a Web document provide additional valuable information to enrich the content of this document. When creating user posts, readers tend to borrow salient words or phrases in sentences. This can be considered as word variation. This article proposes a framework that models the word variation aspect to enhance the quality of Web document summarization. Technically, the framework consists of two steps: scoring and selection. In the first step, the social information of a Web document such as user posts is exploited to model intra-relations and inter-relations in lexical and semantic levels. These relations are denoted by a mutual reinforcement similarity graph used to score each sentence and user post. After scoring, summaries are extracted by using a ranking approach or concept-based method formulated in the form of Integer Linear Programming. To confirm the efficiency of our framework, sentence and story highlight extraction tasks were taken as a case study on three datasets in two languages, English and Vietnamese. Experimental results show that: (i) the framework can improve ROUGE-scores compared to state-of-the-art baselines of social context summarization and (ii) the combination of the two relations benefits the sentence extraction of single Web documents. © 2018 ACM.},
note = {cited By 4},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Ngo, Thi-Vinh; Ha, Thanh-Le; Nguyen, Phuong-Thai; Nguyen, Le-Minh
Combining Advanced Methods in Japanese-Vietnamese Neural Machine Translation Conference
2018, (cited By 5).
Abstract | Links | BibTeX | Tags:
@conference{Ngo2018318,
title = {Combining Advanced Methods in Japanese-Vietnamese Neural Machine Translation},
author = {Thi-Vinh Ngo and Thanh-Le Ha and Phuong-Thai Nguyen and Le-Minh Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85060402041&doi=10.1109%2fKSE.2018.8573329&partnerID=40&md5=03828bd916b563f2747c16ddb547293b},
doi = {10.1109/KSE.2018.8573329},
year = {2018},
date = {2018-01-01},
booktitle = {Proceedings of 2018 10th International Conference on Knowledge and Systems Engineering, KSE 2018},
pages = {318--322},
abstract = {Neural machine translation (NMT) systems have recently obtained state-of-the-art in many machine translation systems between popular language pairs because of the availability of data. For low-resourced language pairs, there are few researches in this field due to the lack of bilingual data. In this paper, we attempt to build the first NMT systems for a low-resourced language pair: Japanese-Vietnamese. We have also shown significant improvements when combining advanced methods to reduce the adverse impacts of data sparsity and improve the quality of NMT systems. In addition, we proposed a variant of Byte-Pair Encoding algorithm to perform effective word segmentation for Vietnamese texts and alleviate the rare-word problem that persists in NMT systems. © 2018 IEEE.},
note = {cited By 5},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Nguyen, Truong-Son; Nguyen, Le-Minh; Tojo, Satoshi; Satoh, Ken; Shimazu, Akira
Recurrent neural network-based models for recognizing requisite and effectuation parts in legal texts Journal Article
In: Artificial Intelligence and Law, vol. 26, no. 2, pp. 169-199, 2018, (cited By 19).
Abstract | Links | BibTeX | Tags:
@article{Nguyen2018169,
title = {Recurrent neural network-based models for recognizing requisite and effectuation parts in legal texts},
author = {Truong-Son Nguyen and Le-Minh Nguyen and Satoshi Tojo and Ken Satoh and Akira Shimazu},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85044355733&doi=10.1007%2fs10506-018-9225-1&partnerID=40&md5=345553aac7dc5b51e455dcbfa588164c},
doi = {10.1007/s10506-018-9225-1},
year = {2018},
date = {2018-01-01},
journal = {Artificial Intelligence and Law},
volume = {26},
number = {2},
pages = {169--199},
abstract = {This paper proposes several recurrent neural network-based models for recognizing requisite and effectuation (RE) parts in Legal Texts. Firstly, we propose a modification of BiLSTM-CRF model that allows the use of external features to improve the performance of deep learning models in case large annotated corpora are not available. However, this model can only recognize RE parts which are not overlapped. Secondly, we propose two approaches for recognizing overlapping RE parts including the cascading approach which uses the sequence of BiLSTM-CRF models and the unified model approach with the multilayer BiLSTM-CRF model and the multilayer BiLSTM-MLP-CRF model. Experimental results on two Japan law RRE datasets demonstrated advantages of our proposed models. For the Japanese National Pension Law dataset, our approaches obtained an F1 score of 93.27% and exhibited a significant improvement compared to previous approaches. For the Japan Civil Code RRE dataset which is written in English, our approaches produced an F1 score of 78.24% in recognizing RE parts that exhibited a significant improvement over strong baselines. In addition, using external features and in-domain pre-trained word embeddings also improved the performance of RRE systems. © 2018, Springer Science+Business Media B.V., part of Springer Nature.},
note = {cited By 19},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Trieu, Hai-Long; Nguyen, Le-Minh
Enhancing Pivot Translation Using Grammatical and Morphological Information Journal Article
In: Communications in Computer and Information Science, vol. 781, pp. 137-151, 2018, (cited By 0).
Abstract | Links | BibTeX | Tags:
@article{Trieu2018137,
title = {Enhancing Pivot Translation Using Grammatical and Morphological Information},
author = {Hai-Long Trieu and Le-Minh Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85044086566&doi=10.1007%2f978-981-10-8438-6_12&partnerID=40&md5=0f7254ea876ab9e477defe33d3890ac8},
doi = {10.1007/978-981-10-8438-6_12},
year = {2018},
date = {2018-01-01},
journal = {Communications in Computer and Information Science},
volume = {781},
pages = {137--151},
abstract = {Pivot translation can be one of the solutions to overcome the problem of unavailable large bilingual corpora for training statistical machine translation models. Nevertheless, the conventional pivot method, which connect source to target phrases via common pivot phrases, lacks some potential connections when pivoting via the surface form of pivot phrases. In this work, we improve the pivot translation method by integrating grammatical and morphological information to connect pivot phrases instead of using only the surface form. Experiments were conducted on several Southeast Asian low-resource language pairs: Indonesian-Vietnamese, Malay-Vietnamese, and Filipino-Vietnamese. By integrating grammatical and morphological information, the proposed method achieved a significant improvement of 0.5 BLEU points. This showed the effectiveness of integrating grammatical and morphological features to pivot translation. © 2018, Springer Nature Singapore Pte Ltd.},
note = {cited By 0},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Nguyen, Truong-Son; Nguyen, Le-Minh
Nested Named Entity Recognition Using Multilayer Recurrent Neural Networks Journal Article
In: Communications in Computer and Information Science, vol. 781, pp. 233-246, 2018, (cited By 5).
Abstract | Links | BibTeX | Tags:
@article{Nguyen2018233,
title = {Nested Named Entity Recognition Using Multilayer Recurrent Neural Networks},
author = {Truong-Son Nguyen and Le-Minh Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85044017100&doi=10.1007%2f978-981-10-8438-6_19&partnerID=40&md5=f80e0cdc713b2f490531a4967e2dc7f7},
doi = {10.1007/978-981-10-8438-6_19},
year = {2018},
date = {2018-01-01},
journal = {Communications in Computer and Information Science},
volume = {781},
pages = {233--246},
abstract = {Many named entities are embedded in others, but current models just only focus on recognizing entities at the top-level. In this paper, we proposed two approaches for the nested named entity recognition task by modeling this task as the multilayer sequence labeling task. Firstly, we propose a model that integrates linguistic features with a neural network to improve the performance of named entity recognition (NER) systems, then we recognize nested named entities by using a sequence of those models in which each model is responsible for predicting named entities at each layer. This approach seems to be inconvenient because we need to train many single models to predict nested named entities. In the second approach, we proposed a novel model, called multilayer recurrent neural networks, to recognize all nested entities at the same time. Experimental results on the Vietnamese data set show that the proposed models outperform previous approaches. Our model yields the state of the art results for Vietnamese with F1 scores of 92.97% at top-level and 74.74% at the nested level. For English, our NER systems also produce better performance. © 2018, Springer Nature Singapore Pte Ltd.},
note = {cited By 5},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Nguyen, Minh-Tien; Tran, Duc-Vu; Nguyen, Le-Minh
Social context summarization using user-generated content and third-party sources Journal Article
In: Knowledge-Based Systems, vol. 144, pp. 51-64, 2018, (cited By 8).
Abstract | Links | BibTeX | Tags:
@article{Nguyen201851,
title = {Social context summarization using user-generated content and third-party sources},
author = {Minh-Tien Nguyen and Duc-Vu Tran and Le-Minh Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85040514592&doi=10.1016%2fj.knosys.2017.12.023&partnerID=40&md5=5da393634608e4285e058dd9aae24faf},
doi = {10.1016/j.knosys.2017.12.023},
year = {2018},
date = {2018-01-01},
journal = {Knowledge-Based Systems},
volume = {144},
pages = {51--64},
abstract = {In the context of social media, users mutually share their interests of an event mentioned in a Web document. Its content can also be found in different news providers with a writing variation. This paper presents a framework which exploits the support of social context (user-generated content such as comments or tweets and third-party sources such as relevant documents retrieved from a search engine) to extract high-quality summaries. The extraction was formulated in two steps: sentence scoring and selection. The scoring is modeled as a learning to rank problem, which employs Ranking SVM to mutually exploits sentences, user-generated content, and third-party sources in the form of features to cover summary aspects. For the selection, summaries are extracted by using a score-based or voting method. For evaluation, three datasets of sentence and highlight extraction in two languages were taken as a case study. Experimental results indicate that by integrating user-generated content and third-party sources, our framework obtains improvements of ROUGE-scores over state-of-the-art methods for single-document summarization. © 2017},
note = {cited By 8},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Nguyen, Huy; Nguyen, Minh-Le
A Deep Neural Architecture for Sentence-Level Sentiment Classification in Twitter Social Networking Journal Article
In: Communications in Computer and Information Science, vol. 781, pp. 15-27, 2018, (cited By 4).
Abstract | Links | BibTeX | Tags:
@article{Nguyen201815,
title = {A Deep Neural Architecture for Sentence-Level Sentiment Classification in Twitter Social Networking},
author = {Huy Nguyen and Minh-Le Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85044048874&doi=10.1007%2f978-981-10-8438-6_2&partnerID=40&md5=566449df15517a1686f0ce4fd86b5d13},
doi = {10.1007/978-981-10-8438-6_2},
year = {2018},
date = {2018-01-01},
journal = {Communications in Computer and Information Science},
volume = {781},
pages = {15--27},
abstract = {This paper introduces a novel deep learning framework including a lexicon-based approach for sentence-level prediction of sentiment label distribution. We propose to first apply semantic rules and then use a Deep Convolutional Neural Network (DeepCNN) for character-level embeddings in order to increase information for word-level embedding. After that, a Bidirectional Long Short-Term Memory network (Bi-LSTM) produces a sentence-wide feature representation from the word-level embedding. We evaluate our approach on three twitter sentiment classification datasets. Experimental results show that our model can improve the classification accuracy of sentence-level sentiment analysis in Twitter social networking. © 2018, Springer Nature Singapore Pte Ltd.},
note = {cited By 4},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Tran, Van-Khanh; Nguyen, Le-Minh
Semantic Refinement GRU-Based Neural Language Generation for Spoken Dialogue Systems Journal Article
In: Communications in Computer and Information Science, vol. 781, pp. 63-75, 2018, (cited By 4).
Abstract | Links | BibTeX | Tags:
@article{Tran201863,
title = {Semantic Refinement GRU-Based Neural Language Generation for Spoken Dialogue Systems},
author = {Van-Khanh Tran and Le-Minh Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85044096381&doi=10.1007%2f978-981-10-8438-6_6&partnerID=40&md5=269ee643e8c953870350dae493837d4c},
doi = {10.1007/978-981-10-8438-6_6},
year = {2018},
date = {2018-01-01},
journal = {Communications in Computer and Information Science},
volume = {781},
pages = {63--75},
abstract = {Natural language generation (NLG) plays a critical role in spoken dialogue systems. This paper presents a new approach to NLG by using recurrent neural networks (RNN), in which a gating mechanism is applied before RNN computation. This allows the proposed model to generate appropriate sentences. The RNN-based generator can be learned from unaligned data by jointly training sentence planning and surface realization to produce natural language responses. The model was extensively evaluated on four different NLG domains. The results show that the proposed generator achieved better performance on all the NLG domains compared to previous generators. © 2018, Springer Nature Singapore Pte Ltd.},
note = {cited By 4},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Tran, Viet Hong; Vu, Huyen Thuong; Nguyen, Vinh Van; Nguyen, Minh Le
A classifier-based preordering approach for English-Vietnamese statistical machine translation Journal Article
In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9624 LNCS, pp. 74-87, 2018, (cited By 0).
Abstract | Links | BibTeX | Tags:
@article{Tran201874,
title = {A classifier-based preordering approach for English-Vietnamese statistical machine translation},
author = {Viet Hong Tran and Huyen Thuong Vu and Vinh Van Nguyen and Minh Le Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85044422558&doi=10.1007%2f978-3-319-75487-1_7&partnerID=40&md5=7f4928ceba0dee9ce0a41de525b85699},
doi = {10.1007/978-3-319-75487-1_7},
year = {2018},
date = {2018-01-01},
journal = {Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)},
volume = {9624 LNCS},
pages = {74--87},
abstract = {Reordering is of essential importance problem for phrase based statistical machine translation (SMT). In this paper, we propose an approach to automatically learn reordering rules as preprocessing step based on a dependency parser in phrase-based statistical machine translation for English to Vietnamese. We used dependency parsing and rules extracting from training the features-rich discriminative classifiers for reordering source-side sentences. We evaluated our approach on English-Vietnamese machine translation tasks, and showed that it outperform the baseline phrase-based SMT system. © Springer International Publishing AG, part of Springer Nature 2018.},
note = {cited By 0},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Tran, Phuoc; Nguyen, Le; Dinh, Dien
PRE-ORDERING OF “DE PHRASE” IN CHINESE-VIETNAMESE MACHINE TRANSLATION Journal Article
In: ICIC express letters. Part B, Applications: an international journal of research and surveys, vol. 9, no. 10, pp. 983–990, 2018.
BibTeX | Tags:
@article{tran2018pre,
title = {Pre-Ordering of “De Phrase” in {Chinese-Vietnamese} Machine Translation},
author = {Phuoc Tran and Le Nguyen and Dien Dinh},
year = {2018},
date = {2018-01-01},
journal = {ICIC express letters. Part B, Applications: an international journal of research and surveys},
volume = {9},
number = {10},
pages = {983--990},
publisher = {ICIC International},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Tin, Pham Trung; Minh, Nguyen Le
Memory Networks for Fake News Detection Journal Article
In: 2018.
BibTeX | Tags:
@article{tinmemory,
  title     = {Memory Networks for Fake News Detection},
  author    = {Pham Trung Tin and Nguyen Le Minh},
  year      = {2018},
  date      = {2018-01-01},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Ittoo, Ashwin; Minh, Nguyen Le; Tojo, Satoshi
Knowledge & Systems Engineering Journal Article
In: Data and Knowledge Engineering, vol. 114, pp. 1–86, 2018.
BibTeX | Tags:
@article{ittoo2018knowledge,
title = {Knowledge \& Systems Engineering},
author = {Ashwin Ittoo and Nguyen Le Minh and Satoshi Tojo},
year = {2018},
date = {2018-01-01},
journal = {Data and Knowledge Engineering},
volume = {114},
pages = {1--86},
publisher = {Elsevier},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Phan, Anh Viet; Chau, Phuong Ngoc; Nguyen, Minh Le; Bui, Lam Thu
Automatically classifying source code using tree-based approaches Journal Article
In: Data and Knowledge Engineering, vol. 114, pp. 12-25, 2018, (cited By 2).
Abstract | Links | BibTeX | Tags:
@article{Phan201812,
title = {Automatically classifying source code using tree-based approaches},
author = {Anh Viet Phan and Phuong Ngoc Chau and Minh Le Nguyen and Lam Thu Bui},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85026469561&doi=10.1016%2fj.datak.2017.07.003&partnerID=40&md5=ca2c752fa936998718b687bd3e957faf},
doi = {10.1016/j.datak.2017.07.003},
year = {2018},
date = {2018-01-01},
journal = {Data and Knowledge Engineering},
volume = {114},
pages = {12--25},
abstract = {Analyzing source code to solve software engineering problems such as fault prediction, cost, and effort estimation always receives attention of researchers as well as companies. The traditional approaches are based on machine learning, and software metrics obtained by computing standard measures of software projects. However, these methods have faced many challenges due to limitations of using software metrics which were not enough to capture the complexity of programs. To overcome the limitations, this paper aims to solve software engineering problems by exploring information of programs' abstract syntax trees (ASTs) instead of software metrics. We propose two combination models between a tree-based convolutional neural network (TBCNN) and k-Nearest Neighbors (kNN), support vector machines (SVMs) to exploit both structural and semantic ASTs' information. In addition, to deal with high-dimensional data of ASTs, we present several pruning tree techniques which not only reduce the complexity of data but also enhance the performance of classifiers in terms of computational time and accuracy. We survey many machine learning algorithms on different types of program representations including software metrics, sequences, and tree structures. The approaches are evaluated based on classifying 52000 programs written in C language into 104 target labels. The experiments show that the tree-based classifiers dramatically achieve high performance in comparison with those of metrics-based or sequences-based; and two proposed models TBCNN + SVM and TBCNN + kNN rank as the top and the second classifiers. Pruning redundant AST branches leads to not only a substantial reduction in execution time but also an increase in accuracy. © 2017 Elsevier B.V.},
note = {cited By 2},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Phan, Anh Viet; Nguyen, Minh Le; Nguyen, Yen Lam Hoang; Bui, Lam Thu
DGCNN: A convolutional neural network over large-scale labeled graphs Journal Article
In: Neural Networks, vol. 108, pp. 533-543, 2018, (cited By 31).
Abstract | Links | BibTeX | Tags:
@article{Phan2018533,
title = {{DGCNN}: A convolutional neural network over large-scale labeled graphs},
author = {Anh Viet Phan and Minh Le Nguyen and Yen Lam Hoang Nguyen and Lam Thu Bui},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85055479097&doi=10.1016%2fj.neunet.2018.09.001&partnerID=40&md5=cb429f0a80bb6caf44beacc2b75bdb6d},
doi = {10.1016/j.neunet.2018.09.001},
year = {2018},
date = {2018-01-01},
journal = {Neural Networks},
volume = {108},
pages = {533--543},
abstract = {Exploiting graph-structured data has many real applications in domains including natural language semantics, programming language processing, and malware analysis. A variety of methods has been developed to deal with such data. However, learning graphs of large-scale, varying shapes and sizes is a big challenge for any method. In this paper, we propose a multi-view multi-layer convolutional neural network on labeled directed graphs (DGCNN), in which convolutional filters are designed flexibly to adapt to dynamic structures of local regions inside graphs. The advantages of DGCNN are that we do not need to align vertices between graphs, and that DGCNN can process large-scale dynamic graphs with hundred thousands of nodes. To verify the effectiveness of DGCNN, we conducted experiments on two tasks: malware analysis and software defect prediction. The results show that DGCNN outperforms the baselines, including several deep neural networks. © 2018 Elsevier Ltd},
note = {cited By 31},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Yang, Zhen; Chen, Laifu; Nguyen, Minh Le
Regularizing Forward and Backward Decoding to Improve Neural Machine Translation Conference
2018, (cited By 1).
Abstract | Links | BibTeX | Tags:
@conference{Yang201873,
title = {Regularizing Forward and Backward Decoding to Improve Neural Machine Translation},
author = {Zhen Yang and Laifu Chen and Minh Le Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85060372728&doi=10.1109%2fKSE.2018.8573433&partnerID=40&md5=b020bceb0a049e048eab030fd32f74a7},
doi = {10.1109/KSE.2018.8573433},
year = {2018},
date = {2018-01-01},
booktitle = {Proceedings of 2018 10th International Conference on Knowledge and Systems Engineering, KSE 2018},
pages = {73--78},
abstract = {The common recurrent neural network (RNN) based sequential decoder of the neural machine translation (NMT) model can only translate from one direction, which makes the model overfit in the forward direction and leaves backward information of target sentences unexploited. We propose to use a regularization loss to encourage NMT decoder to exploit bidirectional information of target sentences. Beside of forward decoding, we train an extra set of decoding components to predict translation from backward, and use regularization to enforce the forward and backward hidden states at the same time step have connection. During training phase, the forward hidden states can encode future information from the backward hidden states; while during test phase, we only use the enhanced forward decoding components to translate. Our empirical experiments demonstrated that our approach can significantly improve the performance on WMT German-English and English-Chinese translation tasks in terms of NIST and BLEU score. © 2018 IEEE.},
note = {cited By 1},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Doan, Xuan-Dung; Dang, Trung-Thanh; Nguyen, Minh Le
Effectiveness of character language model for vietnamese named entity recognition Conference
2018, (cited By 0).
Abstract | Links | BibTeX | Tags:
@conference{Doan2018157,
title = {Effectiveness of character language model for {Vietnamese} named entity recognition},
author = {Xuan-Dung Doan and Trung-Thanh Dang and Minh Le Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85090187287&partnerID=40&md5=d7832d92e27be876c8cd2efd2c7cb498},
year = {2018},
date = {2018-01-01},
booktitle = {Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, PACLIC 2018},
pages = {157--163},
abstract = {Recently, many studies indicate that character language model can capture syntactic-semantic word features, resulting in state-of-the-art performance in typical NLP sequence-labeling tasks. This paper shows the effectiveness of character language model for Vietnamese Named Entity Recognition by comparing several methods. We evaluate the proposed model on the VLSP 2016 dataset and our own VTNER dataset. Experimental results show that our model is the current state-of-the-art end-to-end system for the task. © 2018 by the authors.},
note = {cited By 0},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Nguyen, Huy Thanh; Nguyen, Minh Le
Effective attention networks for aspect-level sentiment classification Inproceedings
In: 2018 10th International Conference on Knowledge and Systems Engineering (KSE), pp. 25–30, IEEE 2018.
BibTeX | Tags:
@inproceedings{nguyen2018effective,
  title        = {Effective attention networks for aspect-level sentiment classification},
  author       = {Huy Thanh Nguyen and Minh Le Nguyen},
  year         = {2018},
  date         = {2018-01-01},
  booktitle    = {2018 10th International Conference on Knowledge and Systems Engineering (KSE)},
  pages        = {25--30},
  organization = {IEEE},
  keywords     = {},
  pubstate     = {published},
  tppubtype    = {inproceedings}
}
2017
Son, Nguyen Truong; Phan, Viet-Anh; Nguyen, Le Minh
Recognizing entailments in legal texts using sentence encoding-based and decomposable attention models. Inproceedings
In: COLIEE@ ICAIL, pp. 31–42, 2017.
BibTeX | Tags:
@inproceedings{son2017recognizing,
title = {Recognizing entailments in legal texts using sentence encoding-based and decomposable attention models},
author = {Nguyen Truong Son and Viet-Anh Phan and Le Minh Nguyen},
year = {2017},
date = {2017-01-01},
booktitle = {COLIEE@ICAIL},
pages = {31--42},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Nguyen-Van, Hao; Nguyen, Minh; Pham-Nguyen, Loan
An adaptive DC-DC converter for loading circuit of li-ion battery charger Inproceedings
In: 2017 7th International Conference on Integrated Circuits, Design, and Verification (ICDV), pp. 100–103, IEEE 2017.
BibTeX | Tags:
@inproceedings{nguyen2017adaptive,
title = {An adaptive {DC}-{DC} converter for loading circuit of {Li}-ion battery charger},
author = {Hao Nguyen-Van and Minh Nguyen and Loan Pham-Nguyen},
year = {2017},
date = {2017-01-01},
booktitle = {2017 7th International Conference on Integrated Circuits, Design, and Verification (ICDV)},
pages = {100--103},
organization = {IEEE},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Tran, Van-Khanh; Nguyen, Van-Tao; Nguyen, Le-Minh
Enhanced semantic refinement gate for RNN-based neural language generator Conference
vol. 2017-January, 2017, (cited By 0).
Abstract | Links | BibTeX | Tags:
@conference{Tran2017172,
title = {Enhanced semantic refinement gate for {RNN}-based neural language generator},
author = {Van-Khanh Tran and Van-Tao Nguyen and Le-Minh Nguyen},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85043694420&doi=10.1109%2fKSE.2017.8119454&partnerID=40&md5=d3f463e640b3dedd9bb8a232ba1feb26},
doi = {10.1109/KSE.2017.8119454},
year = {2017},
date = {2017-01-01},
booktitle = {Proceedings - 2017 9th International Conference on Knowledge and Systems Engineering, KSE 2017},
volume = {2017-January},
pages = {172--178},
abstract = {Natural language generation (NLG) plays an important role in a Spoken Dialogue System. Recurrent Neural Network (RNN)-based approaches have shown promising in tackling NLG tasks. This paper presents approaches to enhance gating mechanism applied for RNN-based natural language generator, in which an attentive dialog act representation is introduced, and two gating mechanisms are proposed to semantically gate input sequences before RNN computation. The proposed RNN-based generators can be learned from unaligned data by jointly training both sentence planning and surface realization to produce natural language responses. The model was extensively evaluated on four different NLG domains. The results show that the proposed generators achieved better performance on all the NLG domains in comparison to the previous generators. © 2017 IEEE.},
note = {cited By 0},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}