Reference Literature¶

This section lists out some of the literature that we consulted in our work in one way or another.

CoreNLP Pipelines¶

Qi, P., Zhang, Y., Zhang, Y., Bolton, J., & Manning, C.D. (2020). Stanza: A Python Natural Language Processing Toolkit for Many Human Languages. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, July 2020 (pp. 101-108).
Nguyen, M.V., Lai, V.D., Veyseh, A.P.B., & Nguyen, T.H. (2021). Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, April 2021 (pp. 80-90).
Vu, T., Nguyen, D.Q., Nguyen, D.Q., Dras, M., & Johnson, M. (2018). VnCoreNLP: A Vietnamese Natural Language Processing Toolkit. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, June 2018 (pp. 56-60).

Model Architectures¶

Segmentation¶

Constituency Parsing¶

Dependency Parsing¶

Pre-trained Models¶

Datasets / Tagsets¶

General¶