3. Kim JH, Nam HJ, Park HS. Trends in Genomics and Informatics: a statistical review of publications from 2003 to 2018 focusing on the most-studied genes and document clusters. Genomics Inform 2019;17:e25.
6. Genomics & Informatics. Seoul: Korea Genome Organization, 2021. Accessed 2021 Sep 8. Available from: https://genominfo.org/.
17. PubAnnotation. Kashiwa: Database Center for Life Science, 2021. Accessed 2021 Nov 23. Available from: https://pubannotation.org/.
18. Nam HJ, Yamada R, Park HS. Using the PubAnnotation ecosystem to perform agile text mining on Genomics and Informatics: a tutorial review. Genomics Inform 2020;18:e13.
19. Biber D, Conrad S, Reppen R. Corpus Linguistics: Investigating Language Structure and Use. New York: Cambridge University Press, 1998.
20. Forgy E. Cluster analysis of multivariate data: efficiency versus interpretability of classifications. Biometrics 1965;21:768–769.
21. Rajaraman A, Ullman JD. Mining of Massive Datasets. New York: Cambridge University Press, 2011.
22. Manning C, Schütze H. Foundations of Statistical Natural Language Processing. Cambridge: MIT Press, 1999.
23. Koopman R, Wang S. Where should I publish? Detecting journal similarity based on what have been published there. In: IEEE/ACM Joint Conference on Digital Libraries, 2014 Sep 8-12; London, UK.
24. Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. Preprint at: https://arxiv.org/abs/1301.3781 (2013).
25. Huang A. Similarity measures for text document clustering. In: Proceedings of the 6th New Zealand Computer Science Research Student Conference (NZCSRSC2008), 2008 Apr 14; New Zealand.
26. Hinton GE, Roweis S. Stochastic neighbor embedding. In: Advances in Neural Information Processing Systems 15 (NIPS 2002) (Becker S, Thrun S, Obermayer K, eds.), 2002 Dec 9-14, Vancouver, BC, Canada.