Word pattern prediction using Big Data frameworks
By: Bence Szabari and Attila Kiss
Open Access | Jul 2020

References

[1] G. Erin, Processing time of TFIDF and Naive Bayes on Spark 2.0, Hadoop 2.6 and Hadoop 2.7: Which Tool Is More Efficient?, MSc thesis, National College of Ireland, Dublin, 2016. ⇒52
[2] K. Rattanaopas, S. Kaewkeeree, Improving Hadoop MapReduce performance with data compression: A study using wordcount job, 2017 14th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTICON), IEEE, 2017, pp. 564-567. DOI: 10.1109/ECTICon.2017.8096300 ⇒52
[3] K. M. Lee, C. S. Han, K. I. Kim, S. H. Lee, Word recommendation for English composition using big corpus data processing, Cluster Computing, (2019), 1911-1924. ⇒56, 65
[4] M. Kontagora, H. Gonzalez-Velez, Benchmarking a MapReduce Environment on a Full Virtualisation Platform, The 4th International Conference on Complex, Intelligent and Software Intensive Systems, pp. 433-438. DOI: 10.1109/CISIS.2010.45 ⇒62
[5] M. Bartík, S. Ubik, P. Kubalík, LZ4 compression algorithm on FPGA, 2015 IEEE International Conference on Electronics, Circuits, and Systems (ICECS), IEEE, 2015. DOI: 10.1109/ICECS.2015.7440278 ⇒63
[6] R. Y. Rubinstein, D. P. Kroese, Simulation and the Monte Carlo method, Vol. 10, John Wiley & Sons, 2016. DOI: 10.1002/9781118631980 ⇒63
[7] R. Lenhardt, J. Alakuijala, Gipfeli - high speed compression algorithm, 2012 Data Compression Conference, IEEE, pp. 109-118. DOI: 10.1109/DCC.2012.19 ⇒62
[8] H. Karloff, S. Suri, S. Vassilvitskii, A model of computation for MapReduce, Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms, Society for Industrial and Applied Mathematics, 2010. DOI: 10.1137/1.9781611973075.76 ⇒53
[9] Apache Hadoop, Apache, https://hadoop.apache.org/ ⇒52
[10] Apache Spark, Apache, https://spark.apache.org/ ⇒52, 55
[11] E. Brill, A simple rule-based part of speech tagger, Proceedings of the third conference on Applied natural language processing, Association for Computational Linguistics, 1992. DOI: 10.3115/974499.974526 ⇒52
[12] Apache YARN, Apache, https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html ⇒53
[13] Apache HDFS docs, https://hadoop.apache.org/docs/r1.2.1/ ⇒53
[14] Hadoop Native Library, https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/NativeLibraries.html ⇒61
[15] Project repository, https://gitlab.com/thelfter/word-prediction ⇒64
[16] Spark SQL, https://spark.apache.org/docs/latest/sql-programming-guide.html ⇒55
[17] Stanford part-of-speech tagger, https://nlp.stanford.edu/software/tagger.html ⇒57
[18] Wikipedia dumps, https://dumps.wikimedia.org/ ⇒63
Language: English
Page range: 51 - 69
Submitted on: Jan 31, 2020
Accepted on: Feb 28, 2020
Published on: Jul 16, 2020
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2020 Bence Szabari, Attila Kiss, published by Sapientia Hungarian University of Transylvania
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.