Practical Issues using Distributed Computing Environments – Apache Hadoop


  • Toma Cristian Bucharest Academy of Economic Studies, Romania


Apache Hadoop, HTC – High Throughput Computing/HPC – High Performance Computing, distributed computing, map-reduce, distributed file system


The paper presents practical results obtained with sequential standard programming, RPC/RMI mechanism and Apache Hadoop distributed computing platform for a problem of computation time and power that might be used in e-mail text searching. First section is about Distributed Computing technologies and middleware introduction. In second and third section are shown few details about RPC/RMI and Apache Hadoop approaches. The fourth section presents the results of the computation for a classic problem such as word counting from large text files using standard versus remote procedure call versus map-reducing approach. In the end are shown the main advantages of the distributed systems and computing environments.

Author Biography

Toma Cristian, Bucharest Academy of Economic Studies, Romania

Faculty of Cybernetics, Statistics and Economic Informatics

Department of IT&C Technologies


Tom White, Hadoop – The Definitive Guide, O'Reilly Media, 528 pp, ISBN-10: 0596521979, ISBN-13: 978-0596521974, US 2009.

Chuck Lam, Hadoop in Action, Manning Publishing, 325 pp, ISBN-10: 1935182196 , ISBN-13: 978-1935182191, US 2010.

Apache Hadoop Project,

Hadoop Project, HDFS Architecture, available at

Vincent McBurney, So what is better, ETL or ELT?,, Available at:

Gopalan Suresh Raj, A Detailed Comparison of CORBA, DCOM and Java/RMI, available at:

Wikipedia, Distributed computing, available at:

Yahoo, Yahoo! Hadoop Tutorial, available at:

The Outline of Science, Vol. 1 (of 4) by J. Arthur Thomson -

The Notebooks of Leonardo Da Vinci — Complete by Leonardo da Vinci -

James Joyce, Ulysses, available at:

Wikipedia, Cloud Computing, avaialble at:

Google, Map-Reduce Programming, available at:




How to Cite

Cristian, T. (2013). Practical Issues using Distributed Computing Environments – Apache Hadoop. Journal of Mobile, Embedded and Distributed Systems, 5(1), 18-28. Retrieved from