Distributed Data Technologies
Distributed Computing involves the use of many various computer and servers to store data. Hadoop is an example of a distributed data technology. Hadoop was a pioneer of distributed data, using the open software Apache Hadoop was able to connect multiple servers to store various amounts of data at a time lowering the need for one large server to handle data for many. Hadoop, like many other services, allows a user to work with copious amounts of data seamlessly by spanning the information throughout many different storage machines.
In MIS 2502 we often use large data set that are provide for us. Hadoop would allow us to use great amounts of data to run more accurate R scripts and analytics due to the sheer size data we would be able to store. Distributed data services would further enable us to examine data structures to make better predictions and analyze data better. Hadoop changes the way we store and access data increasing efficiency for users like us in MIS 2502.
One of the most prominent users of Hadoop is Amazon. Amazon offers web services to a variety of users, big and small. They use a Distributed data system to offer such amenities to those who need it. By used such services Amazon is able to distribute different amounts of computing power based on how much the user would need at any given time. Distributed data systems allows the seamless change of storage by its use of spreading the information throughout many different servers throughout the world.