You have been tasked to setup a Hadoop cluster. What are the three things you need to focus on?
Popular posts from this blog
In the just concluded Wimbledon Tennis Tournament IBM showcased some cool technology. IBM SlamTracker™ - a real time statistics and data visualization platform that leverages IBM's predictive analytics technology. It provides an ‘at a glance’ visual representation of a match using scores and statistics and encourages fans to get more involved by interacting with the data to gain deeper insight into the game. IBM SlamTracker™ analyses over eight years of Grand Slam data (over 41 million data points), to identify patterns in players and their styles. Before each match, IBM analyses historical matches between the players (or between players of similar styles if the players in question have not met before). In the last couple of years IBM did trial runs of SecondSight , which for the first time enabled the viewer to track the direction, speed and distance of players as they moved about the court. Data from SecondSight enables displays such as one below. Hopefully
Following is a great resource for any one considering different choices for their "NoSQL" style frame work. As always, "one-size-fits-all" approach does * not * work for NoSQL frameworks. Just as a side note that MongoDB, Cassandra and CouchDB are the top three skills sought out on indeed.com (popular aggregator of job boards) of all the databases in the table below. Having a large pool of talent is always a good thing for your technology choice. Source : International Journal of Database Theory and Application, NoSQL Database: New Era of Databases for Big data Analytics Classification, Characteristics and Comparison, A B M Moniruzzaman and Syed Akhter