This article walks you through an example of how to query, transform, and visualize data from social media. We are going to collect tweets from Twitter, store them in HDFS (Hadoop Distributed File System), and use JAQL and a Java MapReduce application to manipulate and transform the data. Finally, we will visualize the results using a spreadsheet-style […]
InfoSphere Streams radically extends the state of the art in big data processing; it’s a high-performance computing platform that allows users to develop and reuse applications to rapidly ingest, analyze, and correlate information as it arrives from thousands of real-time sources. You can download the 90-day trial version here. Here is the list of products I used in […]