By Phil Whelan on February 14, 2011
In this blog-post Bradford Stephens, Drawn To Scale’s founder, answers a series of technical, business and personal questions to give an overview of what Drawn To Scale is and where it is going. Who are the founders? What is their background, technology and business model? How were they going to manage other people’s big data? Can one tool fit the demands from a broad range of data challenges that different businesses are seeing?
Posted in Interview, NoSQL, Start-Ups | Tagged big data, bradford stephens, business intelligence, cloud-computing, data management, data processing, drawn to scale, ec2, gaming, hadoop, hbase, high scalability, iaas, media, paas, rackspace, social networks, spire, startup |
By Phil Whelan on January 11, 2011
In this post I will define what I believe to be the most important projects within the Apache Projects for building scalable web sites and generally managing large volumes of data.
Posted in Data processing, Web Development | Tagged activemq, apache projects, apache software foundation, asf, cassandra, hadoop, hbase, high scalability, lucene, mahout, rabbitmq, solr, zeromq, zookeeper |
By Phil Whelan on October 1, 2010
This is a follow-up to my recent blog-post on Working With Large Data Sets. That post had some interest, so I thought it would be a good idea to go through the methodologies I had used for processing this data. I entitled this “Data Mining Without Hadoop”, because I have experience using Hadoop, and although [...]
Posted in Data processing, Hadoop | Tagged data mining, data processing, hadoop, hdfs, high scalability, large data, sort, streaming |