sort

Data Mining Without Hadoop

By Phil Whelan on October 1, 2010

This is a follow-up to my recent blog post on Working With Large Data Sets. That post drew some interest, so I thought it would be a good idea to walk through the methodologies I used to process this data. I titled this “Data Mining Without Hadoop” because I have experience using Hadoop, and although [...]

Posted in Data processing, Hadoop | Tagged data mining, data processing, hadoop, hdfs, high scalability, large data, sort, streaming | 1 Response

Copyright © 2016 Big Fast Blog | Vancouver, BC, Canada