Sam PiersonSam Pierson
Railsconf: Building a Mini-Google in Ruby - Ilya Grigorik
edit Posted by Sam Pierson on Tuesday May 05, 2009 at 09:23PM

Ilya's slides are already on the web.

A few random notes:

  • In 1994-1995 term frequency was state of the art in search engine relevancy.
  • State of the art today = TF-IDF = Term Frequency - Inverse Document Frequency
  • http://rubyforge.org/projects/gratr/ graph theory gem - gets slow after 1000 nodes but can manage about a million.
  • Working with math in Ruby is not the best idea. Use GSL with one of the ruby binding gems.