Daily Archives: April 17, 2008

Disk is the New Tape

An interesting scenario from Doug Cutting: say you have a terabyte of data on a disk with a 10 ms seek time and 100 MB/s maximum throughput, and you want to update 1% of the records. If you do it with random-access seeks, it …
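To make the scenario concrete, here is a rough back-of-envelope sketch of the two approaches. The disk figures come from the excerpt; the 100-byte record size, the one-seek-per-update cost, and the function names are assumptions made purely for illustration, so the totals are indicative only.

```python
# Back-of-envelope: random-access updates vs. rewriting the whole data set.
# From the excerpt: 1 TB of data, 10 ms seek, 100 MB/s throughput, 1% updated.
# ASSUMPTIONS (not in the excerpt): 100-byte records, one seek per updated
# record, decimal units (1 TB = 10**12 bytes).

DATA_BYTES = 10**12
SEEK_MS = 10
THROUGHPUT_BYTES_PER_SEC = 100 * 10**6
UPDATE_PERCENT = 1
RECORD_BYTES = 100  # assumed record size


def random_access_seconds(record_bytes=RECORD_BYTES, seeks_per_update=1):
    """Time spent seeking if every updated record costs its own seek(s)."""
    records = DATA_BYTES // record_bytes
    updates = records * UPDATE_PERCENT // 100
    return updates * seeks_per_update * SEEK_MS / 1000


def sequential_rewrite_seconds():
    """Time to stream the whole data set once in and once out."""
    return 2 * DATA_BYTES / THROUGHPUT_BYTES_PER_SEC


if __name__ == "__main__":
    print(f"random access:      {random_access_seconds() / 86400:.1f} days")
    print(f"sequential rewrite: {sequential_rewrite_seconds() / 3600:.1f} hours")
```

Under these assumptions the seek-bound approach takes on the order of days, while streaming the entire terabyte through takes a few hours. Changing the assumed record size or the number of seeks per update shifts the exact figures but not the order-of-magnitude gap.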

Posted in Programming | 4 Comments

Continuous Integration for Data

As I told a friend recently, I’m pretty happy with the front-end code of AltLaw. It’s just a simple Ruby on Rails app that uses Solr for search and storage. The code is small and easy to maintain. What I’m …

Posted in Programming